Practical Foundations of Mathematics

Paul Taylor

4.5 A Universal Property: Products

At first sight, the definition of a category is not sufficiently general to interpret l-abstraction or many-argument operations: it seems that it is only suitable for unary ones. In Section 4.3 we effectively extended the definition of category by admitting a list of objects as the source of each morphism; such structures, called multi -categories, were formalised by Joachim Lambek [ Lam89]. But this is the thin end of a wedge: for every new constructor we would need to modify the definition of a category again to make the semantics follow slavishly after the syntax. For example allowing function-types in the source too would be useful to simplify the definition of sums and lists (Remarks 2.3.11 and 2.7.10).

Our strategy will be to keep faith with categories as they are, uncovering their latent structure - what is there already. We shall show that this matches the type-theoretic phenomena of interest. Even graph theory does this in a primitive fashion, by studying features such as bridges, ie edges whose removal would disconnect the graph. Particular nodes and edges may be characterised à la Leibniz (Proposition 2.8.7) by such features - by the way in which the others see them.

But a graph may have many (different) bridges: we need descriptions (Definition 1.2.10) - properties for which the definite article ( the) can be used. We use superlatives such as greatest. These must be justified by comparatives ( greater) with all other objects. Indeed French and Italian ( il più bel prodotto) use the definite article to make superlatives out of comparatives. In a category the comparatives are the morphisms.

The Leibnizian method of description is based on quantification over the whole category, and we shall use either G or Q for the bound variable. This quantification means that universal properties are impredicative definitions; they are (loosely) related to those in Remark 2.8.11.

Variation over the ambient category may instead be expressed as an axiom scheme with an instance for each comparative object (G or Q). The maps a:GŽ X used to test a universal property may be seen as terms or elements of (type) X, so quantifying over the objects G in the category says ``for all terms a:X'' without specifying the parameters G occurring in them. But the definitions must be preserved by substitution for these parameters, cf Lemma 1.6.3.

The terminal object We shall begin by looking closely at the very simplest case. Category theory attracts from mathematicians derisive names such as ``abstract nonsense'' and ``empty-set theory'' because of definitions like this one and the remarks which follow it. Informaticians will be better aware of the need to get base cases right: although they appear trivial, they are where the actual work of a construction gets done, and are also where most program bugs lie. In category theory, any universal construction is the terminal object of some category.

DEFINITION 4.5.1 1 Î obC is a terminator, terminal object or final object if for each object G Î obC there is a unique morphism \terminalproj_G :GŽ 1.

EXAMPLES 4.5.2

(a): A terminal object of a preorder qua category is a greatest element (Definition 3.2.1(d)); truth is the terminal formula under provability.
(b): {*} is the terminal object of Set, and also of Preord, Pos, CSLat , SLat, Lat, DLat, HSL, Heyt, Frm, Dcpo, IPO, Sp, Mon, Gp , CMon and AbGp with the unique structure. Lemma 2.1.9(e) showed that there is exactly one total function, xŽ *.
(c): Ć is the terminal object of Rel and Pfn; ! is now the empty relation.
(d): The empty sequence of types is the terminal object of the category of contexts and substitutions; ! is the empty sequence of terms.
(e): The terminal type in C and JAVA is called void.

REMARK 4.5.3 In Set, for each element a Î X there is a function {*} Ž X selecting a (Lemma 2.1.9(d)), so maps from the terminal object are called global elements. Although this usage came from topology via sheaf theory (Example 8.3.7(a)) its meaning is exactly what might be expected from programming: a term in the global (empty) context. By Corollary 4.3.13, a general map a:GŽ X is a term G\vdash a:X in the arbitrary or local context of the parameters G, so it is called a generalised or local element.

The distinction is well known and important in recursion theory: we say that two functions f,g:NŽ N are numeralwise equal if f(0) = g(0), f(1) = g(1), f(2) = g(2), ..., ie the two composites

omitted diagram environment

agree ( cf a predicate f[n] provable for each n separately), but this is not enough to prove f = g or "x.f[x] (Remark 2.4.9, Theorem 9.6.2).

Proposition 4.2.9 showed that every category has a faithful covariant action H_X on its generalised elements (see also Theorem 4.8.12).

DEFINITION 4.5.4 If G Ě obC is such that the restriction of the action to (\G_X = Č_{G Î G}C(G,X):X Î obC) remains faithful then it is a class of generators. Classically this is often stated as follows: if f š g:X\rightrightarrows Y in C then there are some G Î G and a:GŽ X with fo a š go a. To say that there is some set G of generators is a widely applicable way of restricting the size of a category ( cf locally small, Remark 4.1.10).

EXAMPLES 4.5.5

(a): In Set the singleton G = {1} suffices; a category for which {1} is a set of generators is sometimes called well pointed.
(b): Pos is well pointed in the weak sense that Pos(1,-):PosŽ Set is faithful, but this functor doesn't reflect invertibility. Theorem 4.7.13 needs this stronger property, which G = {1,{^ < T}} does satisfy.
(c): A well pointed semantics for the l-calculus is called a model, where the word algebra is used for the general case.

Some writers have specialised the term concrete category to this case, where the action on global elements is faithful. The standard usage of this term is that there is some faithful action. For example CSLat is manifestly described ``concretely,'' but CSLat(1,X) always has exactly one element, so {1} cannot generate. There are other categories which we would like to call concrete but in which there are few maps 1Ž X, or which don't have a terminal object at all. What ``concrete'' really means is that ``the action you first thought of'' is faithful, so neither definition is particularly satisfactory. However, there is no real need for any of this terminology. See also Exercise 4.12.

Unique up to unique isomorphism The account of the theory of descriptions which we gave in Section 1.2 was more liberal than Russell's, allowing interchangeable objects to share the description. This is the nature of universal properties.

THEOREM 4.5.6 The terminal object, if it exists, is unique up to unique isomorphism. Conversely, any object which is isomorphic to a terminal object is itself terminal.

PROOF: Let \typeT₁ and \typeT₂ be terminal objects. Then putting \typeT₁ for 1 and \typeT₂ for G, there is a unique map u:\typeT₂Ž \typeT₁, and with the opposite assignment a unique map |:\typeT₁Ž \typeT₂.

To show that these are mutually inverse, consider \id_\typeT₁ and |;u. These are both maps \typeT₁Ž \typeT₁, but casting \typeT₁ in both roles (as 1 and as G), there is only one such map and so the two candidates for it must be the same, cf Lemma 1.2.11. So |;u = \id_\typeT₁ and similarly u;| = \id_\typeT₂.

omitted diagram environment

Finally, let \typeT₁ be terminal and suppose u and | form an isomorphism with \typeT₂. Then for any object G, the unique map GŽ \typeT₁ extends by | to \typeT₂, but if a,b:G\rightrightarrows \typeT₂ then a;u = b;u since \typeT₁ is terminal, and then

a = a;u;| = b;u;| = b since

u;| = \id_\typeT₂. []

Notice how the uniqueness of maps to 1 enabled us to deduce the equality of id and u;| and hence transfer known properties from one to the other. Mere existence of morphisms between objects is nothing like as useful.

Products The cartesian product can also be described in this way. It is what we use in type theory to describe functions of two variables.

DEFINITION 4.5.7 Let P, X and Y be objects of a category C. Then (p₀:PŽ X,p₁:PŽ Y) is a product if for every object G Î C and pair of morphisms (a:GŽ X,b:GŽ Y) in C there is a unique mediating map f:GŽ P such that p₀o f = a and p₁o f = b. The product is written Xx Y, and the mediator f, which is called the pair , is written a,b.

omitted diagram environment

Like the terminal object, the product together with the projections, p₀ and p₁, is unique up to unique isomorphism. Conversely, any isomorphic object u:Q ş Xx Y may be given the structure of the product, with projections u;p₀ and u;p₁ and pairs u^-1;a,b.

The p notation signifies selection ( 4.5.8(f)), but we often want to specify the omission of a component (4.5.8(i)); for this we use the hat notation ([^(x)]) and the triangle arrowheads. Beware that the latter indicate a type-theoretic interpretation and not an intrinsic property such as surjectivity.

EXAMPLES 4.5.8

(a)

A product in a poset or preorder is a meet (Definition 3.2.4(b), Proposition 3.2.11) and in particular is conjunction for provability of formulae (Definition 1.4.2).

(b)

The product in Set is the cartesian product (Remark 2.2.2) with its usual projection functions. Given a:GŽ X and b:GŽ Y, the mediator is a,b:[(z)\vec]Ž a([(z)\vec]),b([(z)\vec]), where

G = [[(z)\vec]: [(Z)\vec]].

(c)

The componentwise operations on the cartesian product give the categorical product in SLat, Lat, DLat, HSL, Heyt, BA, Frm and Gp.

(d)

The componentwise order gives the product in Pos, Preord, SLat, CSLat, Dcpo and IPO by Propositions 3.5.1 and 3.5.2.

(e)

This is also the componentwise order in Lat, DLat, HSL, Heyt and Frm by Proposition 3.2.11.

(f)

A record consists of data assignments t(i) Î \typeX_i to each field i Î I. The type of records whose fields are of given types is the product of those types. A selector is a component projection p_i.

(g)

Let C be the monoid of primitive recursive functions, qua category with one object called N. Then NxN ş N in C.

(h)

Remark 2.2.2 gave the type-theoretic rules for pairing,

omitted prooftree environment

though this only defines the product for two types, not for objects of Cn^×_L, which are contexts.

(i)

For two disjoint contexts [[(x)\vec]:[(X)\vec]]x[[(y)\vec]:[(Y)\vec]], the product is their concatenation [[(x)\vec]:[(X)\vec],[(y)\vec]:[(Y)\vec]], and the projections p₀ and p₁ are the sub-sequences of variables [[(x)\vec]: = [(x)\vec]] and [[(y)\vec]: = [(y)\vec]]. The pair [[(x)\vec]: = [(a)\vec]],[[(y)\vec]: = [(b)\vec]] is also the concatenation [[(x)\vec]: = [(a)\vec],[(y)\vec]: = [(b)\vec]] (Corollary 4.3.13(d)).

See also Exercises 4.17, 4.21, 4.22, 5.22 and 5.30.

REMARK 4.5.9 The product X×_C Y in an abstract category and the cartesian product of hom-sets in Set are related by the equation

C(G,X)×_SetC(G,Y) ş C(G,X×_C Y).

This is a bijection a,bŤ f because ( cf Corollary 4.3.13(d))

p₀a,b = a p₁a,b = b p₀(f),p₁(f) = f.

Notice that 1 has just one generalised element GŽ 1, whilst the set of elements of Xx Y is the cartesian product of those of X and of Y.

Preservation and creation of products There is no need to make a choice of products to define what it means for a functor to preserve them: if one product P is preserved then so is any other Q ş P.

DEFINITION 4.5.10 Let X,Y Î obC. A functor U:CŽ D

(a)

preserves the product of X and Y if, whenever X\gets PŽ Y obeys the universal property defining a product cone in C, then so does UX\gets UPŽ UY, but in D ( cf Definition 3.2.6 for posets);

(b)

preserves this product on the nose if choices of product cones have been made in both categories and U takes one choice to the other;

(c)

creates the product if

for each product cone UXp₀Ź Q p₁ Ž UY which exists in D,
there is a cone Xa\gets PbŽ Y in C with an isomorphism omitted diagram environment in D, such that this diagram is unique up to unique isomorphism (as such, without the next condition),
and moreover Xa\gets PbŽ Y is a product cone in C.

So if U is a forgetful functor there is a unique structure that can be put on the product of the underlying sets and is consistent with the structure of the given objects.

EXAMPLES 4.5.11 Recall Examples 4.4.9(f) and (g).

(a): The forgetful functor LatŽ Set (or Mod(L)Ž Set for any single-sorted algebraic theory L) creates products: once we have found the product of the carriers in a diagram of lattices, there is only one structure such that the projections are homomorphisms.
(b): A functor creates unary products ( sic) iff it reflects invertibility, cf Definition 4.4.8(e).
(c): The forgetful functor SpŽ Set preserves products, but it does not create them: apart from the Tychonov topology (which gives the categorical product), the projections are also continuous when the product of the underlying sets is given the discrete topology. []

Using the existence of products To say that a category C ``has products'' is a statement of the form

"X,Y:obC. $P.$p₀,p₁. Xp₀Ź P p₁Ž Y is a product cone,

which is proved and used by the rules for " and $ in Section 1.5.

REMARK 4.5.12 In particular, whenever we have two particular objects A and B, we may (instantiate the universal quantifiers and) suppose that some product cone AŹ PŽ B is at our disposal. The way that we do this is just the same as the idiom for ($E ) in any other circumstances (Remark 1.6.5): it is a formal property of the existential quantifier that we may invent a name for such a cone and continue to use it for as long as the given A and B remain in scope. Nothing in this procedure assumes a global assignment of products to all pairs of objects in the category.

Suppose, on the other hand, that an object P has already been introduced by some other means, and that we can show that it possesses the universal property needed to be a product. Suppose also that no other name has so far been given to any product of X and Y. Then, since being a product is a description up to isomorphism (Lemma 1.2.11, Theorem 4.5.6), we may write P = XxY, electing P as the specified product, instead of P ş XxY.

Usually in category theory, as throughout mathematics in action, the quantifiers do their entrances and exits unannounced. The process of inventing names for witnesses for existential statements may be repeated any finite number of times (and that number may be unspecified: we may use products of pairs of objects from a list of indeterminate length, Exercise 2.43). In a sense it may even be done infinitely: if, by methods such as in Chapter VIII, we have some way of collecting an ``indexed family'' as one object then the uniqueness feature of universal properties means that one application of a product construction suffices to provide it uniformly for all members of the family.

The point at which such methods break down is where we want to make some construction (usually a functor) globally throughout the category. There are three ways to proceed. Classically, the axiom of choice selects products once and for all. Logically, such constructions may be regarded as schemes : to be instantiated to each situation as required. The third way is to replace the category itself with an equivalent (interchangeable) one on which products are indeed defined globally. We shall show how to do this in Section 7.6, without using the axiom of choice.

Universal properties give functors We see why it is important for mediators to be unique when we extend universal constructions to maps.

PROPOSITION 4.5.13 Let U be an object of a category C for which all products Xx U for X Î obC exist and are specified. Then there is a unique functor CŽ C whose effect on objects is XŽ Xx U and such that (fx U);p₀ = p₀;f and (fx U);p₁ = p₁. It is called (-)x U.

omitted diagram environment

PROOF: The equations amount to one side of the commutative diagram. Clearly the universal property of products provides a unique fill-in, but it still remains for us to check that this is a functor. In the case f = \id_X, the identity \id_{Xx U} is such a fill-in, and so by uniqueness this must be it: (\id_Xx U) = \id_{Xx U}. Similarly, ignoring Yx U, there is a unique fill-in (f;g)x U:Xx UŽ Zx U, but (fx U);(gx U) also makes the diagram commute, and so by uniqueness they are equal. []

There is similarly a functor x:CxCŽ C of two arguments which yields the product of an arbitrary pair of objects or morphisms.

The terminal object and binary product are the nullary and binary cases of an n-ary connective on objects, but as usual they suffice.

PROPOSITION 4.5.14 If a category has a terminal object and a product for every pair of objects then it has a product for every finite list of objects. []

REMARK 4.5.15 The cartesian products Xx(Yx Z) and (Xx Y)x Z of sets are not equal: their typical elements are x,y,z and x,y,z respectively. Although what these products are is different, what they do is the same: they both satisfy the universal property of a product of three objects, and so they must be uniquely isomorphic. That is why definition up to isomorphism is the (accepted) norm, and definition up to equality is meaningless.

Set has a canonical way of assigning binary products, but we have just seen that this is not associative. By convention, we shall take the left-associated product (···(((1x \typeX₁)x\typeX₂)x\typeX₃)x···x\typeX_n-1)x\typeX_n, but notice that this depends on a notion of atomic type. We use it in Remark 4.6.5 to define [[G,x:X]] = [[G]] x X inductively and hence to interpret expressions in algebras and programming languages.

All universal properties are terminal objects

LEMMA 4.5.16 Let X,Y Î obC. Then there is a category, which we call CŻ X,Y, whose objects are spans, ie pairs of C-maps X aŹ U bŽ Y, and whose morphisms from this are C-maps f:UŽ U˘ making the two triangles below commute.

omitted diagram environment

The terminal object (if it exists) of this category is the product cone for X and Y in C.

PROOF: As with the ``concrete'' categories of Proposition 4.1.4, to show that CŻ X,Y is a category we just have to verify that (the identity \id_U is a morphism and) composition preserves the defining property. This is easy, but note that it uses the associativity law in C:

a = f;a˘ = f;(g;a˘˘) = (f;g );a˘˘.

Comparing the definitions of CŻ X,Y and Xx Y shows that the terminal object in CŻ X,Y is the product cone in C. []

Mainstream mathematicians tend to view this as a rather abstruse way of saying something quite simple, but the method is like programming with abstract data types. The complexity of the constructions (in this case the commutativity of the triangles in the above diagram) is hidden from the user and later applications work in an apparently effortless way, but at a certain price. The components (modules) must be fully equipped with their ancillary operations and specifications: in this case the product projections and the associativity law of the given category, which are needed to make the definitions of respectively the objects and maps meaningful. What seems to be secondary, even optional, structure in the data turns out to be primary and necessary structure in the derived constructions; conversely ``trivial'' features of the derived construction unravel into something non-trivial in the given presentation.

In order to see this ancillary structure in action, you should demonstrate in detail that the product of two objects (if it exists) in a category is unique up to unique isomorphism, both directly in terms of the definition and indirectly via the category CŻ X,Y.

The type disciplines of category theory and the abstract data type idiom in programming often ensure that the ancillary structure is complete in the following sense: When the Real Mathematician - and this applies equally to the Real Programmer - is up to his elbows in the grime of a difficult procedure, he frequently needs to prove (or to leave to the reader) lemmas and sub-lemmas (respectively to program sub-routines or perform in-line hacks) which import the notation of the immediate application but in fact only reproduce in a multitude of special cases that ancillary structure which he had disregarded as trivial. By contrast, the categorist begins by applying the parsimonious categorical tools which she has learned to trust. She then approaches applications with the calm self-confidence of one who knows that she is prepared for the occasion.

We shall collect more examples of universal properties in Chapter VII.