Math 202A: Lecture 1

As in common parlance, a mathematical category \mathbf{Cat} consists firstly of a collection \mathrm{Ob}(\mathbf{Cat}) of objects which are distinct but similar in some way. In category theory the relationships between the objects of \mathbf{Cat} are morphisms, and for each pair of objects X,Y we have a corresponding collection of morphisms \mathrm{Hom}(X,Y). Morphisms in \mathrm{Hom}(X,X) are called endomorphisms, and the collection \mathrm{Hom}(X,X) itself is denoted \mathrm{End}(X). Morphisms can be composed: given objects X,Y,Z and morphisms f \in \mathrm{Hom}(X,Y) and g \in \mathrm{Hom}(Y,Z) we have a corresponding morphism g \circ f \in \mathrm{Hom}(X,Z). Composition is both associative and unital. Associativity means that for any morphisms such that h \circ (g \circ f) and (h \circ g) \circ f both make sense, they in fact coincide. Unitarity means that for every pair of objects X,Y there exist morphisms \mathrm{id}_X \in \mathrm{Hom}(X,X) and \mathrm{id}_Y \in \mathrm{Hom}(Y,Y) such that

f \circ \mathrm{id}_X = f =\mathrm{id}_Y \circ f

holds for all morphisms f \in \mathrm{Hom}(X,Y). We say that two objects X,Y are isomorphic if there exist morphisms f \in \mathrm{Hom}(X,Y) and g \in \mathrm{Hom}(Y,X) such that

g \circ f = \mathrm{id}_X \quad\text{and}\quad f \circ g = \mathrm{id}_Y.

We denote by \mathrm{Iso}(X,Y) the subcollection of \mathrm{Hom}(X,Y) consisting of isomorphisms. Thus, X and Y are isomorphic if \mathrm{Iso}(X,Y) is a nonempty collection. Isomorphism is a weaker notion than equality, since \mathrm{Iso}(X,Y) may contain multiple isomorphisms. We write \mathrm{Aut}(X) = \mathrm{Iso}(X,X) and call morphisms in \mathrm{Aut}(X) automorphisms of X.

Problem 1.1. Show that \mathrm{Aut}(X) is nonempty.

You might ask: what is the point of the category formalism introduced above? My preferred answer to this question is another question: what is the point of anything? This ontological response is sincere but runs the risk of being perceived as facetious. A more practical answer is that category theory is provides a very general and flexible way to organize objects and relationships, and is applied as such in the sciences. For more about this you can browse the Topos Institute website, or watch the short video below.

The fundamental example of a category is the category \mathbf{Set} whose objects are sets and whose morphisms are functions. The law of composition in \mathbf{Set} is composition of functions.

Problem 1.2. Prove that two sets X,Y are isomorphic if and only if there exists a bijective function f \colon X \to Y. 

Looking at the category \mathbf{Set} calls attention to the use of the term “collection” in the general definition of a category. This term is used because the collection \mathrm{Ob}(\mathbf{Set}) of all sets is not itself a set and attempting to view it as such leads to paradoxes. Thus, in order to think about categories in general we need to consider aggregations of objects which are in some sense larger than sets, and we refer to these as collections. Here you may feel the urge to object that the term “collection” has not been clearly defined. However, unless you also took issue with the use of the word “set” during your undergraduate studies, you forfeited the right to this objection long ago and I will happily ignore it. Of course, Math 202A does not occupy all of reality and you are welcome to examine these foundational issues yourself in its complement.

Having pointed out that the collection of all sets is a somewhat subtle concept, let us point out that the collections \mathrm{Hom}(X,Y) which make up the morphisms are not so bad: they can be considered as sets without running into any foundational issues. Indeed, if you believe that X and Y being sets is enough to certify

X \times Y = \{(x,y) \colon x \in X,\ y \in Y\}

as a set, than there is no further issue because a function f \colon X \to Y is by definition a subset f \subseteq X \times Y with the property that f contains exactly one point of the form (x,*) for each x \in X. Thus, the collection of all functions \mathrm{Hom}(X,Y) is contained in the collection of all subsets of the set X \times Y, and hence is contained in a set.

The upshot is that categories are more general than sets, with \mathbf{Set} being just one example of a category. A helpful heuristic to have in mind is that categories are like directed graphs, possibly very large ones, with objects being vertices and morphisms being directed edges. Composing morphisms is analogous to concatenating arrows. A subcategory should then be like a subgraph, and a function from one category to another should be like a graph homomorphism. We will come back to this.

Definition 1.1: Combinatorics is the study of the category \mathbf{FSet} whose objects are finite sets and whose morphisms are functions.

From the general perspective of Definition 1, the typical problems of combinatorics can be formulated along the following lines. First there is the counting problem: given a finite nonempty set X defined in some explicit but not transparent way (for example, the set of alternating sign matrices), what is the natural number n \in \mathbb{N} such that X is isomorphic to \{1,\dots,n\}? The study of questions of this kind is called enumerative combinatorics. Another type of question is: given two finite nonempty sets X,Y, again defined in some explicit but non-transparent way, is \mathrm{Iso}(X,Y) nonempty? If so, can we explicitly describe an isomorphism f \in \mathrm{Iso}(X,Y)? This endeavor is usually called bijective combinatorics.

The category \mathbf{FSet} is a subcategory of \mathbf{Set}. Hoping that the collection of all finite sets can be thought of as a set without running into paradoxes is a bit too optimistic (to see why, observe that for any set X, the set \{X\} is finite…). However, the set \mathrm{Hom}(X,Y) of functions between two finite sets is finite, and more than this functions between finite sets can be efficiently encoded as matrices. Indeed, suppose |X|=n and |Y|=m. By definition, this means that there exist isomorphisms

\lambda \colon \{1,\dots,n\} \longrightarrow X

and

\mu \colon \{1,\dots,m\} \longrightarrow Y.

It is natural to think of these isomorphisms as labelings of the elements of X and Y, writing

x_1=\lambda(1),\ \dots,\ x_n=\lambda(n)

and

y_1 = \mu(1),\ \dots,\ y_m=\mu(m).

With these labelings in place, we can represent the function f as the m \times n matrix [f] = [f]_{\lambda\mu} whose (i,j)-entry is 1 if f(x_j)=y_i and 0 if not. Thus, every f \in \mathrm{Hom}(X,Y) is represented by an element of \mathbb{Z}_2^{m \times n}, i.e. an m \times n matrix with entries in \mathbb{Z}_2=\{0,1\}. The cardinality of all such binary matrices in |\mathbb{Z}_2^{m \times n}|=2^{mn}, but the cardinality of \mathrm{Hom}(X,Y)| is only m^n = 2^{\log_2(m)n} because the matrix representation of a function has exactly one 1 in each column. It is important to note that the matrix representation of a given function f \in \mathrm{Hom}(X,Y) is not unique because it depends on the labelings \lambda,\mu. In particular, the number of distinct matrix representations of a given function f \in \mathrm{Hom}(X,Y) is at most m!n!.

Since every finite set is a set, every object of \mathbf{FSet} is an object of \mathbf{Set}. Moreover, for any two finite sets X,Y the collection of morphisms \mathrm{Hom}(X,Y) is the same whether we view X,Y as objects of \mathbf{Set} or as objects of \mathbf{FSet}. We say that \mathbf{FSet} is a full subcategory of \mathbf{Set} to indicate that no morphisms are lost. Contrast this with the category \mathbf{FSet}_* whose objects are finite sets and whose morphisms are injective functions; this is a subcategory of \mathbf{Set} but not a full subcategory. Indeed, \mathbf{FSet}_* is not even a full subcategory of \mathbf{FSet} — it has the same collection of objects but a restricted collection of morphisms. One says that \mathbf{FSet}_* is a wide subcategory of \mathbf{FSet}.

Problem 1.3. Verify that \mathbf{FSet}_* really is a category, i.e. obeys the axioms stipulated above. Calculate |\mathrm{Hom}(X,Y)| and |\mathrm{Hom}(Y,X)|.

Continuing onward, most courses you took as an undergraduate can be defined as the study of a particular category. Here is another example.

Definition 1.2. Group theory is the study of the category \mathbf{Grp} whose objects are groups and whose morphisms are group homomorphisms.

Groups form an extremely important and fundamental category because they occur in every category, more or less. The “more or less” is due to the fact that a group is, by definition, a set. A category \mathbf{Cat} is said to be locally small if \mathrm{Hom}(X,Y) is a set for every pair of objects X,Y \in \mathrm{Ob}(X,Y). In particular, every subcategory of \mathbf{Set} is a locally small category.

Problem 1.4. Prove that in any locally small category, \mathrm{Aut}(X) is a group for every object X.

Thus, every locally small category gives us a group for each of its objects. The subject known as representation theory is concerned with reversing this observation by representing the elements of a given group G as automorphisms in a preferred locally small category \mathbf{Cat}. For any object X \in \mathrm{Ob}(\mathbf{Cat}) we can consider actions of G on X, which by definition are group homomorphisms

\varphi \colon G \longrightarrow \mathrm{Aut}(X)

Now we can consider all ways in which G can act on the objects of \mathbf{Cat}, and assemble these into a new category \mathbf{Cat}_G whose objects are pairs (X,\varphi) as above. Such pairs are called representations of G, and for every representation \varphi(G)=\{\varphi(g) \colon g \in G\} is a subgroup of \mathrm{Aut}(X). Thus, we are representing the elements of G as automorphisms of the object X. This might result in losing some of the structure of G, since several group elements could map to the same automorphism of X. For example, the extreme case is the trivial representation where \varphi(g)=\mathrm{id}_X for every g \in G. In the opposite extreme case, where \varphi is injective and \varphi(G) is a subgroup of \mathrm{Aut}(X) isomorphic to G, the we say that (X,\varphi) is a faithful representation of G.

It remains to define the morphisms of our nascent category \mathbf{Cat}_G. For any two representations (X,\varphi) and (Y,\psi) we define the corresponding set of morphisms to be the subset \mathrm{Hom}_G(X,Y) of \mathrm{Hom}(X,Y) consisting of so-called G-equivariant morphisms f \in \mathrm{Hom}(X,Y). By definition, these are morphisms which intertwine the actions \varphi,\psi in the sense that

f \circ \varphi(g) = \psi(g) \circ f

holds for every group element g \in G.

Let us consider the above construction in the case where \mathbf{Cat}=\mathbf{Set}. For a set X, the group \mathrm{Aut}(X) is called the permutation group of \mathbf{X} and its elements are called permutations of X. Consequently, objects (X,\varphi) of \mathbf{Set}_G are called permutation representations of \mathbf{G}. Now, since a group is by definition a set, G is itself an object of \mathbf{Set}. Furthermore, there is a canonical action

\rho \colon G \longrightarrow \mathrm{Aut}(G)

of G on itself defined by \rho(g)h=gh, where g,h are elements of G and gh is the product of these elements in G. This gives a permutation representation (G,\rho) called the regular representation of G.

Problem 1.5. Prove that \rho as defined above really is a group homomorphism, and that \rho(g) really is a set-theoretic automorphism of G for every g \in G. Finally, prove Cayley’s theorem: the groups G and \rho(G) are isomorphic.

Cayley’s theorem is a very familiar result from elementary group theory which, in the language introduced above, says that every group has a faithful representation in the category of sets. Representations of groups on the category of vector spaces are called linear representations and we will study these in Math 202B. In Math 202A, we have a much more modest task, namely that of studying the category of vector spaces as is, without any group acting in the background.

Definition 1.3. Linear algebra is the study of the category \mathbf{Vec} whose objects are vector spaces and whose morphisms are linear transformations.

Technically, Definition 1.3 is incomplete because we must specify the field over which our vector spaces are defined. We will do this next lecture, where we further argue that finite-dimensional linear algebra can be viewed as a “quantization” of combinatorics.

Leave a Reply