Quantcast
Channel: June 2013 – Annoying Precision
Viewing all articles
Browse latest Browse all 4

Operations and Lawvere theories

$
0
0

Groups are in particular sets equipped with two operations: a binary operation (the group operation) (x_1, x_2) \mapsto x_1 x_2 and a unary operation (inverse) x_1 \mapsto x_1^{-1}. Using these two operations, we can build up many other operations, such as the ternary operation (x_1, x_2, x_3) \mapsto x_1^2 x_2^{-1} x_3 x_1, and the axioms governing groups become rules for deciding when two expressions describe the same operation (see, for example, this previous post).

When we think of groups as objects of the category \text{Grp}, where do these operations go? They’re certainly not morphisms in the corresponding categories: instead, the morphisms are supposed to preserve these operations. But can we recover the operations themselves?

It turns out that the answer is yes. The rest of this post will describe a general categorical definition of n-ary operation and meander through some interesting examples. After discussing the general notion of a Lawvere theory, we will then prove a reconstruction theorem and then make a few additional comments.

The general definition

Since operations are functions involving underlying sets, the appropriate categorical setting here is not a bare category C but a concrete category, namely a category equipped with a faithful functor U : C \to \text{Set}, which we will think of as the underlying set (forgetful) functor.

What should “operations on the objects in a concrete category” mean, then? Well, an n-ary operation is in particular a family of functions

\displaystyle \eta_c : U(c)^n \to U(c)

for all objects c \in C. Morphisms ought to preserve operations, hence if f : c \to d is a morphism then we ought to have

\displaystyle \eta_d \circ U(f)^n = U(f) \circ \eta_c.

But this is precisely the condition that \eta_c defines a natural transformation between the functors U^n and U! Hence we have our definition: an n-ary operation on the objects of a concrete category (C, U) is a natural transformation U^n \to U.

The following straightforward theorem will determine what the n-ary operations are in many familiar concrete categories.

Theorem: Let (C, U) be a concrete category with finite coproducts such that U is representable by an object u \in C. Then the n-ary operations on objects of C can be naturally identified with morphisms u \to u \sqcup ... \sqcup u (the coproduct of n copies of c), or equivalently with elements of the underlying set U(u \sqcup ... \sqcup u).

Corollary: In particular, the above conclusion holds if U has a left adjoint.

Proof. Under the hypotheses, it follows that U^n is represented by u \sqcup ... \sqcup u, and then the conclusion follows from the Yoneda lemma. \Box

Proof of corollary. If U has a left adjoint F : \text{Set} \to C, then \text{Hom}(F(1), c) \cong \text{Hom}(1, U(c)) \cong U(c), from which it follows that F(1) represents U. Note that in this case F(n) represents U^n, where n denotes an n-element set, so we get that n-ary operations can be identified with elements of the underlying set U(F(n)). \Box

Examples. Let C = \text{Grp}, \text{Ab}, \text{Ring}, \text{CRing}. In each of these familiar algebraic examples the usual forgetful functor has a left adjoint, from which it follows that the n-ary operations can be identified with the underlying set of the free object on n elements in each case, namely F_n, \mathbb{Z}^n, \mathbb{Z} \langle x_1, ... x_n \rangle, \mathbb{Z}[x_1, ... x_n]. In each case this recovers the n-ary operations in the usual sense (namely all of the operations one can write down starting from the operations given by the axioms). For example, associated to any element (c_1, ... c_n) \in \mathbb{Z}^n is the operation

\displaystyle A^n \ni (a_1, ... a_n) \mapsto \sum_{i=1}^n c_i a_i \in A

on abelian groups, and associated to any polynomial f \in \mathbb{Z}[x_1, ... x_n] is the operation

\displaystyle R^n \ni (r_1, ... r_n) \mapsto f(r_1, ... r_n) \in R

on commutative rings.

Example. Let C = M\text{-Set} be the category of M-sets for M a monoid. Again the usual forgetful functor has a left adjoint, so the n-ary operations can be identified with the underlying set of the free object on n elements, namely M \sqcup ... \sqcup M (where M appears n times). Unlike the above cases, in this case the only interesting operations are the unary operations given by multiplication by elements of M: all the n-ary operations are given by a projection map U^n \to U followed by a unary operation.

Example. Let C = R\text{-Mod} be the category of R-modules for R a ring. Again the usual forgetful functor has a left adjoint, so the n-ary operations can be identified with the underlying set of the free object on n elements, namely R^n. An element (r_1, ... r_n) \in R^n gives rise to an operation

\displaystyle M^n \ni (m_1, ... m_n) \to \sum_{i=1}^n r_i m_i \in M

in a natural generalization of the case C = \text{Ab}. There would not be much to learn from this case except that there are various ways to define a category equivalent to R\text{-Mod} for R a ring without directly specifying R. For example, the category of representations of a group G over a field k is equivalent to k[G]\text{-Mod}, and similarly the category of representations of a Lie algebra \mathfrak{g} over a field k is equivalent to U(\mathfrak{g})\text{-Mod}. In both cases thinking about n-ary operations reveals the presence of a ring whose elements give us additional operations we may not otherwise have noticed: in a representation of G we can multiply by any sum of elements of G rather than just by elements of G, and similarly in a representation of \mathfrak{g}. In the case of finite groups this lets us define useful operations like multiplication by \frac{1}{|G|} \sum_{g \in G} g, and in the case of Lie algebras this lets us define useful operations like multiplication by Casimir elements.

Reflecting on the above examples, they should strike you as unsurprising in hindsight: “free object on n elements” should mean nothing other than “anything I can get by applying an n-ary operation to n elements.”

This answers a question which used to bug me, namely “in the standard commutative algebra description of how polynomials work, where did polynomial composition go?” The answer is that to recover composition of, say, polynomials in one variable it suffices to identify k[x] with natural transformations U \to U where U : k\text{-Alg} \to \text{Set} is the forgetful functor. Similarly to recover composition of polynomials in more than one variable.

A quick remark

Studying arbitrary n-ary operations is the basic starting point of universal algebra. When we look at n-ary operations on an arbitrary concrete category, we are asking in some sense how much of the structure of objects in that category is algebraic structure. In the above cases, where we started with algebraic objects, the answer is more or less “all of it.” In the examples below this will no longer be the case, but it may still be surprising how much algebraic structure we can find.

Less algebraic examples

Example. First, a boring example. Let C = \text{Top} be the category of topological spaces. The forgetful functor C \to \text{Set} has left adjoint given by the functor which equips a set with the discrete topology. Hence there are no interesting n-ary operations: all we get are projections

\displaystyle X^n \ni (x_1, ... x_n) \mapsto x_i \in X.

Even if we look for infinitary operations U^S \to U, where S is an infinite set, we only find projection operations. So the structure of topological spaces is not well captured by operations; in some sense, \text{Top} is very far from being algebraic.

Example. Let C = \text{CHaus} be the category of compact Hausdorff spaces. The forgetful functor C \to \text{Set} has left adjoint given by the Stone-Čech compactification S \mapsto \beta(S) on discrete spaces, which is just the functor which associates to a set the set of ultrafilters on it (see, for example, this previous post). It follows that the ultrafilters \beta(S) on a set S give us the |S|-ary operations on compact Hausdorff spaces.

These operations are precisely given by taking limits with respect to the given ultrafilter. For finite S they are uninteresting as in the case of topological spaces, since all ultrafilters on a finite set are principal. For infinite S we get an interesting operation for every non-principal ultrafilter on S. Taking limits with respect to ultrafilters in this manner completely recovers the topology on a compact Hausdorff space X, and in fact it is possible to define the category of compact Hausdorff spaces in this manner (it is precisely the category of algebras over the ultrafilter monad).

Example. Let C = \text{Ban}_1 be the category of complex Banach spaces and weak contractions (see, for example, this previous post). The representable functor \text{Hom}(\mathbb{C}, -) sends a Banach space to its unit ball; we will take this as the forgetful functor U. Then U has a left adjoint S \mapsto \ell^1(S), so the unit ball of \ell^1(S) gives us the |S|-ary operations on unit balls of Banach spaces (we can consider natural transformations U^S \to U for arbitrary S).

The finitary operations are given by combinations of addition and scalar multiplication (subject to the constraint that we stay in the unit ball), but we get interesting infinitary operations that invoke completeness: for example, for any sequence c_1, c_2, ... in the unit ball of \ell^1(\mathbb{N}) (so \sum_{i=1}^{\infty} |c_i| \le 1), we get an infinitary operation

\displaystyle (x_1, x_2, ... ) \mapsto \sum_{i=1}^{\infty} c_i x_i

on elements x_1, x_2, ... of the unit ball of some Banach space.

Example. Let C = \text{Man}^{op} be the opposite of the category of smooth manifolds. Remembering the slogan that algebra is dual to geometry, this category ought to have something to do with algebras of some kind. Accordingly, for the forgetful functor we will choose the functor M \mapsto C^{\infty}(M) sending a smooth manifold to the algebra of smooth real-valued functions on it. This functor is represented by \mathbb{R}, and C has finite coproducts, so the n-ary operations on \text{Man}^{op} are given by the underlying set of the coproduct of n copies of \mathbb{R} (in the opposite category) or, in other words, the smooth functions C^{\infty}(\mathbb{R}^n) on \mathbb{R}^n. If f : \mathbb{R}^n \to \mathbb{R} is such a function, the corresponding operation is given by

\displaystyle C^{\infty}(M)^n \ni (g_1, ... g_n) \mapsto f(g_1, ... g_n) \in C^{\infty}(M).

Example. Let C be the category of commutative Banach algebras. The holomorphic functional calculus guarantees that every holomorphic function \mathbb{C}^n \to \mathbb{C} gives an n-ary operation U^n \to U. These are precisely all of the n-ary operations; see this MO question for a sketch of an argument, although we will give a more general argument in a later post. Note that U is not representable; roughly speaking, there is no free commutative Banach algebra on an element because that element must have some fixed norm and hence cannot have arbitrarily large complex numbers in its spectrum.

Example. Similarly, let C be the category of commutative C*-algebras. The continuous functional calculus guarantees that every continuous function \mathbb{C}^n \to \mathbb{C} gives an n-ary operation U^n \to U. These are precisely all of the n-ary operations by an argument similar to the above argument. Note again that U is not representable.

Lawvere theories

When (C, U) is a concrete category as above, it’s natural to think of operations more generally as morphisms between the objects 1, U, U^2, U^3, ... in C \Rightarrow \text{Set} (the category of functors C \to \text{Set}), where 1 denotes the functor which sends every object to 1 \in \text{Set}. By the universal property of products, a morphism U^n \to U^m is precisely an m-tuple of morphisms U^n \to U, so classifying such morphisms reduces to the n-ary case we’ve already considered, but allowing operations to have multiple outputs as well as inputs highlights the fact that we’re really looking at a certain full subcategory of C \Rightarrow \text{Set}. If U is representable by an object u \in C, this is the opposite category of the full subcategory of C on the objects 0, u, u \sqcup u, ... (where 0 denotes an initial object), and if U has a left adjoint, this is the opposite category of the full subcategory of C on the free objects F(0), F(1), F(2), .... This category is a Lawvere theory, namely a category with finite products together with a distinguished object (in this case U) such that each object is some power of it.

(This formalism doesn’t capture infinitary operations as in the case of compact Hausdorff spaces above; there we need to consider infinitary Lawvere theories. Here we consider only the finitary case for simplicity.)

Edit, 3/17/16: It’s been pointed out to me that there’s a size issue here: it could happen that the collection of natural transformations U^n \to U does not form a set, so that the Lawvere theory of a concrete category (C, U) as defined above might fail to be locally small, even if C is assumed to be locally small. An explicit example where this happens is when C is the category of ordinal-graded vector spaces with finite support: that is, the category of formal finite direct sums of objects of the form V_{\alpha}, where \alpha is an ordinal. Here we can take U : C \to \text{Set} to be the forgetful functor assigning such a formal direct sum the correponding actual direct sum of vector spaces. The natural endomorphisms U \to U are given by multiplication by a scalar c_{\alpha} on V_{\alpha} for every ordinal \alpha, and these don’t form a set.

I believe this is not an issue if U is assumed to be a colimit of representables in the category of functors C \to \text{Set}, which is true in all of the examples we looked at and, I think, true in all cases of interest. (For me all colimits are over essentially small diagrams; the Yoneda lemma only tells you that every functor C \to \text{Set} is a possibly large colimit of representables.) /Edit

Thinking of a Lawvere theory as a specification of a collection of operations and rules for how to compose them, the natural thing to do with a Lawvere theory is to write down its category of models. What should this mean? If T is a Lawvere theory with distinguished object x, a model of the theory T should be a set M such that we assign to each morphism f : x^n \to x^m a function M^n \to M^m in a way which is compatible with composition and products.

In short, a model of a Lawvere theory T should be a product-preserving functor T \to \text{Set}. The category of models of T (in \text{Set}) is therefore the category \text{Prod}(T, \text{Set}) of such functors. It naturally comes equipped with a faithful forgetful functor to \text{Set} given by evaluating a functor on the object x \in T, making it a concrete category.

Example. If C is any of the algebraic examples \text{Grp}, \text{Ab}, \text{Ring}, \text{CRing} we encountered earlier, the categories of models of the corresponding Lawvere theories T_{\text{Grp}}, T_{\text{Ab}}, T_{\text{Ring}}, T_{\text{CRing}} are precisely the original categories \text{Grp}, \text{Ab}, \text{Ring}, \text{CRing} respectively.

Example. If C = \text{Ban}_1 with the unit ball forgetful functor, the category of models of the corresponding infinitary Lawvere theory (where we include the countable product U^{\mathbb{N}} and require models to be functors preserving countable products) is the category of totally convex spaces. This is in some sense the closest algebraic theory approximating Banach spaces.

Example. If C = \text{Man}^{op} with the C^{\infty} forgetful functor, the category of models of the corresponding Lawvere theory is the category of smooth algebras. The algebras C^{\infty}(M) are natural examples of smooth algebras, but smooth algebras can also contain nilpotents like \mathbb{R}[x]/x^2. The object \text{Spec } \mathbb{R}[x]/x^2 in the opposite of the category of smooth algebras, which behaves like a category of generalized smooth manifolds, represents the tangent bundle functor.

Lawvere theories give us a natural way to construct functors between categories of algebraic objects. First, note that Lawvere theories themselves form a category: a morphism between Lawvere theories should be a product-preserving functor T_1 \to T_2 sending the fixed object of T_1 to the fixed object of T_2. Next observe that a morphism of Lawvere theories induces a functor between the categories of models \text{Prod}(T_2, \text{Set}) \to \text{Prod}(T_1, \text{Set}) in the other direction.

Example. There is a natural inclusion T_{\text{Ab}} \to T_{\text{Ring}} of Lawvere theories given by mapping the addition operations on abelian groups to the addition operations on underlying groups of rings. It induces the underlying group forgetful functor \text{Ring} \to \text{Ab}.

Example. There is a natural quotient T_{\text{Ring}} \to T_{\text{CRing}} of Lawvere theories given by abelianization (via the natural map \mathbb{Z} \langle x_1, ... x_n \rangle \to \mathbb{Z}[x_1, ... x_n]). It induces the forgetful functor \text{CRing} \to \text{Ring}.

Remark. One reason to favor working with Lawvere theories is that we can consider models in categories other than \text{Set}: if C is any category with finite products, we can define a model of a Lawvere theory T to be a product-preserving functor T \to C. For example, a topological group is a model T_{\text{Grp}} \to \text{Top} of the Lawvere theory of groups in the category of topological spaces; similarly, a Lie group is a model T_{\text{Grp}} \to \text{Man}.

With this generalized notion of model, every Lawvere theory has an initial model, namely the identity functor \text{id}_T : T \to T, which exhibits the distinguished object of T as the universal model of T. This generalizes the observation made in a previous post about the distinguished object in T_{\text{Grp}} being the universal group. Moreover, with this notion of model, a morphism of Lawvere theories is just a description of the distinguished object of one Lawvere theory as a model for another Lawvere theory. The two examples we gave above can be summarized by the slogans “the universal ring is an abelian group” and “the universal commutative ring is a ring.”

The category of Lawvere theories gives us a natural interpretation of the fact that the category of models of a Lawvere theory is concrete. Namely, there is an initial Lawvere theory: it has an object x and finite products 1, x, x^2, ... and is the free category on this data. The only morphisms one can write down from this data are the unique morphism x \to 1, the projection maps x^2 \to x, and products and compositions of these. This gives a category equivalent to \text{FinSet}^{op}. We encountered this Lawvere theory when considering \text{Top} with the usual forgetful functor. The category of models of this Lawvere theory in \text{Set} is naturally equivalent to \text{Set}.

By the universal property, every Lawvere theory admits a unique morphism from the initial Lawvere theory, and this induces a functor on categories of models in the other direction which is precisely the usual forgetful functor.

A reconstruction theorem

(Below, “model” means “model in \text{Set}.”)

Above we saw that in some examples, starting from a concrete category (C, U), forming the corresponding Lawvere theory T, and considering its category of models \text{Prod}(T, \text{Set}), we naturally reconstruct the original category. In fact the following general result holds.

Theorem: Let (C, U) be a concrete category and let T be the Lawvere theory generated by U. Then there is a naturally defined functor from C to the category of models of T. This functor is a morphism of concrete categories, and it is initial among all morphisms of concrete categories from (C, U) to the category of models of a Lawvere theory.

The theorem tells us that the category of models of T is in some sense the closest approximation to (C, U) by the category of models of a Lawvere theory.

Corollary: If (C, U) is itself the category of models of a Lawvere theory, then the category of models of T is naturally equivalent to (C, U) as a concrete category.

Proof. The idea of the proof is that when we define T we’ve already described all n-ary operations on the underlying sets U preserved by the morphisms of C. Any other morphism of concrete categories into the category of models of a Lawvere theory can only supply us with n-ary operations we already have.

First, we want to show that there is a natural functor from (C, U) to the category of models of T. This should be clear by definition, but it is good to be explicit. In general, for any categories C, D satisfying mild size conditions, there is a natural evaluation functor

\displaystyle C \times (C \Rightarrow D) \to D

where as usual C \Rightarrow D is the category of functors C \to D. We can restrict this functor to various subcategories in either factor. In particular, if (C, U) is a concrete category, restricting the natural evaluation functor

\displaystyle C \times (C \Rightarrow \text{Set}) \to \text{Set}

to the Lawvere theory T (regarded as a subcategory of C \Rightarrow \text{Set}) generated by U gives a functor

\displaystyle C \times T \to \text{Set}

and, since functor categories are exponential objects, a functor

\displaystyle C \to (T \Rightarrow \text{Set}).

Explicitly, this functor sends every object c \in C to the functor which assigns an object x^n \in T the object U^n(c) \in \text{Set}; in particular, it’s a functor to the category of models of T.

We want to prove that this functor is initial among functors to categories of models of Lawvere theories, so let F : (C, U) \to (C', U') be a morphism of concrete categories (a functor F : C \to C' such that U' \circ F = U) such that (C', U') is the category of models of a Lawvere theory T'. Then F defines a functor

\displaystyle C \to (T' \Rightarrow \text{Set})

which gives a functor

\displaystyle C \times T' \to \text{Set}

which in turn gives a functor

\displaystyle T' \to (C \Rightarrow \text{Set}).

Letting x' \in T' be the distinguished object, this functor sends x' to U and x'^n to U^n (by the assumption that U' \circ F = U) and sends morphisms x'^n \to x'^m to natural transformations U^n \to U^m, hence gives a morphism of Lawvere theories T' \to T inducing a functor \text{Prod}(T, \text{Set}) \to \text{Prod}(T', \text{Set}) \cong C' on categories of models in the other direction through which F : C \to C' factors by inspection. The conclusion follows. \Box

Revisiting topological spaces

This post can be thought of as addressing the following natural question: concrete categories (C, U) usually arise as categories of structured sets and morphisms preserving that structure. Given a concrete category, can we reconstruct the structure that its morphisms preserve in some appropriate sense? The post shows that the answer is yes for categories of models of a Lawvere theory, where the structure is n-ary operations, but it is emphatically no in general, since not all structure is captured by n-ary operations. Our worst failure was on the category \text{Top}, where we only managed to get the initial Lawvere theory.

Following the general slogan that algebra is dual to topology, as well as taking a cue from the example of \text{Man}^{op}, we might decide to look at \text{Top}^{op} instead. A natural forgetful functor to choose here is the functor X \mapsto O(X) sending a topological space to its open subsets, which is represented by the Sierpinski space S = \{ 0, 1 \} (with the open sets \{ \emptyset, \{ 1 \}, \{ 0, 1 \} \}); here we identify an open subset with its indicator function as an element of \text{Hom}(X, S). (We will ignore the fact that this functor is not faithful, hence strictly speaking does not define a concretization.)

Now we have lots of operations, but on open subsets rather than points of topological spaces. The obvious guess is that the finitary operations should be generated by finite intersection and union, since these are all preserved by the action of continuous maps f : X \to Y on open sets f^{-1} : O(Y) \to O(X). To verify this, since O is represented by S, the n-ary operations O^n \to O are given by morphisms S^n \to S, or equivalently the open subsets of S^n. Any such open subset is the union of sets of the form \prod_{i=1}^n V_i where V_i is one of \emptyset, \{ 1 \}, \{ 0, 1 \}. If any of the V_i are empty then so is the product, in which case the corresponding operation is the one sending any n-tuple of open subsets to the empty set. Otherwise, the corresponding operation is the one sending an n-tuple of open subsets to an intersection of some of them (the ones at every index i such that V_i = \{ 1 \}). Taking unions of open subsets of S^n corresponds to taking the pointwise union of the corresponding operations, and we recover the desired result.

The Lawvere theory we obtain in this way is the Lawvere theory T_{\text{DLat}} of (edit:) distributive lattices (posets with finite products and coproducts which distribute over each other). Since open subsets are closed under arbitrary unions but only finite intersection, we get an interesting infinitary Lawvere theory if we look at infinitary operations, namely the Lawvere theory of frames (posets with all coproducts and finite products such that the former distribute over the latter). The category \text{Frm} is therefore in some sense the closest approximation to \text{Top}^{op} by the category of models of an infinitary Lawvere theory, and consequently its opposite category, the category of locales, approximates \text{Top} itself. Substituting topological spaces with locales leads to pointless topology, which classically is nearly identical to ordinary point-set topology. However, pointless topology behave better with respect to weaker foundations, and in particular behaves better when internalized to, for example, a topos. It also captures precisely the part of topology which is captured by sheaves.


Viewing all articles
Browse latest Browse all 4

Trending Articles