Monads, Monoids, and Categories

This is part 31 of Categories for Programmers. Previously: Lawvere Theories. See the Table of Contents .

There is no good place to end a book on category theory. There’s always more to learn. Category theory is a vast subject. At the same time, it’s obvious that the same themes, concepts, and patterns keep showing up over and over again. There is a saying that all concepts are Kan extensions and, indeed, you can use Kan extensions to derive limits, colimits, adjunctions, monads, the Yoneda lemma, and much more. The notion of a category itself arises at all levels of abstraction, and so does the concept of a monoid and a monad. Which one is the most basic? As it turns out they are all interrelated, one leading to another in a never-ending cycle of abstractions. I decided that showing these interconnections might be a good way to end this book.

Bicategories

One of the most difficult aspects of category theory is the constant switching of perspectives. Take the category of sets, for instance. We are used to defining sets in terms of elements. An empty set has no elements. A singleton set has one element. A cartesian product of two sets is a set of pairs, and so on. But when talking about the category Set I asked you to forget about the contents of sets and instead concentrate on morphisms (arrows) between them. You were allowed, from time to time, to peek under the covers to see what a particular universal construction in Set described in terms of elements. The terminal object turned out to be a set with one element, and so on. But these were just sanity checks.

A functor is defined as a mapping of categories. It’s natural to consider a mapping as a morphism in a category. A functor turned out to be a morphism in the category of categories (small categories, if we want to avoid questions about size). By treating a functor as an arrow, we forfeit the information about its action on the internals of a category (its objects and morphisms), just like we forfeit the information about the action of a function on elements of a set when we treat it as an arrow in Set. But functors between any two categories also form a category. This time you are asked to consider something that was an arrow in one category to be an object in another. In a functor category functors are objects and natural transformations are morphisms. We have discovered that the same thing can be an arrow in one category and an object in another. The naive view of objects as nouns and arrows as verbs doesn’t hold.

Instead of switching between two views, we can try to merge them into one. This is how we get the concept of a 2-category, in which objects are called 0-cells, morphisms are 1-cells, and morphisms between morphisms are 2-cells.

0-cells a, b; 1-cells f, g; and a 2-cell α.

The category of categories Cat is an immediate example. We have categories as 0-cells, functors as 1-cells, and natural transformations as 2-cells. The laws of a 2-category tell us that 1-cells between any two 0-cells form a category (in other words, C(a, b) is a hom-category rather than a hom-set). This fits nicely with our earlier assertion that functors between any two categories form a functor category.

In particular, 1-cells from any 0-cell back to itself also form a category, the hom-category C(a, a); but that category has even more structure. Members of C(a, a) can be viewed as arrows in C or as objects in C(a, a). As arrows, they can be composed with each other. But when we look at them as objects, the composition becomes a mapping from a pair of objects to an object. In fact it looks very much like a product — a tensor product to be precise. This tensor product has a unit: the identity 1-cell. It turns out that, in any 2-category, a hom-category C(a, a) is automatically a monoidal category with the tensor product defined as composition of 1-cells. Associativity and unit laws simply fall out from the corresponding category laws.

Let’s see what this means in our canonical example of a 2-category Cat. The hom-category Cat(a, a) is the category of endofunctors on a. Endofunctor composition plays the role of a tensor product in it. The identity functor is the unit with respect to this product. We’ve seen before that endofunctors form a monoidal category (we used this fact in the definition of a monad), but now we see that this is a more general phenomenon: endo-1-cells in any 2-category form a monoidal category. We’ll come back to it later when we generalize monads.

You might recall that, in a general monoidal category, we did not insist on the monoid laws being satisfied on the nose. It was often enough for the unit laws and the associativity laws to be satisfied up to isomorphism. In a 2-category, monoidal laws in C(a, a) follow from composition laws for 1-cells. These laws are strict, so we will always get a strict monoidal category. It is, however, possible to relax these laws as well. We can say, for instance, that a composition of the identity 1-cell id_a with another 1-cell, f :: a -> b, is isomorphic, rather than equal, to f. Isomorphism of 1-cells is defined using 2-cells. In other words, there is a 2-cell:

ρ :: f ∘ id_a -> f

that has an inverse.

Identity law in a bicategory holds up to isomorphism (an invertible 2-cell ρ).

We can do the same for the left identity and associativity laws. This kind of relaxed 2-category is called a bicategory (there are some additional coherency laws, which I will omit here).

As expected, endo-1-cells in a bicategory form a general monoidal category with non-strict laws.

An interesting example of a bicategory is the category of spans. A span between two objects a and b is an object x and a pair of morphisms:

f :: x -> a
g :: x -> b

You might recall that we used spans in the definition of a categorical product. Here, we want to look at spans as 1-cells in a bicategory. The first step is to define a composition of spans. Suppose that we have an adjoining span:

f':: y -> b
g':: y -> c

The composition would be a third span, with some apex z. The most natural choice for it is the pullback of g along f'. Remember that a pullback is the object z together with two morphisms:

h :: z -> x
h':: z -> y

such that:

g ∘ h = f' ∘ h'

which is universal among all such objects.

For now, let’s concentrate on spans over the category of sets. In that case, the pullback is just a set of pairs (p, q) from the cartesian product x × y such that:

g p = f' q

A morphism between two spans that share the same endpoints is defined as a morphism h between their apices, such that the appropriate triangles commute.

A 2-cell in Span.

To summarize, in the bicategory Span: 0-cells are sets, 1-cells are spans, 2-cells are span morphisms. An identity 1-cell is a degenerate span in which all three objects are the same, and the two morphisms are identities.

We’ve seen another example of a bicategory before: the bicategory Prof of profunctors, where 0-cells are categories, 1-cells are profunctors, and 2-cells are natural transformations. The composition of profunctors was given by a coend.

Monads

By now you should be pretty familiar with the definition of a monad as a monoid in the category of endofunctors. Let’s revisit this definition with the new understanding that the category of endofunctors is just one small hom-category of endo-1-cells in the bicategory Cat. We know it’s a monoidal category: the tensor product comes from the composition of endofunctors. A monoid is defined as an object in a monoidal category — here it will be an endofunctor T — together with two morphisms. Morphisms between endofunctors are natural transformations. One morphism maps the monoidal unit — the identity endofunctor — to T:

η :: I -> T

The second morphism maps the tensor product of T ⊗ T to T. The tensor product is given by endofunctor composition, so we get:

μ :: T ∘ T -> T

We recognize these as the two operations defining a monad (they are called return and join in Haskell), and we know that monoid laws turn to monad laws.

Now let’s remove all mention of endofunctors from this definition. We start with a bicategory C and pick a 0-cell a in it. As we’ve seen earlier, the hom-category C(a, a) is a monoidal category. We can therefore define a monoid in C(a, a) by picking a 1-cell, T, and two 2-cells:

η :: I -> T
μ :: T ∘ T -> T

satisfying the monoid laws. We call this a monad.

That’s a much more general definition of a monad using only 0-cells, 1-cells, and 2-cells. It reduces to the usual monad when applied to the bicategory Cat. But let’s see what happens in other bicategories.

Let’s construct a monad in Span. We pick a 0-cell, which is a set that, for reasons that will become clear soon, I will call Ob. Next, we pick an endo-1-cell: a span from Ob back to Ob. It has a set at the apex, which I will call Ar, equipped with two functions:

dom :: Ar -> Ob
cod :: Ar -> Ob

Let’s call the elements of the set Ar “arrows.” If I also tell you to call the elements of Ob “objects,” you might get a hint where this is leading to. The two functions dom and cod assign the domain and the codomain to an “arrow.”

To make our span into a monad, we need two 2-cells, η and μ. The monoidal unit, in this case, is the trivial span from Ob to Ob with the apex at Ob and two identity functions. The 2-cell η is a function between the apices Ob and Arr. In other words, η assigns an “arrow” to every “object.” A 2-cell in Span must satisfy commutation conditions — in this case:

dom ∘ η = id
cod ∘ η = id

In components, this becomes:

dom (η ob) = ob = cod (η ob)

where ob is an “object” in Ob. In other words, η assigns to every “object” and “arrow” whose domain and codomain are that “object.” We’ll call this special “arrow” the “identity arrow.”

The second 2-cell μ acts on the composition of the span Ar with itself. The composition is defined as a pullback, so its elements are pairs of elements from Ar — pairs of “arrows” (a₁, a₂). The pullback condition is:

cod a₁ = dom a₂

We say that a₂ and a₁ are “composable,” because the domain of one is the codomain of the other.

The 2-cell μ is a function that maps a pair of composable arrows (a₁, a₂) to a single arrow a₃ from Ar. In other words μ defines composition of arrows.

It’s easy to check that monad laws correspond to identity and associativity laws for arrows. We have just defined a category (a small category, mind you, in which objects and arrows form sets).

So, all told, a category is just a monad in the bicategory of spans.

What is amazing about this result is that it puts categories on the same footing as other algebraic structures like monads and monoids. There is nothing special about being a category. It’s just two sets and four functions. In fact we don’t even need a separate set for objects, because objects can be identified with identity arrows (they are in one-to-one correspondence). So it’s really just a set and a few functions. Considering the pivotal role that category theory plays in all of mathematics, this is a very humbling realization.

Challenges

Derive unit and associativity laws for the tensor product defined as composition of endo-1-cells in a bicategory.
Check that monad laws for a monad in Span correspond to identity and associativity laws in the resulting category.
Show that a monad in Prof is an identity-on-objects functor.
What’s a monad algebra for a monad in Span?

Bibliography

Paweł Sobociński’s blog.

NN Says:

September 7, 2017 at 7:35 am

I really want to thank you for your articles.
I am a regular programmer without much understanding in category theory and these articles helped me a lot.

Cex Says:

September 8, 2017 at 1:32 am

I just finished to read this last post and watch the last video. This is an excellent material with the right balance between formal and intuitive explanations.
I love it!
Thank you!

I really think that is not just useful for programmers but even for mathematicians that would like to see different kinds of applications of category theory.

I’m looking forward for a paper publication!

giuliohome Says:

September 9, 2017 at 1:59 am

Thank you for all your effort 👍💪✌
I find that this last article summarizes and answers some of my initial beginner questions https://giuliohome.wordpress.com/2017/09/03/quantum-questions-about-modern-math-and-it/
Still a chance to have your feedback and your opinion about the links with quantum field theories? I guess it is not off topic since nowadays quantum computing and teleportation are subjects discussed also in the IT industry and there are innovative quantum programming languages already available 😉 (I respectfully doubt they’re actively used yet…)

Juan Manuel (@babui_) Says:

September 13, 2017 at 12:39 am

Thanks for all your effort in making all these knowledge undersatandable.

stevemao Says:

September 15, 2017 at 7:47 pm

Thanks for the series. Will you update the missing ones to your YouTube channel too? I always like videos more 🙂

Bartosz Milewski Says:

September 15, 2017 at 9:58 pm

The videos were done in front of the live audience. If I have a venue and an audience, I might do more videos.

zozo Says:

September 17, 2017 at 5:54 am

I am not sure whether you measure the contribution you provide thanks to this blog. It is the first time I actually learn something meaningful in programming with such crystal clear insights. Hope programming will finally take such direction and that spaghetti code will only be considered as an old nightmare coming from dark ages in a few years. Thanks for it.

HenryChern Says:

September 17, 2017 at 10:51 am

A very unusual ending of the book, which causes a feeling of pity for category theory. Where is the “happy end”? 🙂

PeterH Says:

September 19, 2017 at 1:16 am

A great blog. I’d like to suggest a topic: homotopy type theory. If it fits the scope of your book I think it would be really useful to explain it in an accessible way.

RayG Says:

November 2, 2017 at 4:39 am

What a great series! Book please. Now!

Florin P. Says:

November 15, 2017 at 2:55 am

There is a book : http://www.lulu.com/shop/bartosz-milewski/category-theory-for-programmers/hardcover/product-23389988.html

jvuorinen Says:

November 24, 2017 at 3:16 pm

Hi!

It says:”We say that a1 and a1 are “composable,” because the domain of one is the codomain of the other.”

Maybe it should be “a1 and a2”?

Good stuff anyway!

Meelap Says:

December 12, 2017 at 8:45 am

Your posts are such a joy to read! Thank you!

June 14, 2019 at 11:08 am

An interesting aside: You know that a single-object category is equivalent to a monoid. It turns out that a single-object bicategory is equivalent to a monoidal category. It has only one hom-category, and any two 1-cells are composable: this composition is your tensor product, identity 1-cell is your unit object.

Enrico Says:

June 1, 2021 at 1:02 pm

Not finished reading your book yet, but I’m leaning a comment here because maybe it’s the most appropriate place for it, as it is about monads.

I think you might find this question of mine interesting: https://stackoverflow.com/questions/67779824/does-the-function-monad-really-offer-something-more-than-the-function-applicativ

Bartosz Milewski's Programming Cafe