4. The Monoidal Category of Hilbert Spaces

An important goal of the enterprise of physics is to describe, not just one physical system at a time, but also how a large complicated system can be built out of smaller simpler ones. The simplest case is a so-called `joint system': a system built out of two separate parts. Our experience with the everyday world leads us to believe that to specify the state of a joint system, it is necessary and sufficient to specify states of its two parts. (Here and in what follows, by `states' we always mean what physicists call `pure states'.) In other words, a state of the joint system is just an ordered pair of states of its parts. So, if the first part has as its set of states, and the second part has as its set of states, the joint system has the cartesian product as its set of states.

One of the more shocking discoveries of the twentieth century is that
this is *wrong*. In both classical and quantum physics, given
states of each part we get a state of the joint system. But only in
classical physics is every state of the joint system of this form! In
quantum physics are also `entangled' states, which can only be
described as *superpositions* of states of this form. The reason
is that in quantum theory, the states of a system are no longer
described by a set, but by a Hilbert space. Moreover -- and this is
really an extra assumption -- the states of a joint system are
described not by the cartesian product of Hilbert spaces, but by their
tensor product.

Quite generally, we can imagine using objects in any category to describe physical systems, and morphisms between these to describe processes. In order to handle joint systems, this category will need to have some sort of `tensor product' that gives an object for any pair of objects and . As we shall explain, categories of this sort are called `monoidal'. The category is a example where the tensor product is just the usual cartesian product of sets. Similarly, the category is a monoidal category where the tensor product is the usual tensor product of Hilbert spaces. However, these two examples are very different, because the product in is `cartesian' in a certain technical sense, while the product in is not. This turns out to explain a lot about why joint systems behave so counterintuively in quantum physics. Moreover, it is yet another way in which resembles more than .

To see this in detail, it pays to go back to the beginning and think about cartesian products. Given two sets and , we define to be the set of all ordered pairs with and . But what is an ordered pair? This depends on our approach to set theory. We can use axioms in which ordered pairs are a primitive construction, or we can define them in terms of other concepts. For example, in 1914, Wiener defined the ordered pair to be the set . In 1922, Kuratowski gave the simpler definition . We can use the still simpler definition if our axioms exclude the possibility of sets that contain themselves. Various other definitions have also been tried [17]. In traditional set theory we arbitrarily choose one approach to ordered pairs and then stick with it. Apart from issues of convenience or elegance, it does not matter which we choose, so long as it `gets the job done'. In other words, all these approaches are all just technical tricks for implementing our goal, which is to make sure that if and only if and .

It is a bit annoying that the definition of ordered pair cannot get
straight to the point and capture the concept without recourse to an
arbitrary trick. It is natural to seek an approach that focuses more
on the *structural role* of ordered pairs in mathematics and less
on their *implementation*. This is what category theory provides.

The reason traditional set theory arbitarily chooses a specific implementation of the ordered pair concept is that it seems difficult to speak precisely about ``some thing -- I don't care what it is -- with the property that iff and ''. So, the first move in category theory is to stop focussing on ordered pairs and instead focus on cartesian products of sets. What properties should the cartesian product have? To make our answer applicable not just to sets but to objects of other categories, it should not refer to elements of . So, the second move in category theory is to describe the cartesian product in terms of functions to and from this set.

The cartesian product has
functions called `projections' to the sets and :

Secretly we know that these pick out the first or second component of any ordered pair in :

But, our goal is to characterize the product by means of these projections without explicit reference to ordered pairs. For this, the key property of the projections is that given any element and any element , there exists a unique element such that and . Furthermore, as a substitute for elements of the sets and , we can use functions from an arbitrary set to these sets.

Thus, given two sets and , we define their **cartesian product** to be any set equipped with
functions
,
such that for any set and functions
,
, there exists a unique function
with

Note that with this definition, the cartesian product is not unique! Wiener's definition of ordered pairs gives a cartesian product of the sets and , but so does Kuratowski's, and so does any other definition that `gets the job done'. However, this does not lead to any confusion, since one can easily show that any two choices of cartesian product are isomorphic in a canonical way. For a proof of this and other facts about cartesian products, see for example the textbook by McLarty [27].

All this generalizes painlessly to an arbitrary category.
Given two objects and in some category,
we define their **cartesian product** (or simply **product**)
to be any object equipped with morphisms

called

We say a category has **binary products** if every pair of objects
has a a product. One can also talk about -ary products for other
values of , but a category with binary products has -ary
products for all , since we can construct these as iterated
binary products. The case is trivial, since the product of
one object is just that object itself (up to canonical isomorphism).
The only remaining case is . This is surprisingly important.
A -ary product is usually called a **terminal object** and
denoted : it is an object such that that for any object there
exists a unique morphism from to . Terminal objects are unique
up to canonical isomorphism, so we feel free to speak of `the'
terminal object in a category when one exists. The reason we denote
the terminal object by is that in , any set with one element
is a terminal object. If a category has a terminal object and binary
products, it has -ary products for all , so we say it has **finite products**.

It turns out that these concepts capture much of our intuition about
joint systems in classical physics. In the most stripped-down version
of classical physics, the states of a system are described as elements
of a mere *set*. In more elaborate versions, the states of a
system form an object in some fancier category, such as the category
of *topological spaces* or *manifolds*. But, just like
, these fancier categories have finite products -- and we use
this fact when describing the states of a joint system.

To sketch how this works in general, suppose we have any category with
finite products. To do physics with this, we think of any of the
objects of this category as describing some physical system. It
sounds a bit vague to say that a physical system is `described by'
some object , but we can make this more precise by saying that
states of this system are morphisms
. When our
category is , a morphism of this sort simply picks out an
element of the set . In the category of topological spaces, a
morphism of this sort picks out a point in the topological space
-- and similarly for the category of manifolds, and so on. For this
reason, category theorists call a morphism
an **element** of the object .

Next, we think of any morphism as a `process' carrying states of the system described by to states of the system described by . This works as follows: given a state of the first system, say , we can compose it with to get a state of the second system, .

Then, given two systems that are described by the objects and , respectively, we decree that the joint system built from these is described by the object . The projection can be thought of as a process that takes a state of the joint system and discards all information about the second part, retaining only the state of the first part. Similarly, the projection retains only information about the second part.

Calling these projections `processes' may strike the reader as
strange, since `discarding information' sounds like a subjective
change of our *description* of the system, rather than an
objective physical process like time evolution. However, it is worth
noting that in special relativity, time evolution corresponds to a
change of coordinates
, which can also be thought of
as change of our description of the system. The novelty in thinking
of a projection as a physical process really comes, not from the fact
that it is `subjective', but from the fact that it is not invertible.

With this groundwork laid, we can use the definition of `product' to show that a state of a joint system is just an ordered pair of states of each part. First suppose we have states of each part, say and . Then there is a unique state of the joint system, say , which reduces to the given state of each part when we discard information about the other part: and . Conversely, every state of the joint system arises this way, since given we can recover and using these equations.

However, the situation changes drastically when we switch to
quantum theory! The states of a quantum system can still be
thought of as forming a set. However, we do not
take the product of these sets to be the set of
states for a joint quantum system. Instead, we describe
states of a system as unit vectors in a Hilbert space, modulo
phase. We define the Hilbert space for a joint system to be the
*tensor product* of the Hilbert spaces for its parts.

The tensor product of Hilbert spaces is not a cartesian product in the sense defined above, since given Hilbert spaces and there are no linear operators and with the required properties. This means that from a (pure) state of a joint quantum system we cannot extract (pure) states of its parts. This is the key to Bell's `failure of local realism'. Indeed, under quite general conditions one can derive Bell's inequality from the assumption that pure states of a joint system determine pure states of its parts [3,8], so violations of Bell's inequality should be seen as an indication that this assumption fails.

The Wooters-Zurek argument that `one cannot clone a quantum state'
[32] is also based on the fact that the tensor product of Hilbert
spaces is not cartesian. To get some sense of this, note that
whenever is an object in some category for which the product
exists, there is a unique morphism

such that and . This morphism is called the

In , however, since the tensor product is not a product in the category-theoretic sense, it makes no sense to speak of a diagonal morphism . In fact, a stronger statement is true: there is no natural (i.e. basis-independent) way to choose a linear operator from to other than the zero operator. So, there is no way to duplicate information in quantum theory.

Since the tensor product is not a cartesian product in the sense explained above, what exactly is it? To answer this, we need the definition of a `monoidal category'. Monoidal categories were introduced by Mac Lane [23] in early 1960s, precisely in order to capture those features common to all categories equipped with a well-behaved but not necessarily cartesian product. Since the definition is a bit long, let us first present it and then discuss it:

**Definition. ** *A* **monoidal category** *consists of:*

**(i)***a category*,**(ii)***a functor*,**(iii)***a***unit object**,**(iv)***natural isomorphisms called the***associator**:

*the***left unit law**:

*and the***right unit law**:

**(v)**-

**(vi)**-

This obviously requires some explanation! First, it makes use of some
notions we have not explained yet, ruining our otherwise admirably
self-contained treatment of category theory. For example, what is
in clause (ii) of the definition? This is just the
category whose objects are pairs of objects in , and whose
morphisms are pairs of morphisms in , with composition of
morphisms done componentwise. So, when we say that the tensor product
is a functor
, this implies
that for any pair of objects
there is an object
, while for any pair of morphisms
in there is a morphism
in . Morphisms are just as important
as objects! For example, in , not only can we take the tensor
product of Hilbert spaces, but also we can take the tensor product of
bounded linear operators
and
,
obtaining a bounded linear operator

In physics, we think of as a joint process built from the processes and `running in parallel'. For example, if we have a joint quantum system whose two parts evolve in time without interacting, any time evolution operator for the whole system is given by the tensor product of time evolution operators for the two parts.

Similarly, in the tensor product is given by disjoint union,
both for objects and for morphisms. In Figure 8
we show
two spacetimes and and their tensor product .
This as a way of letting two spacetimes `run in parallel', like
independently evolving separate universes. The resemblance to the
tensor product of morphisms in should be clear.
Just as in , the tensor product in is not a cartesian
product: there are no projections with the required properties. There
is also no natural choice of a cobordism from to .
This means that the very nature of topology prevents us from finding
spacetimes that `discard' part of space, or `duplicate' space. Seen
in this light, the fact that we cannot discard or duplicate
information in quantum theory is not a flaw or peculiarity of this
theory. *It is a further reflection of the deep structural
analogy between quantum theory and the conception of spacetime
embodied in general relativity.*

Turning to clause (iii) in the definition, we see that a monoidal category needs to have a `unit object' . This serves as the multiplicative identity for the tensor product, at least up to isomorphism: as we shall see in the next clause, and for every object . In the unit object is regarded as a Hilbert space, while in it is the empty set regarded as an -dimensional manifold. Any category with finite products gives a monoidal category in which the unit object is the terminal object .

This raises an interesting point of comparison. In classical physics we describe systems using objects in a category with finite products, and a state of the system corresponding to the object is just a morphism . In quantum physics we describe systems using Hilbert spaces. Is a state of the system corresponding to the Hilbert space the same as a bounded linear operator ? Almost, but not quite! As we saw in Section 3, such operators are in one-to-one correspondence with vectors in : any vector corresponds to an operator with . States, on the other hand, are the same as unit vectors modulo phase. Any nonzero vector in gives a state after we normalize it, but different vectors can give the same state, and the zero vector does not give a state at all. So, quantum physics is really different from classical physics in this way: we cannot define states as morphisms from the unit object. Nonetheless, we have seen that the morphisms play a fundamental role in quantum theory: they are just Dirac's `kets'.

Next, let us ponder clause (iv) of the definition of monoidal
category. Here we see that the tensor product is associative,
but only *up to a specified isomorphism*, called the `associator'.
For example, in we do not have
,
but there is an obvious isomorphism

given by

Similarly, we do not have and , but there are obvious isomorphisms

Moreover, all these isomorphisms are `natural' in a precise sense. For example, when we say the associator is natural, we mean that for any bounded linear operators , , the following square diagram commutes:

In other words, composing the top morphism with the right-hand one gives the same result as composing the left-hand one with the bottom one. This compatibility condition expresses the fact that no arbitrary choices are required to define the associator: in particular, it is defined in a basis-independent manner. Similar but simpler `naturality squares' must commute for the left and right unit laws.

Finally, what about clauses (v) and (vi) in the definition of monoidal
category? These are so-called `coherence laws', which let us
manipulate isomorphisms with the same ease as if they were equations.
Repeated use of the associator lets us construct an isomorphism from
any parenthesization of a tensor product of objects to any other
parenthesization -- for example, from
to
. However, we can
actually construct *many* such isomorphisms -- and in this
example, the pentagonal diagram in clause (v) shows two. We would
like to be sure that all such isomorphisms from one parenthesization
to another are equal. In his fundamental paper on monoidal categories,
Mac Lane [23] showed that the commuting pentagon in clause
(v) guarantees this, not just for a tensor product of four objects,
but for arbitrarily many. He also showed that clause (vi) gives a
similar guarantee for isomorphisms constructed using the left and
right unit laws.

© 2004 John Baez

baez@math.removethis.ucr.andthis.edu