In Lecture 4 we covered the basic algebraic and geometric aspects of Hilbert space and Euclidean space. In this lecture we cover Descartes’ idea to encode points in such spaces as finite lists of numbers — the method of coordinates. A straightforward implementation of this idea only works when finitely many coordinates suffice, so we first need to introduce the notion of dimension, which is done using the concept of linear independence. This material will be mostly familiar from prior exposure to linear algebra, so several proofs are omitted (though I may fill them in later).
Let be a vector space.
Definition 5.1. A finite set is said to be linearly independent if the identity
holds precisely when for each
Remark that Definition 5.1 applies in the case where is the empty set, which is a finite set of cardinality zero.
Definition 5.2. We say that is finite-dimensional if there exists a nonnegative integer
such that that
contains a linearly independent set of cardinality
, but does not contain a linearly independent set of cardinality
In this case the number
is called the dimension of
For example, the zero space is a finite-dimensional vector space of dimension zero.
Now let be a finite-dimensional vector space, fixed for the remainder of the lecture. Let
be a linearly independent set such that
Theorem 5.3. (Coordinate Theorem) For every vector there are scalars
such that
Moreover, if also
then for each
A set as in Theorem 5.3 is said to be a basis of
Now suppose is equipped with a scalar product, i.e. it is either a Hilbert space (scalar field
) or a Euclidean space (scalar field
).
Theorem 5.4. (Gram-Schmidt) There exists a linearly independent set of cardinality
such that each
satisfy
A set as in Theorem 5.4 is said to be an orthonormal basis of
.
Theorem 5.5. (Cartesian Coordinates) Let be an orthonormal basis. Then, for ever
we have
Proof: By the coordinate theorem, we have
and therefore
-QED
The scalar products
are called the Cartesian coordinates of
relative to the orthonormal basis
.
Now suppose we choose an ordering (aka labeling, aka enumeration) of
Then,
is called an ordered orthonormal basis of
and the Cartesian coordinates of any vector
relative to
can be stored as an ordered list,
i.e. as a spatial vector in where
In this way abstract vectors in a Euclidean space or Hilbert space can be represented as spatial vectors once an ordered orthonormal basis is chosen. However, for most purposes it is not necessary to choose an ordering on
and one may simply work directly with unordered Cartesian coordinates. For example, we have
for every pair of vectors and in particular
The use of Cartesian coordinates without imposing an ordering is a middle path between the asceticism of never using any coordinates and the indulgence of always using ordered coordinates. Another important computation is change of coordinates. Suppose that is a novel orthonormal basis generating its own Cartesian coordinates
To express the novel coordinates we compute
Thus, by uniqueness of coordinates in a given basis, we conclude that
holds for every This is how change-of-basis is done without ordering bases.
Definition 5.6. A function from one Hilbert space to another is said to be linear if
and
For some reason, linear functions between Hilbert spaces are called linear transformations; they are generally denoted by capital letters and the usual brackets surrounding the argument to the function are omitted.
Previously, we discussed functions between finite sets and showed how these can be encoded as binary matrices (“one-hot encoding”). A non-trivial Hilbert space is an uncountably infinite set and functions between such objects can be very complicated. However, linear functions between finite-dimensional Hilbert spaces are not much more complicated than functions between finite sets: they are encoded by matrices whose entries may be any elements of the ground field.
Let and
be two Hilbert spaces with specified orthonormal bases, and let
be a linear transformation. For any
we have
Thus, is uniquely determined by the
vectors
and for each of these we have
It follows that is uniquely determined by the
scalar products
which we call the matrix elements of relative to
and
If one wishes, orderings
and
of
and
may be chosen and the matrix elements of
may be arranged into the
matrix
with entries
but for most purposes it is not necessary to do this and one can work directly with the matrix elements of relative to two unordered orthonormal bases.
As an example, let us discuss change of basis. In the category of finite sets with functions the only thing that can occur is a relabeling of the points of
and
which will change the one-hot encoding of
by permuting its rows and columns. For linear maps between Hilbert spaces
and
, in which
and
are sitting as orthonormal bases, we may consider genuinely different orthonormal basis
and
We then have
a formula which gives the new matrix elements in terms of the old ones and the Cartesian coordinates of relative to the original bases
Now let us consider linear transformations and
where
are Hilbert spaces with orthonormal bases
respectively. Then, the composition
is a function
and you can (and should) check that it is a linear transformation. We compute the matrix elements of the composition transformation in terms of the matrix elements of its constituents as follows: