In Lecture 4, we introduced the notion of a Euclidean space, which is a vector space $\mathbf{V}$ together with a scalar product $\langle \cdot, \cdot \rangle$ defined on $\mathbf{V}$. In a Euclidean space, we can use the scalar product to define notions of vector length, distance between two vectors, and angle between two vectors. In short, while a vector space alone is a purely algebraic object, we can do Euclidean geometry in a vector space with a scalar product. This realization is extremely useful since it gives us a way to think geometrically about vectors which may not be at all like vectors in $\mathbb{R}^n$. For example, they could be functions, as in Assignment 3.
For better or worse, it turns out that Euclidean geometry, as useful as it is in this generalized setup, is not sufficient to describe the world around us. Mathematically, this means that we must sometimes think about non-Euclidean geometry. At the level of linear algebra, this comes down to opening ourselves up to thinking about general bilinear forms, which extend the scalar product concept in that they might fail to satisfy the symmetry and positivity axioms. An important example is the geometry of special relativity. In this physical theory, the vector space $\mathbb{R}^4$ is taken to model spacetime, with the first three coordinates of a vector corresponding to its position in space, and the last coordinate being its position in time. It turns out that the geometry of spacetime is governed by a "fake" scalar product, called the Lorentz form, which is defined by

$$\langle v, w \rangle_L = v_1 w_1 + v_2 w_2 + v_3 w_3 - v_4 w_4.$$
So, physicists are telling us that in order to understand the geometry of spacetime we have to think about a strange version of the usual dot product on $\mathbb{R}^4$ which is made by taking the usual dot product of the spatial coordinates, and then subtracting the product of the time coordinates — typical, they always do this kind of thing. The Lorentz form is definitely not a scalar product, since the length of a vector can be negative:

$$\langle (0,0,0,1), (0,0,0,1) \rangle_L = -1.$$

Still, mathematically, there's no reason we can't consider such fake scalar products as a legitimate generalization of the scalar product concept.
Definition 1: A function $\langle \cdot, \cdot \rangle \colon \mathbf{V} \times \mathbf{V} \to \mathbb{R}$ is said to be a bilinear form if, for all vectors $v_1, v_2, w \in \mathbf{V}$ and all scalars $a_1, a_2 \in \mathbb{R}$, we have

$$\langle a_1 v_1 + a_2 v_2, w \rangle = a_1 \langle v_1, w \rangle + a_2 \langle v_2, w \rangle$$

and

$$\langle w, a_1 v_1 + a_2 v_2 \rangle = a_1 \langle w, v_1 \rangle + a_2 \langle w, v_2 \rangle.$$
So, a bilinear form is just a "weak" scalar product on $\mathbf{V}$ which might fail two out of three of the scalar product axioms.
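If you like to experiment numerically, here is a minimal Python sketch (my own illustration, not part of the lecture; it assumes numpy) checking that the Lorentz form satisfies the bilinearity requirements of Definition 1 while failing positivity:

```python
import numpy as np

def lorentz(v, w):
    # Dot product of the three spatial coordinates minus the product
    # of the time coordinates (the fourth entries).
    return v[0]*w[0] + v[1]*w[1] + v[2]*w[2] - v[3]*w[3]

rng = np.random.default_rng(0)
v1, v2, w = rng.standard_normal((3, 4))
a1, a2 = 2.0, -3.0

# Bilinearity in the first argument (the second argument is analogous):
assert np.isclose(lorentz(a1*v1 + a2*v2, w),
                  a1*lorentz(v1, w) + a2*lorentz(v2, w))

# Positivity fails: a purely "temporal" vector has negative length.
t = np.array([0.0, 0.0, 0.0, 1.0])
print(lorentz(t, t))  # prints -1.0
```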
In this lecture, we will see that the set of all bilinear forms that can be defined on an $n$-dimensional vector space $\mathbf{V}$ can be viewed as the set of all tables of real numbers with $n$ rows and $n$ columns, or in other words $n \times n$ matrices. In fact, it is not difficult to come to this realization: we just have to pick a basis $E$ in $\mathbf{V}$ in order to describe a given bilinear form as a matrix. Things however get a bit tricky when we want to compare the two matrices which describe the same bilinear form relative to different bases.
Let’s start with something easier.
Definition 2: A function $f \colon \mathbf{V} \to \mathbb{R}$ is said to be a linear form if, for all vectors $v_1, v_2 \in \mathbf{V}$ and all scalars $a_1, a_2 \in \mathbb{R}$, we have

$$f(a_1 v_1 + a_2 v_2) = a_1 f(v_1) + a_2 f(v_2).$$
Now suppose that $f$ is a linear form on an $n$-dimensional vector space $\mathbf{V}$. Then, in order to be able to compute the number $f(v)$ for any vector $v \in \mathbf{V}$, it is sufficient to know how to calculate the numbers

$$f(e_1), \dots, f(e_n),$$

where $E = \{e_1, \dots, e_n\}$ is a basis of $\mathbf{V}$. Indeed, in order to compute $f(v)$ from this information, we simply write $v$ as a linear combination of basis vectors,

$$v = x_1 e_1 + x_2 e_2 + \dots + x_n e_n,$$

and then compute

$$f(v) = f(x_1 e_1 + x_2 e_2 + \dots + x_n e_n) = x_1 f(e_1) + x_2 f(e_2) + \dots + x_n f(e_n).$$

Note that this has a very simple description in terms of the usual dot product in $\mathbb{R}^n$, namely

$$f(v) = (f(e_1), \dots, f(e_n)) \cdot (x_1, \dots, x_n).$$
Equivalently, the number $f(v)$ is computed as the product of a $1 \times n$ matrix and an $n \times 1$ matrix:

$$f(v) = \begin{bmatrix} f(e_1) & f(e_2) & \dots & f(e_n) \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{bmatrix}.$$
We can write this more succinctly as the matrix equation

$$f(v) = Ax,$$

where $A = \begin{bmatrix} f(e_1) & \dots & f(e_n) \end{bmatrix}$ and $x = \begin{bmatrix} x_1 & \dots & x_n \end{bmatrix}^T$ are the only things they could be based on context. The $1 \times n$ matrix $A$ in this equation is referred to as the matrix of the linear form $f$ relative to the basis $E$, and its entries are just the values of the form on each of the basis vectors. Not too complicated.
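For the computationally inclined, here is a quick numerical sanity check of this recipe in Python; the particular linear form and the numbers are hypothetical, chosen only for illustration:

```python
import numpy as np

# A hypothetical linear form on R^3, chosen just for illustration.
f = lambda v: v[0] + 2*v[1] + 3*v[2]

E = np.eye(3)                         # the standard basis e1, e2, e3
A = np.array([[f(e) for e in E]])     # 1 x n matrix [f(e1) ... f(en)]

x = np.array([5.0, -1.0, 2.0])        # coordinates of v relative to E
print(A @ x, f(x))                    # both evaluate to 9.0
```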
Essentially the same idea works for bilinear forms: in order to know how to compute the number $\langle v, w \rangle$ for any two vectors $v, w \in \mathbf{V}$, it is sufficient to know how to compute the $n^2$ numbers

$$\langle e_i, e_j \rangle, \quad 1 \leq i, j \leq n,$$

relative to a basis $E = \{e_1, \dots, e_n\}$. Indeed, if we have access to this table of numbers, then to compute $\langle v, w \rangle$ for given $v, w \in \mathbf{V}$, we first write these vectors as linear combinations of basis vectors,

$$v = x_1 e_1 + \dots + x_n e_n, \quad w = y_1 e_1 + \dots + y_n e_n,$$

and then calculate using bilinearity:

$$\langle v, w \rangle = \sum_{i=1}^{n} \sum_{j=1}^{n} x_i y_j \langle e_i, e_j \rangle.$$
Once again, the result of this calculation can be expressed in terms of matrices, namely as the product of three matrices: a $1 \times n$ matrix, an $n \times n$ matrix, and an $n \times 1$ matrix. Here's how this looks:

$$\langle v, w \rangle = \begin{bmatrix} x_1 & \dots & x_n \end{bmatrix} \begin{bmatrix} \langle e_1, e_1 \rangle & \dots & \langle e_1, e_n \rangle \\ \vdots & \ddots & \vdots \\ \langle e_n, e_1 \rangle & \dots & \langle e_n, e_n \rangle \end{bmatrix} \begin{bmatrix} y_1 \\ \vdots \\ y_n \end{bmatrix}.$$
This formula is often written

$$\langle v, w \rangle = x^T A y,$$

where the symbols are the only things they could possibly be based on context. In particular, the $n \times n$ matrix

$$A = \begin{bmatrix} \langle e_1, e_1 \rangle & \dots & \langle e_1, e_n \rangle \\ \vdots & \ddots & \vdots \\ \langle e_n, e_1 \rangle & \dots & \langle e_n, e_n \rangle \end{bmatrix}$$

is referred to as the matrix of the bilinear form $\langle \cdot, \cdot \rangle$ relative to the basis $E$.
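Here is the analogous numerical sketch for a bilinear form, again with a made-up form chosen only for illustration; it builds the matrix $A$ entry by entry and confirms that $x^T A y$ reproduces the form:

```python
import numpy as np

# A hypothetical bilinear form on R^3 (a 3-dimensional cousin of the
# Lorentz form above, purely for illustration).
form = lambda v, w: v[0]*w[0] + v[1]*w[1] - v[2]*w[2]

E = np.eye(3)
A = np.array([[form(ei, ej) for ej in E] for ei in E])  # matrix of the form

x = np.array([1.0, 2.0, 3.0])   # coordinates of v relative to E
y = np.array([4.0, 0.0, 1.0])   # coordinates of w relative to E
print(x @ A @ y, form(x, y))    # both evaluate to 1.0
```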
Now we come to the issue of dependence on the choice of basis. This is easily worked out for linear forms, but is a little more complex for bilinear forms.
Let $E = \{e_1, \dots, e_n\}$ and $\tilde{E} = \{\tilde{e}_1, \dots, \tilde{e}_n\}$ be two bases in the same vector space $\mathbf{V}$, and let $f$ be a linear form on $\mathbf{V}$. Let

$$A = \begin{bmatrix} f(e_1) & \dots & f(e_n) \end{bmatrix} \quad \text{and} \quad \tilde{A} = \begin{bmatrix} f(\tilde{e}_1) & \dots & f(\tilde{e}_n) \end{bmatrix}$$

be the matrices which represent the form relative to the two bases. We want to discern the relationship between these two matrices. We follow the same Marie Kondo approved "out with the old, in with the new" strategy as in Lecture 4: we write the vectors of the "old" basis $E$ as linear combinations of the vectors of the new basis $\tilde{E}$,

$$e_j = p_{1j} \tilde{e}_1 + p_{2j} \tilde{e}_2 + \dots + p_{nj} \tilde{e}_n, \quad 1 \leq j \leq n.$$

Now we evaluate the linear form on both sides of each of these vector equations, to get the scalar equations

$$f(e_j) = p_{1j} f(\tilde{e}_1) + p_{2j} f(\tilde{e}_2) + \dots + p_{nj} f(\tilde{e}_n), \quad 1 \leq j \leq n.$$

These scalar equations can be written as the single matrix equation

$$\begin{bmatrix} f(e_1) & \dots & f(e_n) \end{bmatrix} = \begin{bmatrix} f(\tilde{e}_1) & \dots & f(\tilde{e}_n) \end{bmatrix} \begin{bmatrix} p_{11} & \dots & p_{1n} \\ \vdots & \ddots & \vdots \\ p_{n1} & \dots & p_{nn} \end{bmatrix},$$

or more briefly as

$$A = \tilde{A} P,$$

where $P = [p_{ij}]_{i,j=1}^{n}$ is the change of basis matrix. And that's it: that's change of basis for linear forms.
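As a sanity check of the formula $A = \tilde{A} P$, here is a small numerical sketch; the basis and the form are hypothetical choices of mine:

```python
import numpy as np

f = lambda v: v[0] + 2*v[1] + 3*v[2]   # the same hypothetical form as above

# Take the "new" basis to be the standard basis of R^3, and the "old" basis
# to be the columns of P; then P is exactly the change of basis matrix.
P = np.array([[1.0, 1.0, 0.0],
              [0.0, 1.0, 1.0],
              [0.0, 0.0, 1.0]])

A_new = np.array([[f(e) for e in np.eye(3)]])   # [f(e~1) ... f(e~n)]
A_old = np.array([[f(e) for e in P.T]])         # [f(e1) ... f(en)]

print(np.allclose(A_old, A_new @ P))            # True
```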
Although the end result is slightly more complicated, the strategy for working out the relationship between the matrices $A$ and $\tilde{A}$ representing the same bilinear form $\langle \cdot, \cdot \rangle$ relative to two (possibly) different bases $E$ and $\tilde{E}$ is the same: out with the old, in with the new. Just as in the case of a linear form, the first step is to write

$$e_j = p_{1j} \tilde{e}_1 + p_{2j} \tilde{e}_2 + \dots + p_{nj} \tilde{e}_n, \quad 1 \leq j \leq n.$$

Now we consider the numbers $\langle e_i, e_j \rangle$. We have

$$\langle e_i, e_j \rangle = \left\langle \sum_{k=1}^{n} p_{ki} \tilde{e}_k, \sum_{l=1}^{n} p_{lj} \tilde{e}_l \right\rangle = \sum_{k=1}^{n} \sum_{l=1}^{n} p_{ki} p_{lj} \langle \tilde{e}_k, \tilde{e}_l \rangle.$$

Although it may take a little bit of experimentation (try it out for $n = 2$), the above is fairly easily seen to be equivalent to the matrix equation

$$A = P^T \tilde{A} P,$$

where $P^T$ is the transpose of the $n \times n$ change of basis matrix $P$.
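And here is the corresponding numerical check of $A = P^T \tilde{A} P$, reusing the same hypothetical form and change of basis matrix as in the sketches above:

```python
import numpy as np

form = lambda v, w: v[0]*w[0] + v[1]*w[1] - v[2]*w[2]  # same form as above

P = np.array([[1.0, 1.0, 0.0],   # old basis vectors as columns, expressed
              [0.0, 1.0, 1.0],   # in the new (standard) basis
              [0.0, 0.0, 1.0]])

# Matrix of the form relative to the basis given by the columns of B:
gram = lambda B: np.array([[form(bi, bj) for bj in B.T] for bi in B.T])

A_new = gram(np.eye(3))
A_old = gram(P)

print(np.allclose(A_old, P.T @ A_new @ P))   # True
```

Note the transpose on the left factor: it is what distinguishes the bilinear form rule from the linear form rule $A = \tilde{A} P$.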
That’s it for this lecture, and next time we will do more interesting things with bilinear forms, aka generalized scalar products. Although the above change of basis formulas are presented in any standard course in linear algebra, my personal opinion is that they aren’t too important. If you find them easy to remember, excellent; more important is the ability to re-derive them whenever you want, since this means you understand why they are what they are. My hope is that you will understand the meaning of linear and bilinear forms conceptually, which doesn’t require calculating their matrices relative to a particular basis.
To drive the above point home, let us close this lecture by remarking that there's no need to stop at bilinear forms. Why not keep going to trilinear forms? Indeed, for any $k \in \mathbb{N}$, one may define a $k$-linear form on a given vector space $\mathbf{V}$ to be any real-valued function of $k$ arguments on $\mathbf{V}$ which is a linear function of each argument. Conceptually, this isn't any more complicated than a bilinear form. However, to represent such a function we need to use a $k$-dimensional array of numbers, which is often referred to as a $k$-dimensional tensor. In particular, a $1$-dimensional tensor is a list, and a $2$-dimensional tensor is a matrix. In general, change of basis formulas for $k$-dimensional tensors are quite messy and not very meaningful.
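Still, evaluating a $k$-linear form from its tensor of coefficients is no harder than the $k = 2$ case; here is a small sketch for $k = 3$ (the array T and the function t are my own hypothetical illustration):

```python
import numpy as np

# A hypothetical 3-linear form on R^2, represented relative to the standard
# basis by a 3-dimensional array T with entries T[i,j,k] = t(e_i, e_j, e_k).
rng = np.random.default_rng(1)
T = rng.standard_normal((2, 2, 2))

def t(u, v, w):
    # Sum T[i,j,k] * u_i * v_j * w_k over all indices.
    return np.einsum('ijk,i,j,k->', T, u, v, w)

u, v, w = rng.standard_normal((3, 2))
# Spot check of linearity in the first argument:
print(np.isclose(t(2*u + v, v, w), 2*t(u, v, w) + t(v, v, w)))  # True
```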