Notes on Jordan Normal (Canonical) Form
Math 5718 - Spring 2001
We have seen several results that give conditions under which
certain special classes of matrices (operators) can be
diagonalized. Remember that diagonalizing a matrix requires
finding a basis with respect to which the matrix is diagonal.
Equivalently, it means finding a diagonal matrix that is similar
to the given matrix. For general
matrices with fewer
than n linearly independent eigenvectors, diagonalization is not
possible. The best we can do (the closest we can come to
representing the matrix in diagonal form) is the Jordan form. We
begin by stating the theorem; then we will look at examples to
understand how to interpret it.1
Theorem. An
matrix with s linearly
independent eigenvectors is similar to a matrix J that has the
form
where the diagonal blocks (called Jordan blocks) have the
form
and
is an eigenvalue of A. In the special case that A
has n linearly independent eigenvectors, the diagonal blocks are
,
and J is a strictly diagonal matrix. In the
general case, the diagonal blocks are ``nearly diagonal,'' and J
is block diagonal.
Let's look at a specific example. Suppose that A is a
matrix with eigenvalues
and
that is already
in Jordan form:
The position of the 1's on the off diagonal are important in
identifying the blocks. In this case, there are three blocks:
We might already anticipate that A has three linearly
independent eigenvectors, corresponding to the three diagonal
blocks; but we need to understand why the blocks have the sizes
they do and why the eigenvalue 0 appears in two different blocks.
We can learn a lot by examining the representation
J = P-1AP.
Suppose that the columns of P are denoted vi for
.
Writing
J = P-1AP in the form AP=PJ, we have
Looking at these products by columns, we see that
| Av1 = 4 v1 |
 |
Av2 = 4 v2+v1 |
(1) |
 |
 |
 |
(2) |
 |
 |
|
(3) |
It is clear that v1,v3, and v5 are ordinary eigenvectors,
as they satisfy
.
Notice that these eigenvectors
correspond to the upper left element of each Jordan block.
However, v2 and v4 do not satisfy the usual
eigenvector-eigenvalue relations; instead they are related to one
of the other ordinary eigenvectors. These ``near eigenvectors''
are called generalized eigenvectors. We say that v1 and
v2 form a string corresponding to equations
(
). The length of this string corresponds to the
dimensions of the first Jordan block. Similarly,
equations (
) define another string of length two
consisting of v3 and v4 (corresponding to the dimensions of
the second Jordan block). Finally, v5 forms a string of length
one.
The process of finding the Jordan form of a matrix A amounts to
finding the columns of P, which are the eigenvectors and
generalized eigenvectors of A. Said differently, finding the
Jordan form for a matrix A amounts to finding a basis with
respect to which A can be represented in Jordan form. Such a
basis is called a Jordan basis.
We have defined a generalized eigenvector as a vector vi that
satisfies a relation of the form
where vi-1 is an ordinary eigenvector. It's useful to make a
connection to an equivalent definition of generalized eigenvector
(the one used by Axler). If vi-1 is an ordinary eigenvector
corresponding to the eigenvalue
,
then we have
;
of course, it follows that
for all positive integers k. For a generalized eigenvector,
vi, we have
It follows that
for all positive integers
.
Thus, a generalized eigenvector can also be defined as
a vector for which
for all positive
integers
.
The integer m gives the dimension of the
Jordan block associated with
and vi-1.
These observations should make the following conclusions evident.
These statements cover the situation in which there may be more
than one generalized eigenvector associated with a particular
Jordan block. Suppose that J is an
Jordan block
with
on the diagonal. Then
-
for
,
and
(that is, v1 is an ordinary eigenvector and
are generalized eigenvectors).
-
,
but
for i<m.
Example: Another example should help clarify and extend
these ideas. Now consider the following matrix in Jordan form.
We see that J consists of three Jordan blocks, one of which is
and two of which are
.
Let's begin by
finding the effect of J on the standard basis vectors, ei, of
(for example,
e4=(0,0,0,1,0,0)). The first Jordan block
forms a string consists of e1,e2, and e3:
The second Jordan block forms a string consisting of e4 and
e5:
The third Jordan block forms a string consisting of e6 and
e7:
We see that J has two eigenvalues,
(corresponding to
e1 and e4) and
(corresponding to e6). Let's
recall some terminology already introduced in class. We say that
has algebraic multiplicity 5, as it appears on the
diagonal 5 times. (Alternatively, the characteristic polynomial
for J would have a term
.) Because
(J-4I)e1=0 and
(J-4I)e4=0, we see that null (J-4I) is spanned by
and dim null (J-4I) =2. Thus, the geometric
multiplicity of
is 2.
Similarly,
has algebraic multiplicity 2 and because null
(J+2I) is spanned by e6, dim null (J+2I) =1. Thus, the
geometric multiplicity of
is 1. Notice that if the
geometric multiplicity of an eigenvalue equals its algebraic
multiplicity, then all blocks associated with the eigenvalue are
.
If this equality holds for all eigenvalues of the
matrix, the Jordan form is strictly diagonal (the matrix is
diagonalizable).
A general rule emerges from these observations: The algebraic
multiplicity of an eigenvalue is the number of times that
eigenvalue appears on the diagonal, while the geometric
multiplicity of an eigenvalue is the number of different blocks
that contain the eigenvalue. But these observations are not enough
to tell us the exact size of each block; for example, an
eigenvalue with algebraic multiplicity 5 and geometric
multiplicity 2 could have blocks of size (1,4) or (2,3). So we
need to look again at the strings associated with the eigenvalue.
We will return to the previous example and look at the powers of
.
We have already seen that null (J-4I) is spanned
by
and has dimension 2. It is easy to check that
null (J-4I)2 is spanned by
and has
dimension 4. Equally important, these four vectors are related by
the strings
We also have that null (J-4I)3 is spanned by
and has dimension 5. These five vectors are related by the strings
Finally notice that for
,
null (J-4I)k is also spanned
by
and has dimension 5. We have now
accounted for all five eigenvectors and generalized eigenvectors
associated with
.
The lengths of the strings leading to
each eigenvector (3 and 2) give the size of the Jordan blocks.
Similarly, for
,
null (J+2I) is spanned by e6 and
has dimension 1, while null (J-4I)2 is spanned by
and has dimension 2. The relevant string is
For
,
null (J-4I)k is also spanned by
and has dimension 2. The length of this string gives us the size
of the Jordan block associated with
.
It's time to put all these ideas together and find the Jordan form
and a Jordan basis for a specific matrix A. The process consists
of four steps:
- Find the eigenvalues of A and their algebraic multiplicity.
- Determine the geometric multiplicity of the eigenvalues by finding dim null
.
- Form the strings associated with each ordinary eigenvector by examining the
powers of
.
- Use all this information to assemble the Jordan form and find a Jordan basis.
Example: Find the Jordan form for the
matrix2
The eigenvalues of this upper triangular matrix are on the
diagonal. We see that
has algebraic multiplicity 2 and
has algebraic multiplicity 3. To analyze
,
we
consider
 |
(4) |
Perhaps the easiest way to find the dimension of null (A-2I) is
to note that rank
(A-2I) = 4 (because there are four linearly
independent rows); therefore, dim null
(A-2I)=5-4=1 (note that
). A little work shows that
 |
(5) |
We see that (A-2I)2 has three linearly independent rows, which
means that rank
(A-2I)2=3 and dim null
(A-2I)2=5-3=2. Note
also that the upper left
block corresponding to
consists of zeros. One more calculation shows us that
We see that dim null
(A-2I)3 = 2, which is the same as dim null
(A-2I)2. Thus, we expect to have one ordinary eigenvector that
satisfies Av1=2v1 and one generalized eigenvector that
satisfies
Av2=2v2+v1. From this observation, we see that the
eigenvalue
has the single Jordan block
Let's also find the eigenvectors associated with this block. The
ordinary eigenvector satisfies Av1=2v1. Referring to
(
), we see that v1 is any multiple of
(1,0,0,0,0)T. The generalized eigenvector satisfies
Av2=2v2+v1, which gives us
v2=(0,1/5,0,0,0)T.
Alternatively, we could have first found v2 as any vector in
null (A-2I)2 that is not in null (A-2I). Using (
)
we see that v2 can be any multiple of
(0,1,0,0,0)T. We can
then use
Av2=2v2+v1 or
v1 = Av2-2v2 to find
v1=(5,0,0,0,0)T.
A similar analysis of the eigenvalue
requires
 |
(6) |
We see that rank (A+I)=2 and dim null
(A+I)=5-2=3. Thus, both
the algebraic and geometric multiplicity of
are 3. This
means that there are three
Jordan blocks associated
with
;
they are
J2=J3=J4 = [-1]. Alternatively, you
can check that dim null (A+I)2=3 which equals dim null (A+I).
The eigenvectors associated with
must form a basis for
null (A+I). Referring to (
), we see that null (A+I)
is spanned by
(0,0,1,0,0)T, (0,0,0,1,0)T, (-1,0,0,0,3)T.
Assembling all of these observations, the Jordan form for A is
The matrix P that gives the similarity transformation between
A and J has columns consisting of the eigenvectors:
You can verify that
J=P-1AP.
Example: Find the Jordan form for an
matrix
A with the following properties.
- dim null (A-3I)=2.
- dim null
(A-3I)2=4.
- dim null
(A-3I)k=5 for
.
- dim null (A+2I)=1.
- dim null
(A+2I)2=2.
- dim null
(A+2I)3=3.
- dim null
(A+2I)k=4 for
.
- dim null (A-I)=2.
- dim null (A-I)k=2 for
.
We see that there are three distinct eigenvalues,
,
and
.
For each eigenvalue, we must at least
count the ordinary and generalized eigenvectors.
: Because dim null (A-3I)=2, the geometric
multiplicity of
is 2 and we should expect two Jordan
blocks. We also expect to have two ordinary eigenvectors for this
eigenvalue; call them v1 and v4 . To determine the size of
the blocks, we must look at higher powers of (A-3I). Because dim
null
(A-3I)2=4, there are four eigenvectors that span
(A-3I)2: the two ordinary eigenvectors and two new generalized
eigenvectors. Call the new generalized eigenvectors v2 and
v5. Now note that dim null
(A-3I)3=5, so there is another
generalized eigenvector; call it v3. Because dim null
(A-3I)k=5 for
,
there are no additional generalized
eigenvectors. This analysis produces two strings associated with
;
they are
Knowing that there are five eigenvectors associated with
,
we conclude that 3 will appear on the diagonal of the
Jordan form 5 times and that the algebraic multiplicity of
is 5.
: We see that the geometric multiplicity of
is 1, so we will have one Jordan block corresponding to
and one ordinary eigenvector; call it v1. To find the
size of the block, we need to find the number of generalized
eigenvectors. The fact that dim null
(A+2I)2=2 means that there
is a generalized eigenvector that satisfies
(A+2I)v2 = v1 or
(A+2I)2v2=0. The fact that dim null
(A+2I)3=3 means that
there is another generalized eigenvector that satisfies
(A+2I)v3
= v2 or
(A+2I)3v3=0. And the fact that dim null
(A+2I)4=4
means that there is another generalized eigenvector that satisfies
(A+2I)v4 = v3 or
(A+2I)4v4=0. Because dim null
(A+2I)k=4 for
,
there are no more generalized
eigenvectors. The string we have uncovered here is
Thus, the Jordan block corresponding to
is
,
and the algebraic multiplicity of
is 4.
: The geometric multiplicity of
is
2, meaning there are two Jordan blocks and two ordinary
eigenvectors. The fact that dim null (A-I)2=2 (no increase over
dim null (A-I)2) means that there are no additional generalized
eigenvectors and the algebraic multiplicity is also 2. Thus, there
are two
Jordan blocks corresponding to
.
Assembling all of this information, we conclude that the Jordan
form for A is
Note that the Jordan blocks have sizes
3, 2, 4, 1, 1. The Jordan
form is unique up to ordering of the blocks.