Around the year A.D. 825 the Arabian mathematician Mohammed ibn
M
s
al-Khow
rizm
illustrated the solution for the roots of an
arbitrary
second degree polynomial. Today, we know this formula as the
quadratic equation which says that the roots of
can be found as
This is an example of what is called a solution ``by radicals.''
In other words, the formula for the roots of a polynomial can be found
using an equation (using addition, subtraction, mutliplication,
division, and extraction of roots) involving the coefficients of the
polynomial. There are of course two other famous formulas for the roots of
polynomials. A solution by radicals to the cubic and quartic were published by
Cardano in 1545, attributing the former to Tartaglia and the latter to
Ferrari. Then in 1824, the Norwegian mathematician Niels Abel
demonstrated that no solution by radicals exists for any quintic or
higher degree polynomial.
Unfortunately, the formulas for the cubic and quartic are extremely
complicated and even more difficult to remember. In these notes we
show an alternate method, which connects matrices, their eigenvalues,
and roots of unity, giving a unified approach to finding roots of
qaurtic or smaller degree polynomials. We start with a definition.
The most basic circulant matrix is known as a circulant generator
matrix, which is a circulant matrix having first row
. For example, the
circulant
generator is the matrix
Then to generate the
circulant matrix with first row
we evaluate the function
at
.
Notice that the polynomial is cubic and has degree one less than the
size of the matrix. For example, suppose we want to generate the
circulant matrix with first row
. We
evaluate
at
as
![]() |
|||
![]() |
|||
![]() |
It is these matrices and their eigenvalues that we will use to find
the roots of small degree polynomials. You might ask why we want to
generate the desired circulant matrix by evaluating the appropriate function
at
, as this seems like more work than
necessary. But this perpective will help us when computing the
eigenvalues of a given circulant matrix. One first needs to realize
that if for some matrix
and vector
we have
, then it is
also true that for any polynomial
we have
. In other words, if
is an eigenvalue
of the matrix
, then
is an eigenvalue for the matrix
. Lets convince ourselves with an example. Choose the matrix
![]() |
|||
![]() |
|||
![]() |
The claim is that the eigenvalues of
are just
for
each eigenvalue of
. Clearly the eigenvalues of
are
. Then taking the eigenvalues of
, for
we get
, and for
we get
.
How does this help us with circulant matrices? If we know the
eigenvalues of the circulant generator
, to find the eigenvalues of
the circulant matrix
we simply evaluate the polynomial
at the eignvalues of
. That is, suppose that
is the
circulant matrix with first row
. That means
that
, where
. So if
is an eigenvalue of the
circulant generator
, then
is an eigenvalue of the
circulant matrix
. Without computation, we state the
eigenvalues of the
,
, and
circulant
generators.
![]() |
|||
![]() |
![]() |
||
![]() |
We can now easily compute the eigenvalues of the following circulant matrix.
Looking at the first row we see that
, where
. Since we already know the eigenvalues of
the eigenvalues of
are found as
, for each eigenvalue
of
.
We can now give a formula for the eigenvalues of an arbitrary circulant matrix.
![]() |
|||
![]() |
|||
where![]() |
|||
![]() |
We are now ready to find the roots of quadratic, cubic, and quartic
polynomials. We start with an easy example. Suppose we have
. We first divide by through the coefficient on
to
get the monic polynomial
(which has the same roots as
). To find its roots we simply must find a
circulant matrix
with coefficients
such that the
characteristic polynomial of
equals the polynomial
.
Remember the eigenvalues of a matrix
are the roots of its
characteristic polynomial. Then, for the appropriate choices
we
evaluate
at
and these will be the roots of
, which are also the roots of the original
.
The characteristic polynomial of the circulant
is
![]() |
|||
We must have
and
. So
and
. It follows that the correct circulant matrix is
So, the polynomial is
, and evaluated at
we get
the eigenvalues of
as
. Or, using the
computations from the previous page, the roots of
are
. We can check this with the quadratic formula.
Since we already have the formula for the quadratic memorized, we move onto the cubic. We start (for reasons to be described later) with the following general ``reduced'' cubic
Before we work on the cubic we state the following useful theorem.
Next, recall that for any monic polynomial of degree
, the sum of the
roots equals the coefficient on the
term. So, if
, then for each root
of
we have
Notice that the reduced cubic does not have a squared term. That is,
the sum of the roots of the cubic
equals
zero. But, the diagonal entries of the appropriate circulant matrix
will sum to zero, and since all its diagonal entries are the same we
must have a circulant matrix with the form
If we can determine the values
such that the characteristic
polynomial equals
, the the eigenvalues of
, which are the roots of
, are found by evaluating
at the cubic roots of unity.
The characteristic polynomial of
is
.
So we need to find
such that
To solve this system (which is not linear) we observe that
and
are both roots of
. It follows,
using the quadratic equation, that
At this point we must be careful. If
then
we let
![]() |
|||
![]() |
However, if these values are complex we need to remember that roots of complex numbers are not necessarily unique. So, for any of the possible cube roots, we let
To complete the computation we get, when
, the three roots of
:
![]() |
|||
![]() |
Although the solution is not in a ``nice'' form, the road to the
solution relies only on basic theory of linear algebra, and especially
on the ease of computing the eigenvalues of circulant matrices. We
now extend this to the general cubic
.
We can ``reduce'' the general cubic by making the substitution
. This substitution eliminates the quadratic term.
Then after solving this reduced cubic, we easily find the roots of the
original cubic by back-substitution.
Although we do not go into the details here, this same approach can be
extended to the general quartic
. First, make the substitution,
to eliminate
the cubic term and see that the circulant matrix will have the form
The steps from here involve solving a cubic, an approach we have
already covered.
Unfortunately, for higher degree polynomials, this
approach using circulant matrices fails in general. This was proven,
somewhat indirectly, by Abel and Galois in the 1800's. Most of the
techniques for solving for roots of higher degree polynomials rely on
approximation techniques such as Newton's method, although roots of
certain classes of polynomials can be found relatively easily. However,
what we have seen here is that the theory of circulant matrices gives
a fascinating connection between matrix theory and polynomials.