Matrix diagonalization

Matrix diagonalization is the process of performing a similarity transformation on a matrix in order to recover a similar matrix that is diagonal (i.e., all its non-diagonal entries are zero).

Once a matrix is diagonalized it becomes very easy to raise it to integer powers.

Not all matrices are diagonalizable. The diagonalizable matrices are those that have no defective eigenvalues (i.e., eigenvalues whose geometric multiplicity is less than their algebraic multiplicity).

Table of contents

Similarity transformations
Diagonalizable matrix
Relation to eigenvalues and eigenvectors
How to diagonalize a matrix
The diagonalization is not unique
The most important application
Inverse matrix
Solved exercises
1. Exercise 1

Similarity transformations

Remember that two square matrices and are said to be similar if there exists an invertible matrix such that

If two matrices are similar, then they have the same rank, trace, determinant and eigenvalues. Not only two similar matrices have the same eigenvalues, but their eigenvalues have the same algebraic and geometric multiplicities.

Diagonalizable matrix

We can now provide a definition of diagonalizable matrix.

Definition Let be a matrix. We say that is diagonalizable if and only if it is similar to a diagonal matrix.

In other words, when is diagonalizable, then there exists an invertible matrix such thatwhere is a diagonal matrix, that is, a matrix whose non-diagonal entries are zero.

Example Define the matrix [eq3] and [eq4] The inverse of is [eq5] The similarity transformation [eq6] gives the diagonal matrix as a result. Hence, is diagonalizable.

Relation to eigenvalues and eigenvectors

We can write the diagonalization $D=P^{-1}AP$ as

The -th column of is equal towhere $P_{ullet k}$ is the -th column of (if you are puzzled, revise the lecture on matrix multiplication and linear combinations).

The -th column of is equal to where $D_{ullet k}$ is the -th column of .

In turn, $PD_{ullet k}$ is a linear combination of the columns of with coefficients taken from the vector $D_{ullet k}$ .

Since is diagonal, the only non-zero entry of $D_{ullet k}$ is $D_{kk}$ . Therefore,

Thus, we have arrived at the conclusion that

The latter equality means that $P_{ullet k}$ is an eigenvector of associated to the eigenvalue $D_{kk}$ .

This is true for . Thus, the diagonal elements of are the eigenvalues of and the columns of are the corresponding eigenvectors.

The matrix used in the diagonalization must be invertible. Therefore, its columns must be linearly independent. Stated differently, there must be linearly independent eigenvectors of .

In the lecture on the linear independence of eigenvectors, we have discussed the fact that, for some matrices, called defective matrices, it is not possible to find linearly independent eigenvectors. A matrix is defective when it has at least one repeated eigenvalue whose geometric multiplicity is strictly less than its algebraic multiplicity (called a defective eigenvalue).

Therefore, defective matrices cannot be diagonalized.

The next proposition summarizes what we have discussed thus far.

Proposition A matrix is diagonalizable if and only if it does not have any defective eigenvalue.

Proof

We have already proved the "only if" part because we have shown above that, if is diagonalizable, then it possesses linearly independent eigenvectors, which implies that no eigenvalue is defective. The "if" part is simple. If possesses linearly independent eigenvectors, then we can adjoin them to form the full-rank matrix and we can form a diagonal matrix whose diagonal elements are equal to the corresponding eigenvalues. Then, by the definition of eigenvalues and eigenvectors, we have that and the diagonalization of follows.

Remember that if all the eigenvalues of are distinct, then does not have any defective eigenvalue. Therefore, possessing distinct eigenvalues is a sufficient condition for diagonalizability.

How to diagonalize a matrix

Suppose we are given a matrix and we are told to diagonalize it. How do we do it?

The answer has already been given in the previous proof, but it is worth repeating.

We provide the answer as a recipe for diagonalization:

Compute the eigenvalues of .
Check that no eigenvalue is defective. If any eigenvalue is defective, then the matrix cannot be diagonalized. Otherwise, you can go to the next step.
For each eigenvalue, find as many linearly independent eigenvectors as you can (their number is equal to the geometric multiplicity of the eigenvalue).
Adjoin all the eigenvectors so as to form a full-rank matrix .
Build a diagonal matrix whose diagonal elements are the eigenvalues of .
The diagonalization is done: $D=P^{-1}AP$ .

Importantly, we need to follow the same order when we build and : if a certain eigenvalue has been put at the intersection of the -th column and the -th row of , then its corresponding eigenvector must be placed in the -th column of .

Example Define the matrix [eq13] The eigenvalues solve the characteristic equationLet us compute the determinant [eq15] Thus, there are two eigenvalues $lambda _{1}=-1$ and $lambda _{2}=2$ . There are no repeated eigenvalues and, as a consequence, no defective eigenvalues. Therefore, is diagonalizable. The eigenvectors $x_{1}$ associated to $lambda _{1}$ solveSince [eq17] we can choose, for example, [eq18] Moreover, [eq19] so we can choose, as an eigenvector associated to $lambda _{2}$ , the following vector: [eq20] Therefore, the diagonal matrix of eigenvalues is [eq21] and the invertible matrix of eigenvectors is [eq22]

The diagonalization is not unique

Provided a matrix is diagonalizable, there is no unique way to diagonalize it.

For example, we can change the order in which the eigenvalues are put on the diagonal of . Or we can replace a column of with a scalar multiple of itself (which is another eigenvector associated to the same eigenvalue). If there is a repeated eigenvalue, we can choose a different basis for its eigenspace.

Example For instance, in the previous example, we could have defined [eq23] and [eq24] Another possibility would have been to choose [eq25] and [eq26]

The most important application

The most important application of diagonalization is the computation of matrix powers.

Let be a diagonal matrix: [eq27]

Then its -th power can be easily computed by raising its diagonal elements to the -th power: [eq28]

If a matrix is diagonalizable, then and [eq30]

Thus, all we have to do to raise to the -th power is to 1) diagonalize (if possible); 2) raise the diagonal matrix to the -th power, which is very easy to do; 3) pre-multiply the matrix $D^{n ext{ }}$ thus obtained by and post-multiply it by $P^{-1}$ .

Inverse matrix

Once a matrix has been diagonalized it is straightforward to compute its inverse (if it exists).

In fact, we have thatwhere [eq32]

Solved exercises

Below you can find some exercises with explained solutions.

Exercise 1

Suppose that a matrix can be diagonalized as where [eq34] [eq35] Suppose that $a^{2}+b^{2}=1$ . Show thatand compute $A^{4}$ .

Solution

First of all, let us check that $P^{-1}=P^{ op }$ : [eq37] We can easily compute powers of : [eq38]

How to cite

Please cite as:

Taboga, Marco (2021). "Matrix diagonalization", Lectures on matrix algebra. https://www.statlect.com/matrix-algebra/matrix-diagonalization.