Invariant subspace

A subspace is said to be invariant under a linear operator if its elements are transformed by the linear operator into elements belonging to the subspace itself.

The kernel of an operator, its range and the eigenspace associated to the eigenvalue of a matrix are prominent examples of invariant subspaces.

The search for invariant subspaces is one of the most important themes in linear algebra. The reason is simple: as we will see below, the matrix representation of an operator with respect to a basis is greatly simplified (i.e., it becomes block-triangular or block-diagonal) if some of the vectors of the basis span an invariant subspace.

Table of contents

Definition
The kernel of an operator is invariant
The range of an operator is invariant
The eigenspace of an eigenvalue is invariant
Block-triangular matrices
Direct sums of invariant subspaces
More than two subspaces
Practical implications
Summary of the workflow
Eigenvalues and eigenvectors again
Solved exercises
1. Exercise 1
2. Exercise 2

Definition

Remember that, given a vector space , a linear operator is a function that preserves linear combinations, that is,for any couple of vectors $s_{1},s_{2}in S$ and any couple of scalars .

Definition Let be a vector space and a linear operator. Let be a subspace of . We say that is invariant under if and only if for any .

In other words, if is invariant under , then the restriction of to , denoted by , is a linear operator on (i.e., ).

Example Let be the space of vectors. Let be the subspace spanned by the vector [eq6] In other words, all the vectors of have the form [eq7] where is a scalar. Suppose that a linear operator is such that [eq8] Then, whenever , we have . Therefore, is an invariant subspace under .

The kernel of an operator is invariant

The kernel of a linear operator is the subspace

Since and all the elements of are mapped into by the operator , the kernel is invariant under .

The range of an operator is invariant

The range of a linear operator is the subspace

Since , any is mapped by into . Therefore, the range is invariant.

The eigenspace of an eigenvalue is invariant

Let be the space of vectors. Let be a matrix. We can use the matrix to define a linear operator as follows:

Suppose is an eigenvalue of and is the subspace of containing all the eigenvectors associated to (so-called eigenspace).

By the definition of eigenvector, we havefor any . Since is a subspace, . Therefore, the eigenspace is invariant under .

Block-triangular matrices

There is a tight link between invariant subspaces and block-triangular matrices.

In order to understand this link, we need to revise some facts about linear operators.

Let be a finite-dimensional vector space and a basis for .

If can be written as a linear combination of the basis asthen its coordinate vector with respect to is [eq19]

Remember that any operator has an associated matrix, called matrix of the operator with respect to and denoted by , such that, for any , we havewhere and are respectively the coordinate vectors of and with respect to .

We have previously proved that the matrix of the operator has the following structure: [eq24]

We are now ready to state the main proposition in this lecture.

Proposition Let be a finite-dimensional vector space and a linear operator. Let be a subspace of and a basis for . Complete $B_{U}$ so as to form a basis for . The subspace is invariant under if and only if has the block-triangular structure [eq28] where the block $arphi _{11}$ is , $arphi _{12}$ is , $arphi _{22}$ is and denotes a block of zeros.

Proof

We first prove the "only if part", starting from the hypothesis that is invariant. Denote by $f_{kl}$ the -th entry of . Since is invariant, then, for , belongs to and, as a consequence, it can be written as a linear combination of (and enter with zero coefficient in the linear combination). Therefore, for , the coordinate vector of is [eq37] As a consequence, when is invariant, the matrix of the operator is [eq38] We now prove the "if part", starting from the hypothesis that has the assumed block-triangular structure. Any vector has a coordinate vector of the form [eq40] where $u_{1}$ is and is . Then, [eq42] Therefore, . Since this is true for any , is an invariant subspace.

We can also write [eq44] where is the matrix of the restriction of to with respect to the basis $B_{U}$ .

Direct sums of invariant subspaces

Remember that is said to be the sum of subspaces , in which case we writeif and only if

A sum of subspaces like the one just shown is said to be a direct sum and it is denoted byif and only if are linearly independent whenever $u_{j}in U_{j}$ and $u_{j} eq 0$ for .

Direct sums of invariant subspaces have the following important property.

Proposition Let be a linear space. Let $U_{1}$ and $U_{2}$ be subspaces of such thatLet and be bases for $U_{1}$ and $U_{2}$ respectively (as a consequence, $B=B_{1}cup B_{2}$ is a basis for ). Let be a linear operator. Then, $U_{1}$ and $U_{2}$ are both invariant under if and only if has the block-diagonal structure [eq55] where the blocks $arphi _{11}$ and $arphi _{22}$ are and respectively.

Proof

We first prove the "only if" part, starting from the hypothesis that $U_{1}$ and $U_{2}$ are both invariant under . By the properties of direct sums, any vector has a unique representationwhere $u_{1}in U_{1}$ and $u_{2}in U_{2}$ . Moreover, $u_{j}$ has a unique representation in terms of the basis $B_{j}$ (for ). Therefore, any can be written as a linear combination of the vectors of . In other words, is a basis for . The first columns of are [eq59] Since $U_{1}$ is invariant under , $b_{k}in U_{1}$ implies that . Therefore, for , can be written as a linear combination of the vectors of $B_{1}$ and the first columns of are [eq63] Similarly, we can demonstrate that the remaining columns of are [eq65] Thus, [eq66] which is a block-diagonal matrix with the structure described in the proposition. We now prove the "if" part, starting from the hypothesis that is block diagonal. Since is block-upper triangular, $U_{1}$ is invariant by the proposition above on block-upper triangular matrices. Moreover, any vector $uin U_{2}$ has a coordinate vector of the form [eq69] where $u_{2}$ is and is . Then, [eq71] Therefore, . Since this is true for any $uin U_{2}$ , $U_{2}$ is an invariant subspace.

More than two subspaces

The previous proposition can be extended, by applying it recursively, to the case in which and all the subspaces $U_{j}$ are invariant.

Proposition Let be a linear space. Let $U_{1}$ , $U_{2}$ , ..., $U_{m}$ be subspaces of , with bases $B_{1}$ , ..., $B_{m}$ , and such thatso that, as a consequence, [eq75] is a basis for . Let be a linear operator. Then, all the sets $U_{j}$ (for ) are invariant under if and only if has the block-diagonal structure [eq77]

Practical implications

What are the practical implications of everything that we have shown so far? In particular, what happens when we are dealing with linear operators defined by matrices? We provide some answers in this section.

Let be a matrix. Let be the space of all column vectors.

We consider the linear operator defined by the matrix , that is,for any .

Suppose that we have been able to find two invariant subspaces $U_{1}$ and $U_{2}$ such that

In other words, [eq80] and $u_{1}$ is linearly independent from $u_{2}$ whenever $u_{1}in U_{1}$ , $u_{2}in U_{2}$ and the two vectors are non-zero.

We can choose bases and for $U_{1}$ and $U_{2}$ respectively and we know that $B=B_{1}cup B_{2}$ is a basis for .

Define the following matrices by adjoining the vectors of the basis: [eq83]

Note that [eq84] where are vectors that are guaranteed to exist because $Ab_{j}in U_{1}$ for by the invariance of $U_{1}$ , which means that $Ab_{j}$ can be written as a linear combination of the basis of $U_{1}$ (i.e., $Ab_{j}=V_{1}h_{j}$ ). In order to match the notation used in the propositions above, we defineso that

Similarly, we can find a matrix $arphi _{22}$ such that

As a consequence, we have [eq89] or [eq90] where is invertible because its columns are vectors of a basis, which are by definition linearly independent.

Recall the definition of matrix similarity. The last equation means that is similar to the block-diagonal matrix [eq91] and the change-of-basis matrix used in the similarity transformation is .

Summary of the workflow

Thus, the process of similarity transformation of a matrix into a block-diagonal matrix (generalized here to the case of more than two invariant subspaces) works as follows:

we identify invariant subspaces such that
we find bases for the invariant subspaces and we use them to construct the matrixwhere the columns of $V_{j}$ are the vectors of the basis of $U_{j}$ (for );
we perform the similarity transformationand the matrix turns out to be block-diagonal. In particular, there are blocks on the diagonal and the dimensions of the blocks are equal to the number of columns of the matrices (i.e., the number of vectors in each of the bases).

This is one of the most important workflows in linear algebra! We encourage the reader to solidly understand and memorize it.

Eigenvalues and eigenvectors again

We have explained above that the eigenspace associated to an eigenvalue of is an invariant subset.

Denote by the distinct eigenvalues of and by their respective eigenspaces.

As explained in the lecture on the linear independence of eigenvectors, when is not defective, we can form a matrixwhere the columns of $V_{j}$ are a basis for $U_{j}$ and all the columns of together are a basis for the space of all column vectors. As a consequence,

Thus, we can use the matrix of eigenvectors to perform a similarity transformation and obtain the block-diagonal matrix

Actually, in the lecture on matrix diagonalization, we have proved that is a diagonal matrix having the eigenvalues on its main diagonal.

Solved exercises

Below you can find some exercises with explained solutions.

Exercise 1

Define the matrix [eq102]

Verify that [eq103] is an invariant subspace under the linear transformation defined by .

Solution

Any vector takes the form [eq104] where is a scalar. Then, [eq105] As a consequence, and is an invariant subspace.

Exercise 2

Let be the space of column vectors. Define the matrix [eq106]

By simply inspecting , can you find two subspaces $U_{1}$ and $U_{2}$ such thatand $U_{1}$ and $U_{2}$ are invariant under the linear transformation defined by ?

Solution

Note that is a block-diagonal matrix: [eq108] where [eq109] Therefore, two complementary invariant subspaces are [eq110]

How to cite

Please cite as:

Taboga, Marco (2021). "Invariant subspace", Lectures on matrix algebra. https://www.statlect.com/matrix-algebra/invariant-subspace.