Projection matrix

In linear algebra, a projection matrix is a matrix associated to a linear operator that maps vectors into their projections onto a subspace.

Table of contents

Preliminary notions
Projections
Oblique projections
Projection operator
Matrix of the projection operator
How to derive the projection matrix
Complementary projector
A matrix is idempotent iff it is a projection matrix
Solved exercises

Exercise 1

Preliminary notions

Let us start by reviewing some notions that are essential for understanding projections.

Let be a linear space. Let $S_{1}$ and $S_{2}$ be subspaces of .

Remember that the sum $S_{1}+S_{2}$ is the set

When $S_{1}$ and $S_{2}$ have only the zero vector in common (i.e., ), then the sum is called a direct sum and it is denoted by $S_{1}oplus S_{2}$ .

Moreover, when the direct sum is equal to the whole space, that is,we say that the two spaces are complementary.

As we have proved, when $S_{1}$ and $S_{2}$ are complementary, any vector belonging to can be uniquely written aswhere $s_{1}in S_{1}$ and $s_{2}in S_{2}$ .

Projections

After revising all these notions, we are ready to define projections.

Definition Let be a linear space. Let $S_{1}$ and $S_{2}$ be complementary subspaces (i.e., ). Let with its unique decompositionin which $s_{1}in S_{1}$ and $s_{2}in S_{2}$ . Then, the vector $s_{1}$ is called the projection of onto $S_{1}$ along $S_{2}$ , and the vector $s_{2}$ is called the projection of onto $S_{2}$ along $S_{1}$ .

We note that the locutions "along $S_{1}$ " and "along $S_{2}$ " are needed because the complement of a given subspace is not necessarily unique. For example, there may be another subspace $S_{3}$ that is complementary to $S_{1}$ . As a consequence, when we project a vector onto $S_{1}$ , we need to specify whether we are considering $S_{2}$ or $S_{3}$ as a complement of $S_{1}$ .

Example Let be the space of all real vectors. Let $S_{1}$ be the space spanned by [eq7] which contains all the scalar multiples of $b_{1}$ . Let $S_{2}$ be the space spanned by the two vectors [eq8] which contains all the linear combinations of $c_{1}$ and $c_{2}$ . We have that because no non-zero vector of $S_{1}$ can be written as a linear combination of $c_{1}$ and $c_{2}$ . Therefore, . Now, consider the vector [eq11] We have thatThus, the unique decomposition of iswhere $s_{1}=2b_{1}$ and $s_{2}=2c_{1}+c_{2}$ . The projection of onto $S_{1}$ along $S_{2}$ is [eq14] and the projection of onto $S_{2}$ along $S_{1}$ is [eq15]

Oblique projections

Projections as defined above are also sometimes called oblique projections in order to distinguish them from orthogonal projections, which are a particular kind of projection in which the two complementary subspaces $S_{1}$ and $S_{2}$ are orthogonal complements.

Projection operator

We now define projection operators.

Definition Let be a linear space and $S_{1}$ and $S_{2}$ two subspaces such that . The function that associates to each its projection onto $S_{1}$ along $S_{2}$ is called the projection operator onto $S_{1}$ along $S_{2}$ .

The first important property of the projection operator is that it is a linear operator, that is, it preserves addition and multiplication by scalars.

Proposition The projection operator onto $S_{1}$ along $S_{2}$ is a linear operator.

Proof

Arbitrarily choose two vectors . They have the unique decompositions [eq18] where and . Denote by $P_{S_{1},S_{2}}$ the projection operator onto $S_{1}$ along $S_{2}$ . Then, [eq21] Take any two scalars and and consider the linear combinationThen,Therefore,Since , , and were arbitrary, the latter equality implies that the projection operator is linear.

A couple of observations are in order:

the range of $P_{S_{1},S_{2}}$ (i.e., the subset of the codomain formed by all the values actually taken by $P_{S_{1},S_{2}}$ ) is $S_{1}$ ;
the kernel of $P_{S_{1},S_{2}}$ (i.e., the subset of the domain formed by all the vectors that are mapped to the zero vector) is $S_{2}$ .

Matrix of the projection operator

Let be a basis for . Any vector can be represented by its coordinate vector with respect to , denoted by . If can be written as a linear combination of the basis asthen [eq28]

Moreover, any linear operator can be represented by a square matrix, called matrix of the operator with respect to and denoted by , such that

In the case of a projection operator $P_{S_{1},S_{2}}$ , this implies that there is a square matrix that, once post-multiplied by the coordinates of a vector , gives the coordinates of the projection of onto $S_{1}$ along $S_{2}$ . Such a matrix is called a projection matrix (or a projector).

Definition The matrix of a projection operator with respect to a given basis is called a projection matrix.

How to derive the projection matrix

Now that we know what a projection matrix is, we can learn how to derive it.

In the lecture on complementary subspaces we have shown that, if is a basis for $S_{1}$ , is a basis for $S_{2}$ , and then is a basis for .

For the sake of legibility, denote the projection $P_{S_{1},S_{2}}$ simply by in what follows.

Note that projects:

vectors of into themselves (because their component from $S_{2}$ is equal to zero);
vectors of into the zero vector (because their component from $S_{1}$ is zero).

By applying the general rule for deriving the matrix of a linear operator, we obtain that [eq36] where is the identity matrix and the other blocks are zero matrices (in particular, the diagonal one is ).

In step we have used the fact that the coordinate vector of $b_{k}$ with respect to the basis (to which $b_{k}$ itself belongs, occupying the -th position) is a vector that has a single entry equal to (the -th) and all the other entries equal to .

Thus, the projection has an extremely simple structure: when we use it to project a vector on $S_{1}$ , we leave the coordinates corresponding to the basis of $S_{1}$ unchanged and we set all the other coordinates to zero.

However, in most cases we are not so lucky as to have coordinates already expressed with respect to . In such cases, we need to perform a change of basis (please revise how it works here).

Suppose that the basis used to express coordinates is . Then, the change-of-basis from to is [eq37]

The matrix of the projection operator with respect to the basis is

Example As in the previous example, we consider the space of all real vectors. Coordinates are naturally expressed with respect to the canonical basis where [eq40] The basis of $S_{1}$ is where [eq42] and the basis of $S_{2}$ is where [eq44] We have already argued that . As before, denote $P_{S_{1},S_{2}}$ (the projection operator onto $S_{1}$ along $S_{2}$ ) simply by . First of all, we have that [eq46] because the projection operator preserves the first coordinate and annihilates the other two (when coordinates are expressed with respect to ). The change-of-basis matrix that fits our purposes is [eq48] Its inverse is [eq49] The projection matrix under the canonical basis is [eq50] Let us compute the projection onto $S_{1}$ of the vector [eq11] We have done it already in the previous exercise, but this time we can use the projection matrix: [eq52] which is the same result we have derived previously.

Complementary projector

Once we have derived the projection matrix that allows us to project vectors onto $S_{1}$ , it is very easy to derive the matrix that allows us to project vectors onto the complementary subspace $S_{2}$ .

If a vector is decomposed as then we can write the projection onto $S_{2}$ asand its coordinates as [eq57]

Thus, the matrix of the projection operator onto $S_{2}$ , sometimes called complementary projector, is [eq58]

In the derivation above we have also seen that

Thus, we have that

A matrix is idempotent iff it is a projection matrix

A square matrix is said to be idempotent if and only if it is equal to its square:

It turns out that idempotent matrices and projection matrices are the same thing!

Proposition A matrix is idempotent if and only if it is a projection matrix.

Proof

Let us prove the "if part". We start from the hypothesis that is a projection matrix. As such, it is the matrix of a projection operator with respect to some basis , that is, . As proved above, we have that [eq63] Therefore, [eq64] which proves that is idempotent. Let us now prove the "only if" part, starting from the hypothesis that is idempotent. Suppose that is . Let be the space of all vectors. Define the two subspacesandIn other words, $S_{1}$ and $S_{2}$ are the range and kernel of the operator defined by the matrix . We have thatsince any vector can be written aswhere: $Asin S_{1}$ and becauseSuppose that a vector belongs to both $S_{1}$ and $S_{2}$ . Since $tin S_{1}$ , there exists such thatWe can pre-multiply both sides by and obtainSince $tin S_{2}$ , we have that . As a consequence, by equation (3), and by equation (2). Therefore,Hence,From equation (1) we know that projects into its component $Asin S_{1}$ . Therefore, it is the matrix of the projection operator that projects vectors of into $S_{1}$ along $S_{2}$ . Hence, is a projection matrix.

Solved exercises

Below you can find some exercises with explained solutions.

Exercise 1

Consider the projection problem analyzed in the previous two examples, where we have already derived the projection matrix of the projection operator onto $S_{1}$ . Derive the complementary projection matrix (onto $S_{2}$ ) and use it to find the projection onto $S_{2}$ of the vector [eq75]

Solution

Define . We have that [eq77] The projection of onto $S_{2}$ is [eq78]

How to cite

Please cite as:

Taboga, Marco (2021). "Projection matrix", Lectures on matrix algebra. https://www.statlect.com/matrix-algebra/projection-matrix.