The orthogonal projection of a vector onto a given subspace is the vector of the subspace that is closest to the original vector.
Before explaining orthogonal projections, we are going to revise some important concepts.

Let $S$ be a vector space. Remember that two vectors $x$ and $y$ belonging to $S$ are orthogonal when their inner product is zero:
$$\langle x, y \rangle = 0.$$
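For example, in the space of $2 \times 1$ real vectors with the usual inner product, the vectors $x = \begin{bmatrix} 1 \\ 2 \end{bmatrix}$ and $y = \begin{bmatrix} 2 \\ -1 \end{bmatrix}$ are orthogonal because
$$\langle x, y \rangle = 1 \cdot 2 + 2 \cdot (-1) = 0.$$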
Let $S_1$ be a subspace of $S$. The orthogonal complement of $S_1$, denoted by $S_1^{\perp}$, is the subspace formed by all the vectors of $S$ that are orthogonal to every vector of $S_1$:
$$S_1^{\perp} = \left\{ y \in S : \langle y, x \rangle = 0 \text{ for all } x \in S_1 \right\}.$$
The two subspaces $S_1$ and $S_1^{\perp}$ are complementary subspaces, which means that
$$S = S_1 \oplus S_1^{\perp},$$
where $\oplus$ denotes a direct sum. By the properties of direct sums, any vector $x \in S$ can be uniquely written as
$$x = x_1 + x_2,$$
where $x_1 \in S_1$ and $x_2 \in S_1^{\perp}$.
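As a simple illustration, consider the space of $2 \times 1$ real vectors: if $S_1$ is the subspace of vectors whose second entry is zero, then $S_1^{\perp}$ is the subspace of vectors whose first entry is zero, and
$$\begin{bmatrix} 3 \\ 4 \end{bmatrix} = \underbrace{\begin{bmatrix} 3 \\ 0 \end{bmatrix}}_{\in S_1} + \underbrace{\begin{bmatrix} 0 \\ 4 \end{bmatrix}}_{\in S_1^{\perp}}$$
is the unique decomposition of the vector into a sum of a vector of $S_1$ and a vector of $S_1^{\perp}$.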
We can now define orthogonal projections.
Definition
Let $S$ be a linear space. Let $S_1$ be a subspace of $S$ and $S_1^{\perp}$ its orthogonal complement. Let $x \in S$, with its unique decomposition
$$x = x_1 + x_2,$$
in which $x_1 \in S_1$ and $x_2 \in S_1^{\perp}$. Then, the vector $x_1$ is called the orthogonal projection of $x$ onto $S_1$ and it is denoted by $P_{S_1} x$.
Thus, the orthogonal projection is a special case of the so-called oblique projection, which is defined as above, but without the requirement that the complementary subspace of $S_1$ be an orthogonal complement.
Example
Let $S$ be the space of $3 \times 1$ real column vectors. Define
$$S_1 = \operatorname{span}\left( \begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 0 \\ 1 \\ 0 \end{bmatrix} \right).$$
Its orthogonal complement is
$$S_1^{\perp} = \operatorname{span}\left( \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} \right),$$
as we can easily verify by checking that the vector spanning $S_1^{\perp}$ is orthogonal to the two vectors spanning $S_1$. Now, consider the vector
$$x = \begin{bmatrix} 1 \\ 2 \\ 3 \end{bmatrix}.$$
Then,
$$P_{S_1} x = \begin{bmatrix} 1 \\ 2 \\ 0 \end{bmatrix}, \qquad x - P_{S_1} x = \begin{bmatrix} 0 \\ 0 \\ 3 \end{bmatrix} \in S_1^{\perp}.$$
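As a quick numerical check of this example, here is a minimal NumPy sketch; it uses np.linalg.lstsq to find the coefficients of the vector of $S_1$ closest to $x$, anticipating the proposition below.

```python
import numpy as np

# Columns of B1 span S_1; x is the vector to be projected.
B1 = np.array([[1.0, 0.0],
               [0.0, 1.0],
               [0.0, 0.0]])
x = np.array([1.0, 2.0, 3.0])

# Least squares finds the coefficients of the vector of S_1 closest to x.
coeffs, *_ = np.linalg.lstsq(B1, x, rcond=None)
x1 = B1 @ coeffs          # orthogonal projection of x onto S_1
x2 = x - x1               # component in the orthogonal complement

print(x1)                         # [1. 2. 0.]
print(x2)                         # [0. 0. 3.]
print(np.allclose(B1.T @ x2, 0))  # x2 is orthogonal to S_1 -> True
```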
The distance between two vectors is measured by the norm of their difference. It turns out that $P_{S_1} x$ is the vector of $S_1$ that is closest to $x$.
Proposition
Let $S$ be a finite-dimensional vector space. Let $S_1$ be a subspace of $S$. Then, for any $x \in S$,
$$\left\| x - P_{S_1} x \right\| \leq \left\| x - s \right\| \quad \text{for all } s \in S_1.$$
Proof
Since
$$x = P_{S_1} x + x_2,$$
where $x_2 \in S_1^{\perp}$, the vector
$$x - P_{S_1} x = x_2$$
belongs to $S_1^{\perp}$ and, as a consequence, is orthogonal to any vector belonging to $S_1$, including the vector $P_{S_1} x - s$ for any $s \in S_1$. Therefore,
$$\begin{aligned}
\left\| x - s \right\|^2 &= \left\| \left( x - P_{S_1} x \right) + \left( P_{S_1} x - s \right) \right\|^2 \\
&\overset{(A)}{=} \left\| x - P_{S_1} x \right\|^2 + \left\| P_{S_1} x - s \right\|^2 \\
&\geq \left\| x - P_{S_1} x \right\|^2,
\end{aligned}$$
where in step $(A)$ we have used Pythagoras' theorem. By taking the square root of both sides, we obtain the stated result.
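The proposition can also be illustrated numerically. The sketch below uses a randomly chosen subspace and vector, and the projection-matrix formula derived later in this lecture; it compares the distance between $x$ and its projection with the distance between $x$ and other vectors of $S_1$.

```python
import numpy as np

rng = np.random.default_rng(0)

# A 2-dimensional subspace S_1 of R^5 spanned by the columns of B1, and a vector x.
B1 = rng.standard_normal((5, 2))
x = rng.standard_normal(5)

# Orthogonal projection of x onto S_1.
P = B1 @ np.linalg.inv(B1.T @ B1) @ B1.T
proj = P @ x

# Distance from x to its projection vs. distance to 1,000 other vectors of S_1.
other = B1 @ rng.standard_normal((2, 1000))          # random vectors of S_1 (columns)
distances = np.linalg.norm(x[:, None] - other, axis=0)

print(np.linalg.norm(x - proj) <= distances.min())   # True
```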
Suppose that $S$ is the space of $K \times 1$ complex vectors and $S_1$ is a subspace of $S$. By the results demonstrated in the lecture on projection matrices (which are valid for oblique projections and, hence, for the special case of orthogonal projections), there exists a projection matrix $P$ such that
$$P_{S_1} x = P x$$
for any $x \in S$.
The projection matrix $P$ is
$$P = \begin{bmatrix} B_1 & 0 \end{bmatrix} \begin{bmatrix} B_1 & B_2 \end{bmatrix}^{-1},$$
where:
$B_1$ is any matrix whose columns form a basis for $S_1$;
$B_2$ is any matrix whose columns form a basis for $S_1^{\perp}$.
In the case of orthogonal projections, the formula above becomes simpler.
Proposition
Let $S$ be the space of $K \times 1$ complex vectors. Let $S_1$ be a subspace of $S$. Let $B_1$ be a matrix whose columns form a basis for $S_1$. Denote by $B_1^{*}$ the conjugate transpose of $B_1$. Then, the matrix
$$P = B_1 \left( B_1^{*} B_1 \right)^{-1} B_1^{*}$$
is the projection matrix such that
$$P_{S_1} x = P x$$
for any $x \in S$.
Proof
We choose the columns of $B_2$ in such a way that they form an orthonormal basis for $S_1^{\perp}$. As a consequence, as explained in the lecture on unitary matrices (see the section on non-square matrices with orthonormal columns), we have
$$B_2^{*} B_2 = I,$$
where $B_2^{*}$ denotes the conjugate transpose of $B_2$. Moreover, since the columns of $B_2$ are orthogonal to the columns of $B_1$, we have
$$B_1^{*} B_2 = 0$$
and
$$B_2^{*} B_1 = 0.$$
The columns of $B_1$ are linearly independent since they form a basis. Hence, $B_1 c \neq 0$ for any $c \neq 0$, which implies that
$$c^{*} B_1^{*} B_1 c = \left\| B_1 c \right\|^2 > 0$$
for any $c \neq 0$. Thus, $B_1^{*} B_1$ is full-rank (hence invertible). We use these results to derive the following equality:
$$\begin{bmatrix} \left( B_1^{*} B_1 \right)^{-1} B_1^{*} \\ B_2^{*} \end{bmatrix} \begin{bmatrix} B_1 & B_2 \end{bmatrix}
= \begin{bmatrix} \left( B_1^{*} B_1 \right)^{-1} B_1^{*} B_1 & \left( B_1^{*} B_1 \right)^{-1} B_1^{*} B_2 \\ B_2^{*} B_1 & B_2^{*} B_2 \end{bmatrix}
= \begin{bmatrix} I & 0 \\ 0 & I \end{bmatrix},$$
which implies, by the definition of inverse matrix, that
$$\begin{bmatrix} B_1 & B_2 \end{bmatrix}^{-1} = \begin{bmatrix} \left( B_1^{*} B_1 \right)^{-1} B_1^{*} \\ B_2^{*} \end{bmatrix}.$$
Thus,
$$P = \begin{bmatrix} B_1 & 0 \end{bmatrix} \begin{bmatrix} B_1 & B_2 \end{bmatrix}^{-1}
= \begin{bmatrix} B_1 & 0 \end{bmatrix} \begin{bmatrix} \left( B_1^{*} B_1 \right)^{-1} B_1^{*} \\ B_2^{*} \end{bmatrix}
= B_1 \left( B_1^{*} B_1 \right)^{-1} B_1^{*}.$$
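A brief numerical check of the formula, with an arbitrary complex basis chosen only for illustration:

```python
import numpy as np

rng = np.random.default_rng(2)

# An arbitrary basis of a 2-dimensional subspace S_1 of C^4.
B1 = rng.standard_normal((4, 2)) + 1j * rng.standard_normal((4, 2))

# Orthogonal projection matrix P = B1 (B1* B1)^(-1) B1*.
P = B1 @ np.linalg.inv(B1.conj().T @ B1) @ B1.conj().T

x = rng.standard_normal(4) + 1j * rng.standard_normal(4)
x1 = P @ x

print(np.allclose(P @ P, P))                    # idempotent -> True
print(np.allclose(P.conj().T, P))               # Hermitian (orthogonal projection) -> True
print(np.allclose(B1.conj().T @ (x - x1), 0))   # residual orthogonal to S_1 -> True
```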
When we confine our attention to real vectors, conjugate transposition becomes simple transposition and the formula for the projection matrix becomes
$$P = B_1 \left( B_1^{\top} B_1 \right)^{-1} B_1^{\top},$$
which might be familiar to those of us who have previously dealt with linear regressions and the OLS estimator.
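To make the link with OLS explicit, here is a small sketch with simulated data, where the design matrix $X$ plays the role of $B_1$: the fitted values of the regression are the orthogonal projection of the vector of observations of the dependent variable onto the column space of $X$.

```python
import numpy as np

rng = np.random.default_rng(3)

# Simulated regression data: y = X beta + noise.
X = rng.standard_normal((50, 3))          # design matrix (basis of the column space)
y = X @ np.array([1.0, -2.0, 0.5]) + rng.standard_normal(50)

# Projection ("hat") matrix onto the column space of X.
H = X @ np.linalg.inv(X.T @ X) @ X.T

# OLS fitted values computed in two equivalent ways.
fitted_via_projection = H @ y
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
fitted_via_ols = X @ beta_hat

print(np.allclose(fitted_via_projection, fitted_via_ols))  # True
```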
When the columns of the matrix $B_1$ are orthonormal, we have a further simplification:
$$B_1^{*} B_1 = I$$
and
$$P = B_1 B_1^{*}.$$
Denote by $b_1, \ldots, b_L$ the $L$ columns of $B_1$. Then, for any $x \in S$, we have
$$P x = B_1 B_1^{*} x = \sum_{i=1}^{L} \left( b_i^{*} x \right) b_i,$$
which is the formula for projections on orthonormal sets that we have already encountered in the lectures on the Gram-Schmidt process and on the QR decomposition.
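As a final illustration, the following sketch obtains an orthonormal basis via np.linalg.qr (the matrix being factorized is an arbitrary illustrative choice) and verifies that the two expressions for the projection coincide.

```python
import numpy as np

rng = np.random.default_rng(4)

# Orthonormal basis of a 3-dimensional subspace of C^6, obtained via QR.
A = rng.standard_normal((6, 3)) + 1j * rng.standard_normal((6, 3))
Q, _ = np.linalg.qr(A)            # columns of Q are orthonormal and span the columns of A

x = rng.standard_normal(6) + 1j * rng.standard_normal(6)

# Projection via the matrix Q Q* ...
proj_matrix = Q @ Q.conj().T @ x

# ... and via the sum of the projections (b_i* x) b_i on the orthonormal basis vectors.
proj_sum = sum((Q[:, i].conj() @ x) * Q[:, i] for i in range(Q.shape[1]))

print(np.allclose(proj_matrix, proj_sum))  # True
```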
Please cite as:
Taboga, Marco (2021). "Orthogonal projection", Lectures on matrix algebra. https://www.statlect.com/matrix-algebra/orthogonal-projection.