Statlect, the Digital Textbook

Mean-square convergence

This lecture discusses mean-square convergence. We deal first with mean-square convergence of sequences of random variables and then with mean-square convergence of sequences of random vectors.

Mean-square convergence of a sequence of random variables

In the lecture entitled Sequences of random variables and their convergence we stressed that different concepts of convergence are based on different ways of measuring the distance between two random variables (that is, how "close to each other" two random variables are). The concept of mean-square convergence, or convergence in mean-square, is based on the following intuition: two random variables are "close to each other" if the square of their difference is on average small.

Let $\{X_n\}$ be a sequence of random variables defined on a sample space $\Omega$. Let $X$ be a random variable. The sequence $\{X_n\}$ is said to converge to $X$ in mean-square if $\{X_n\}$ converges to $X$ according to the metric $d$ defined as follows:$$d\left(X_n,X\right)=\sqrt{E\left[\left(X_n-X\right)^2\right]}$$(if you do not understand what it means to "converge according to a metric", go to the lecture entitled Limit of a sequence).

Note that $d\left(X_n,X\right)$ is well-defined only if the expected value on the right-hand side exists, which is usually ensured by requiring that $X_n$ and $X$ be square integrable.

Intuitively, for a fixed sample point $\omega$, the squared difference $\left(X_n(\omega)-X(\omega)\right)^2$ between the two realizations of $X_n$ and $X$ provides a measure of how different those two realizations are. The mean squared difference $E\left[\left(X_n-X\right)^2\right]$ provides a measure of how different the two random variables are on average (as $\omega$ varies). If this mean squared difference becomes smaller and smaller as $n$ increases, then the sequence $\{X_n\}$ converges to $X$.
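This intuition is easy to check numerically. Below is a minimal Python sketch (the function name `ms_distance` and the device of representing each random variable as a function of a uniform sample point $\omega$ are my own, not part of the lecture) that estimates the mean-square distance $\sqrt{E\left[\left(X_n-X\right)^2\right]}$ by Monte Carlo:

```python
import random

def ms_distance(sample_xn, sample_x, num_draws=100_000, seed=0):
    """Monte Carlo estimate of the mean-square distance
    d(X_n, X) = sqrt(E[(X_n - X)^2]).
    sample_xn and sample_x are functions of the same sample point
    omega, so both variables are evaluated on the same draw."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_draws):
        omega = rng.random()          # one sample point omega in [0, 1)
        diff = sample_xn(omega) - sample_x(omega)
        total += diff * diff
    return (total / num_draws) ** 0.5

# Toy sequence: X_n(omega) = omega + 1/n converges in mean-square to
# X(omega) = omega, because X_n - X = 1/n at every sample point.
for n in (1, 10, 100):
    print(n, ms_distance(lambda w, n=n: w + 1.0 / n, lambda w: w))
```

Because $X_n-X=1/n$ at every sample point in this toy example, the estimated distance equals $1/n$ and shrinks to zero as $n$ grows.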

We summarize the concept of mean-square convergence in the following definition.

Definition Let $\{X_n\}$ be a sequence of square integrable random variables defined on a sample space $\Omega$. We say that $\{X_n\}$ is mean-square convergent (or convergent in mean-square) if and only if there exists a square integrable random variable $X$ such that $\{X_n\}$ converges to $X$ according to the metric $d$, that is,$$\lim_{n\rightarrow\infty}d\left(X_n,X\right)=\lim_{n\rightarrow\infty}\sqrt{E\left[\left(X_n-X\right)^2\right]}=0$$$X$ is called the mean-square limit of the sequence and convergence is indicated by$$X_n\overset{ms}{\rightarrow}X$$or by$$X_n\overset{L^2}{\rightarrow}X$$

Note that, in the definition above, $\lim_{n\rightarrow\infty}d\left(X_n,X\right)=0$ is just the usual criterion for convergence of a sequence of real numbers, while the notations $\overset{ms}{\rightarrow}$ and $\overset{L^2}{\rightarrow}$ indicate that convergence is in the $L^p$ space $L^{2}$, because both $X_n$ and $X$ have been required to be square integrable.

The following example illustrates the concept of mean-square convergence.

Example Let $\{X_n\}$ be a covariance stationary sequence of random variables such that all the random variables in the sequence have the same expected value $\mu$, the same variance $\sigma^2$, and zero covariance with each other. Define the sample mean $\bar{X}_n$ as follows:$$\bar{X}_n=\frac{1}{n}\sum_{i=1}^{n}X_i$$and define a constant random variable $X=\mu$. The distance between a generic term of the sequence $\{\bar{X}_n\}$ and $X$ is$$d\left(\bar{X}_n,X\right)=\sqrt{E\left[\left(\bar{X}_n-\mu\right)^2\right]}$$But $\mu$ is equal to the expected value of $\bar{X}_n$ because$$E\left[\bar{X}_n\right]=\frac{1}{n}\sum_{i=1}^{n}E\left[X_i\right]=\frac{1}{n}\,n\mu=\mu$$Therefore,$$E\left[\left(\bar{X}_n-\mu\right)^2\right]=E\left[\left(\bar{X}_n-E\left[\bar{X}_n\right]\right)^2\right]=\mathrm{Var}\left[\bar{X}_n\right]$$by the very definition of variance. In turn, the variance of $\bar{X}_n$ is$$\mathrm{Var}\left[\bar{X}_n\right]=\frac{1}{n^2}\sum_{i=1}^{n}\mathrm{Var}\left[X_i\right]=\frac{1}{n^2}\,n\sigma^2=\frac{\sigma^2}{n}$$(the covariance terms vanish because the variables are uncorrelated). Thus,$$d\left(\bar{X}_n,X\right)=\sqrt{\frac{\sigma^2}{n}}=\frac{\sigma}{\sqrt{n}}$$and$$\lim_{n\rightarrow\infty}d\left(\bar{X}_n,X\right)=\lim_{n\rightarrow\infty}\frac{\sigma}{\sqrt{n}}=0$$But this is just the definition of mean-square convergence of $\bar{X}_n$ to $X$. Therefore, the sequence $\{\bar{X}_n\}$ converges in mean-square to the constant random variable $X=\mu$.
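The example can be verified by simulation. The sketch below (the function name is mine) specializes the sequence to i.i.d. normal draws, which satisfy the assumptions of the example, and compares the estimated distance $d\left(\bar{X}_n,\mu\right)$ with the theoretical value $\sigma/\sqrt{n}$:

```python
import random

def sample_mean_distance(n, mu=0.0, sigma=1.0, num_reps=20_000, seed=42):
    """Estimate d(Xbar_n, mu) = sqrt(E[(Xbar_n - mu)^2]) by simulating
    num_reps sample means of n i.i.d. normal draws (a special case of
    the zero-covariance sequence in the example)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_reps):
        xbar = sum(rng.gauss(mu, sigma) for _ in range(n)) / n
        total += (xbar - mu) ** 2
    return (total / num_reps) ** 0.5

# The theory above predicts d(Xbar_n, mu) = sigma / sqrt(n),
# which is 1 / sqrt(n) with the default mu = 0, sigma = 1.
for n in (4, 16, 64):
    print(n, sample_mean_distance(n), 1.0 / n ** 0.5)
```

Quadrupling the sample size halves the distance, exactly as the $\sigma/\sqrt{n}$ rate predicts.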

Mean-square convergence of a sequence of random vectors

The above notion of convergence generalizes to sequences of random vectors in a straightforward manner.

Let $\{X_n\}$ be a sequence of random vectors defined on a sample space $\Omega$, where each random vector $X_n$ has dimension $K\times 1$. The sequence of random vectors $\{X_n\}$ is said to converge to a random vector $X$ in mean-square if $\{X_n\}$ converges to $X$ according to the metric $d$ defined as follows:$$d\left(X_n,X\right)=\sqrt{E\left[\left\Vert X_n-X\right\Vert^2\right]}=\sqrt{E\left[\sum_{i=1}^{K}\left(X_{n,i}-X_{\bullet,i}\right)^2\right]}$$where $\left\Vert X_n-X\right\Vert$ is the Euclidean norm of the difference between $X_n$ and $X$ and the second subscript is used to indicate the individual components of the vectors $X_n$ and $X$.

Of course, $d\left(X_n,X\right)$ is well-defined only if the expected value on the right-hand side exists. A sufficient condition for $d\left(X_n,X\right)$ to be well-defined is that all the components of $X_n$ and $X$ be square integrable random variables.

Intuitively, for a fixed sample point $\omega$, the square of the Euclidean norm $\left\Vert X_n(\omega)-X(\omega)\right\Vert^2$ of the difference between the two realizations of $X_n$ and $X$ provides a measure of how different those two realizations are. The mean of the square of the Euclidean norm $E\left[\left\Vert X_n-X\right\Vert^2\right]$ provides a measure of how different the two random vectors are on average (as $\omega$ varies). If $E\left[\left\Vert X_n-X\right\Vert^2\right]$ becomes smaller and smaller as $n$ increases, then the sequence of random vectors $\{X_n\}$ converges to the vector $X$.

The following is a formal definition of mean-square convergence for random vectors.

Definition Let $\{X_n\}$ be a sequence of random vectors defined on a sample space $\Omega$, whose components are square integrable random variables. We say that $\{X_n\}$ is mean-square convergent (or convergent in mean-square) if and only if there exists a random vector $X$ with square integrable components such that $\{X_n\}$ converges to $X$ according to the metric $d$, that is,$$\lim_{n\rightarrow\infty}d\left(X_n,X\right)=\lim_{n\rightarrow\infty}\sqrt{E\left[\left\Vert X_n-X\right\Vert^2\right]}=0$$$X$ is called the mean-square limit of the sequence and convergence is indicated by$$X_n\overset{ms}{\rightarrow}X$$or by$$X_n\overset{L^2}{\rightarrow}X$$

Note that, in the definition above, $\lim_{n\rightarrow\infty}d\left(X_n,X\right)=0$ is just the usual criterion for convergence of a sequence of real numbers, while the notations $\overset{ms}{\rightarrow}$ and $\overset{L^2}{\rightarrow}$ indicate that convergence is in the $L^p$ space $L^{2}$, because both $X_n$ and $X$ have been required to have square integrable components.

Now, denote by $\{X_{n,i}\}$ the sequence of the $i$-th components of the vectors $X_n$. It can be proved that the sequence of random vectors $\{X_n\}$ is convergent in mean-square if and only if all the $K$ sequences of random variables $\{X_{n,i}\}$ ($i=1,\ldots,K$) are convergent in mean-square.

Proposition Let $\{X_n\}$ be a sequence of random vectors defined on a sample space $\Omega$, such that their components are square integrable random variables. Denote by $\{X_{n,i}\}$ the sequence of random variables obtained by taking the $i$-th component of each random vector $X_n$. The sequence $\{X_n\}$ converges in mean-square to the random vector $X$ if and only if $\{X_{n,i}\}$ converges in mean-square to the random variable $X_{\bullet,i}$ (the $i$-th component of $X$) for each $i=1,\ldots,K$.
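A short numerical illustration of the proposition (the function name and the toy vectors are mine): since $\left\Vert X_n-X\right\Vert^2$ is the sum of the squared component differences, the expected squared norm is the sum of the component mean squared differences, so it vanishes exactly when every component's mean squared difference does.

```python
import random

def squared_ms_vector(pairs, num_draws=50_000, seed=7):
    """Monte Carlo estimate of E[||X_n - X||^2] for a K-dimensional
    random vector; `pairs` lists (i-th component of X_n, i-th component
    of X) as functions of the same sample point omega."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_draws):
        omega = rng.random()
        total += sum((f(omega) - g(omega)) ** 2 for f, g in pairs)
    return total / num_draws

# K = 2 components whose differences are the constants 0.1 and 0.2:
# E[||X_n - X||^2] = 0.1^2 + 0.2^2 = 0.05, i.e. the sum of the two
# component mean squared differences.
components = [(lambda w: w + 0.1, lambda w: w),
              (lambda w: w + 0.2, lambda w: w)]
print(squared_ms_vector(components))
```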

Solved exercises

Below you can find some exercises with explained solutions.

Exercise 1

Let $U$ be a random variable having a uniform distribution on the interval $[1,2]$. In other words, $U$ is an absolutely continuous random variable with support$$R_U=[1,2]$$and probability density function$$f_U(u)=\begin{cases}1 & \text{if }u\in [1,2]\\ 0 & \text{otherwise}\end{cases}$$Consider a sequence of random variables $\{X_n\}$ whose generic term is$$X_n=1_{\left\{1\leq U\leq 2-\frac{1}{n}\right\}}$$where $1_{\left\{1\leq U\leq 2-\frac{1}{n}\right\}}$ is the indicator function of the event $\left\{1\leq U\leq 2-\frac{1}{n}\right\}$.

Find the mean-square limit (if it exists) of the sequence $\{X_n\}$.

Solution

When $n$ tends to infinity, the interval $\left[1,2-\frac{1}{n}\right]$ becomes similar to the interval $[1,2]$ because$$\lim_{n\rightarrow\infty}\left(2-\frac{1}{n}\right)=2$$Therefore, we conjecture that the indicators $1_{\left\{1\leq U\leq 2-\frac{1}{n}\right\}}$ converge in mean-square to the indicator $1_{\left\{1\leq U\leq 2\right\}}$. But $1_{\left\{1\leq U\leq 2\right\}}$ is always equal to 1 (because the support of $U$ is $[1,2]$), so our conjecture is that the sequence $\{X_n\}$ converges in mean-square to 1. To verify our conjecture, we need to verify that$$\lim_{n\rightarrow\infty}E\left[\left(X_n-1\right)^2\right]=0$$

The expected value can be computed as follows:$$E\left[\left(X_n-1\right)^2\right]=P\left(U>2-\frac{1}{n}\right)=\int_{2-1/n}^{2}1\,du=\frac{1}{n}$$(the squared difference equals 1 exactly when the event fails and 0 otherwise). Thus, the sequence $\{X_n\}$ converges in mean-square to 1 because$$\lim_{n\rightarrow\infty}E\left[\left(X_n-1\right)^2\right]=\lim_{n\rightarrow\infty}\frac{1}{n}=0$$
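As a sanity check, the computation can be reproduced by simulation. The sketch below assumes the reading of the exercise in which $X_n$ is the indicator of the event $\left\{1\leq U\leq 2-1/n\right\}$ (both the function name and this reading are my own), so that the exact value of $E\left[\left(X_n-1\right)^2\right]$ is $1/n$:

```python
import random

def exercise1_distance_sq(n, num_draws=200_000, seed=1):
    """Monte Carlo estimate of E[(X_n - 1)^2], with U uniform on [1, 2]
    and X_n the indicator of {1 <= U <= 2 - 1/n} (assumed reading of
    the exercise). The exact value is P(U > 2 - 1/n) = 1/n."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_draws):
        u = 1.0 + rng.random()                  # U uniform on [1, 2)
        x_n = 1.0 if u <= 2.0 - 1.0 / n else 0.0
        total += (x_n - 1.0) ** 2               # 1 exactly when U > 2 - 1/n
    return total / num_draws

# The estimates shrink toward 0 as n grows, matching lim 1/n = 0.
for n in (4, 20, 100):
    print(n, exercise1_distance_sq(n))
```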

Exercise 2

Let $\{X_n\}$ be a sequence of discrete random variables. Let the probability mass function of a generic term of the sequence $X_n$ be$$p_{X_n}(x)=\begin{cases}1-\frac{1}{n} & \text{if }x=0\\ \frac{1}{n} & \text{if }x=n\\ 0 & \text{otherwise}\end{cases}$$

Find the mean-square limit (if it exists) of the sequence $\{X_n\}$.

Solution

Note that$$\lim_{n\rightarrow\infty}P\left(X_n=0\right)=\lim_{n\rightarrow\infty}\left(1-\frac{1}{n}\right)=1$$Therefore, one would expect that the sequence $\{X_n\}$ converges to the constant random variable $X=0$. However, the sequence $\{X_n\}$ does not converge in mean-square to 0. The distance of a generic term of the sequence from 0 is$$d\left(X_n,0\right)=\sqrt{E\left[\left(X_n-0\right)^2\right]}=\sqrt{0^2\cdot\left(1-\frac{1}{n}\right)+n^2\cdot\frac{1}{n}}=\sqrt{n}$$Thus,$$\lim_{n\rightarrow\infty}d\left(X_n,0\right)=\lim_{n\rightarrow\infty}\sqrt{n}=\infty$$while, if $\{X_n\}$ were convergent in mean-square to 0, we would have$$\lim_{n\rightarrow\infty}d\left(X_n,0\right)=0$$
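The failure of mean-square convergence can be made concrete with a short computation. The sketch below assumes the pmf $P\left(X_n=n\right)=1/n$, $P\left(X_n=0\right)=1-1/n$ (a hypothetical but standard reading of the exercise; the function name is mine):

```python
def second_moment(n):
    """E[X_n^2] computed directly from the assumed pmf:
    P(X_n = n) = 1/n and P(X_n = 0) = 1 - 1/n (hypothetical reading
    of the exercise)."""
    return 0.0 ** 2 * (1.0 - 1.0 / n) + float(n) ** 2 * (1.0 / n)

# The second moment equals n, so E[(X_n - 0)^2] diverges even though
# P(X_n = 0) tends to 1: the rare value n is so large that it
# dominates the average of the squared difference.
for n in (10, 100, 1000):
    print(n, second_moment(n))
```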

Exercise 3

Does the sequence in the previous exercise converge in probability?

Solution

The sequence $\{X_n\}$ converges in probability to the constant random variable $X=0$ because, for any $\varepsilon>0$ (and $n>\varepsilon$), we have$$\lim_{n\rightarrow\infty}P\left(\left|X_n-0\right|>\varepsilon\right)=\lim_{n\rightarrow\infty}P\left(X_n=n\right)=\lim_{n\rightarrow\infty}\frac{1}{n}=0$$
