
Heteroskedasticity-robust standard errors

by Marco Taboga, PhD

In linear regression analysis, an estimator of the asymptotic covariance matrix of the OLS estimator is said to be heteroskedasticity-robust if it converges asymptotically to the true value even when the variance of the errors of the regression is not constant.

In this case, also the standard errors, which are equal to the square roots of the diagonal entries of the covariance matrix, are said to be heteroskedasticity-robust.


The linear regression

Consider the linear regression model
$$y_i = x_i\beta + \varepsilon_i$$
where:

  1. $y_i$ is the dependent variable;

  2. $x_i$ is a $1\times K$ row vector of regressors;

  3. $\beta$ is a $K\times 1$ vector of regression coefficients;

  4. $\varepsilon_i$ is the error term.

There are $N$ observations in the sample: $i = 1, \ldots, N$.

The OLS estimator

The ordinary least squares (OLS) estimator of $\beta$ can be written as
$$\widehat{\beta} = \left( \sum_{i=1}^{N} x_i^{\top} x_i \right)^{-1} \sum_{i=1}^{N} x_i^{\top} y_i$$
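As a minimal numerical sketch of the formula above, the estimator can be computed directly with NumPy. The data below are hypothetical (simulated), chosen only for illustration:

```python
import numpy as np

# Hypothetical simulated sample: N observations, K regressors
# (first column is an intercept); the rows of X are the x_i.
rng = np.random.default_rng(0)
N, K = 200, 3
X = np.column_stack([np.ones(N), rng.normal(size=(N, K - 1))])
beta_true = np.array([1.0, 2.0, -0.5])
y = X @ beta_true + rng.normal(size=N)

# OLS: beta_hat = (sum_i x_i' x_i)^{-1} sum_i x_i' y_i, i.e. (X'X)^{-1} X'y.
# Solving the normal equations is preferable to explicitly inverting X'X.
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
```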

Asymptotic covariance matrix

Under appropriate conditions, the OLS estimator is asymptotically normal:
$$\sqrt{N}\left( \widehat{\beta} - \beta \right) \overset{d}{\longrightarrow} N\left(0, V\right)$$
where:

  1. $\overset{d}{\longrightarrow}$ denotes convergence in distribution;

  2. $V$ is the asymptotic covariance matrix of the OLS estimator.

Standard errors

The standard errors are the estimates of the standard deviations of the entries of $\widehat{\beta}$.

Denote by $\widehat{V}$ an estimator of $V$.

Then, the covariance matrix of $\widehat{\beta}$ is approximated by
$$\frac{1}{N}\widehat{V}$$
and the standard errors are equal to the square roots of the diagonal entries of the latter matrix.


Homoskedasticity and heteroskedasticity

The errors of the regression are said to be conditionally homoskedastic if their variance is constant:
$$\mathrm{E}\left[ \varepsilon_i^2 \mid x_i \right] = \sigma^2$$
where $\sigma^2$ is a constant.

If the conditional variance is not constant, the errors are said to be conditionally heteroskedastic, and the regression is said to be affected by heteroskedasticity.

Robustness to heteroskedasticity

An estimator $\widehat{V}$ of the asymptotic covariance matrix $V$ is heteroskedasticity-robust if it is consistent even when the errors are conditionally heteroskedastic.

Consistent means that
$$\widehat{V} \overset{P}{\longrightarrow} V$$
where $\overset{P}{\longrightarrow}$ denotes convergence in probability.

Formula for the asymptotic covariance

Under mild technical conditions, the asymptotic covariance matrix is
$$V = Q^{-1} W Q^{-1}$$
where
$$Q = \mathrm{E}\left[ x_i^{\top} x_i \right]$$
and
$$W = \lim_{N\rightarrow\infty} \mathrm{Var}\left[ \frac{1}{\sqrt{N}} \sum_{i=1}^{N} x_i^{\top} \varepsilon_i \right]$$
is the so-called long-run covariance matrix.

Consistent estimators

The matrix $Q$ is consistently estimated by
$$\widehat{Q} = \frac{1}{N} \sum_{i=1}^{N} x_i^{\top} x_i$$

Therefore, by the Continuous Mapping Theorem, if we can find a consistent estimator $\widehat{W}$ of $W$, then the asymptotic covariance matrix is consistently estimated by
$$\widehat{V} = \widehat{Q}^{-1}\, \widehat{W}\, \widehat{Q}^{-1}$$

A simpler formula for the long-run covariance

Suppose that the following assumptions about the sequence $\left\{ x_i^{\top} \varepsilon_i \right\}$ hold:

  1. zero mean: $\mathrm{E}\left[ x_i^{\top} \varepsilon_i \right] = 0$;

  2. no serial correlation: $\mathrm{E}\left[ \varepsilon_i \varepsilon_j x_i^{\top} x_j \right] = 0$ if $j \neq i$;

  3. weak stationarity: $\mathrm{E}\left[ \varepsilon_i^2 x_i^{\top} x_i \right]$ does not depend on $i$.

Then, the long-run covariance matrix can be written as
$$W = \mathrm{E}\left[ \varepsilon_i^2 x_i^{\top} x_i \right]$$


The proof is as follows:
$$\begin{aligned}
W &= \lim_{N\rightarrow\infty} \mathrm{Var}\left[ \frac{1}{\sqrt{N}} \sum_{i=1}^{N} x_i^{\top} \varepsilon_i \right] \\
&= \lim_{N\rightarrow\infty} \frac{1}{N} \sum_{i=1}^{N} \sum_{j=1}^{N} \mathrm{E}\left[ \varepsilon_i \varepsilon_j x_i^{\top} x_j \right] \\
&= \lim_{N\rightarrow\infty} \frac{1}{N} \sum_{i=1}^{N} \mathrm{E}\left[ \varepsilon_i^2 x_i^{\top} x_i \right] \\
&= \mathrm{E}\left[ \varepsilon_i^2 x_i^{\top} x_i \right]
\end{aligned}$$
where the second equality follows from the zero-mean assumption, the third from the absence of serial correlation, and the fourth from weak stationarity.

Note that the zero-mean assumption is the same as the orthogonality assumption usually needed to prove the consistency of the OLS estimator.

Estimator of the long-run covariance

Under mild technical conditions, the long-run covariance matrix is consistently estimated by
$$\widehat{W} = \frac{1}{N} \sum_{i=1}^{N} e_i^2 x_i^{\top} x_i$$
where the residuals $e_i$ are defined as
$$e_i = y_i - x_i \widehat{\beta}$$


The sample average
$$\frac{1}{N} \sum_{i=1}^{N} \varepsilon_i^2 x_i^{\top} x_i$$
converges in probability to $W$ if the sequence $\left\{ \varepsilon_i^2 x_i^{\top} x_i \right\}$ satisfies the conditions of a Law of Large Numbers (the mild technical conditions mentioned above). The errors $\varepsilon_i$ in the last formula can be replaced by the residuals $e_i$, because the difference between the two converges to zero in probability as the sample size increases.

The robust estimator

If we plug the formula for $\widehat{W}$ into the expression we had previously derived for the estimator of the asymptotic covariance matrix, we obtain:
$$\widehat{V} = \left( \frac{1}{N} \sum_{i=1}^{N} x_i^{\top} x_i \right)^{-1} \left( \frac{1}{N} \sum_{i=1}^{N} e_i^2 x_i^{\top} x_i \right) \left( \frac{1}{N} \sum_{i=1}^{N} x_i^{\top} x_i \right)^{-1}$$

This estimator is robust to heteroskedasticity.

As a matter of fact, we did not assume homoskedasticity to prove its consistency.

The square roots of the diagonal entries of the matrix
$$\frac{1}{N}\widehat{V}$$
are known as heteroskedasticity-robust standard errors.

Matrix form

Using matrix notation, we can write the expression above in a more compact form.

Define the $N\times 1$ vector of dependent variables $y$, the $N\times K$ matrix of regressors $X$ (whose $i$-th row is $x_i$), and the $N\times N$ diagonal matrix
$$\widehat{\Sigma} = \mathrm{diag}\left( e_1^2, \ldots, e_N^2 \right)$$

Then, the heteroskedasticity-robust covariance matrix is
$$\frac{1}{N}\widehat{V} = \left( X^{\top} X \right)^{-1} X^{\top} \widehat{\Sigma} X \left( X^{\top} X \right)^{-1}$$
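The matrix-form sandwich can be sketched in NumPy as follows; the simulated data are hypothetical, and the "meat" $X^{\top}\widehat{\Sigma}X$ is computed without materializing the $N\times N$ diagonal matrix:

```python
import numpy as np

# Hypothetical heteroskedastic sample.
rng = np.random.default_rng(2)
N = 300
X = np.column_stack([np.ones(N), rng.normal(size=N)])
y = X @ np.array([2.0, 1.0]) + rng.normal(size=N) * (1 + np.abs(X[:, 1]))

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
e = y - X @ beta_hat
XtX_inv = np.linalg.inv(X.T @ X)

# X' Sigma_hat X, with Sigma_hat = diag(e_1^2, ..., e_N^2),
# obtained by scaling the rows of X instead of building the diagonal matrix.
meat = X.T @ (X * (e ** 2)[:, None])
cov_robust = XtX_inv @ meat @ XtX_inv      # (X'X)^{-1} X' Sigma_hat X (X'X)^{-1}
robust_se = np.sqrt(np.diag(cov_robust))   # heteroskedasticity-robust standard errors
```

In practice, libraries such as statsmodels expose this estimator directly (e.g. via the `cov_type="HC0"` option of `OLS.fit`), so the manual computation above is mainly useful for understanding the formula.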

Non-robust estimator

Compare the formulae above with those for the non-robust estimator
$$\widehat{V} = \widehat{\sigma}^2 \widehat{Q}^{-1}$$
where
$$\widehat{\sigma}^2 = \frac{1}{N} \sum_{i=1}^{N} e_i^2$$

This estimator is not robust to heteroskedasticity.

In fact, in order to prove its consistency, we need to assume conditional homoskedasticity:
$$\mathrm{E}\left[ \varepsilon_i^2 \mid x_i \right] = \sigma^2$$
for every $i$, with $\sigma^2$ constant.


Under the hypothesis of homoskedasticity, we can write the long-run covariance matrix as follows:
$$W = \mathrm{E}\left[ \varepsilon_i^2 x_i^{\top} x_i \right] = \mathrm{E}\left[ \mathrm{E}\left[ \varepsilon_i^2 \mid x_i \right] x_i^{\top} x_i \right] = \sigma^2\, \mathrm{E}\left[ x_i^{\top} x_i \right] = \sigma^2 Q$$
which is consistently estimated by
$$\widehat{W} = \widehat{\sigma}^2 \widehat{Q}$$
The estimator of the asymptotic covariance matrix becomes:
$$\widehat{V} = \widehat{Q}^{-1}\, \widehat{\sigma}^2 \widehat{Q}\, \widehat{Q}^{-1} = \widehat{\sigma}^2 \widehat{Q}^{-1}$$
Hence, the estimator of the covariance matrix is
$$\frac{1}{N}\widehat{V} = \widehat{\sigma}^2 \left( X^{\top} X \right)^{-1}$$
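A quick side-by-side sketch makes the contrast concrete. On hypothetical data whose error variance grows with the regressor, the robust and non-robust covariance estimators can differ substantially:

```python
import numpy as np

# Hypothetical sample with strongly heteroskedastic errors.
rng = np.random.default_rng(3)
N = 1000
X = np.column_stack([np.ones(N), rng.normal(size=N)])
y = X @ np.array([0.0, 1.0]) + rng.normal(size=N) * (0.2 + X[:, 1] ** 2)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
e = y - X @ beta_hat
XtX_inv = np.linalg.inv(X.T @ X)

sigma2_hat = e @ e / N                                   # (1/N) sum e_i^2
cov_nonrobust = sigma2_hat * XtX_inv                     # sigma_hat^2 (X'X)^{-1}
cov_robust = XtX_inv @ (X.T @ (X * (e ** 2)[:, None])) @ XtX_inv

se_nonrobust = np.sqrt(np.diag(cov_nonrobust))
se_robust = np.sqrt(np.diag(cov_robust))
```

Under homoskedastic errors the two sets of standard errors would be close; here they diverge because the non-robust formula wrongly assumes a constant error variance.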


Synonyms

Heteroskedasticity-robust standard errors go by many different names:

  1. heteroskedasticity-consistent standard errors;

  2. Eicker-Huber-White standard errors;

  3. Huber-White standard errors;

  4. White standard errors.

More details

More mathematical details and proofs of the facts stated above can be found in the lecture on the properties of the OLS estimator.


How to cite

Please cite as:

Taboga, Marco (2021). "Heteroskedasticity-robust standard errors", Lectures on probability theory and mathematical statistics. Kindle Direct Publishing. Online appendix.
