This lecture discusses how to perform tests of hypotheses about the coefficients of a linear regression model estimated by ordinary least squares (OLS).
The lecture is divided into two parts:
- in the first part, we discuss hypothesis testing in the normal linear regression model, in which the OLS estimator of the coefficients has a normal distribution conditional on the matrix of regressors;
- in the second part, we show how to carry out hypothesis tests in linear regression analyses where the assumption of normality holds only in large samples (i.e., the OLS estimator can be proved to be asymptotically normal).
The regression model
is
$$y_i = x_i \beta + \varepsilon_i$$
where:
- $y_i$ is an output variable;
- $x_i$ is a $1 \times K$ vector of inputs;
- $\beta$ is a $K \times 1$ vector of coefficients;
- $\varepsilon_i$ is an error term.

There are $N$ observations in the sample, so that $i = 1, \ldots, N$.
We also denote:
- by $y$ the $N \times 1$ vector of outputs;
- by $X$ the $N \times K$ matrix of inputs, whose $i$-th row is $x_i$;
- by $\varepsilon$ the $N \times 1$ vector of errors.

Using this notation, we can write
$$y = X\beta + \varepsilon$$

Moreover, the OLS estimator of $\beta$ is
$$\widehat{\beta} = \left( X^\top X \right)^{-1} X^\top y$$

We assume that the design matrix $X$ has full rank, so that the $K \times K$ matrix $X^\top X$ is invertible.
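For concreteness, here is a minimal numerical sketch of these formulas; the data are simulated and all variable names are hypothetical:

```python
import numpy as np

# Simulate a small data set (hypothetical sizes and coefficients).
rng = np.random.default_rng(0)
N, K = 100, 3                      # sample size and number of regressors
X = np.column_stack([np.ones(N), rng.normal(size=(N, K - 1))])
beta_true = np.array([1.0, 2.0, -0.5])
y = X @ beta_true + rng.normal(size=N)

# OLS estimator: beta_hat = (X'X)^(-1) X'y
# (np.linalg.solve is preferred to explicitly inverting X'X)
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
print(beta_hat)
```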
We now explain how to derive tests about the coefficients of the normal linear regression model.
The vector of errors $\varepsilon$ is assumed to have a multivariate normal distribution conditional on $X$, with mean equal to $0$ and covariance matrix equal to
$$\operatorname{Var}\left[ \varepsilon \mid X \right] = \sigma^2 I$$
where $I$ is the $N \times N$ identity matrix and $\sigma^2$ is a positive constant.
It can be proved (see the lecture about the normal linear regression model) that the assumption of conditional normality implies that:
- the OLS estimator $\widehat{\beta}$ is conditionally multivariate normal with mean $\beta$ and covariance matrix $\sigma^2 \left( X^\top X \right)^{-1}$;
- the adjusted sample variance of the residuals
$$\widehat{\sigma}^2 = \frac{1}{N-K} \sum_{i=1}^{N} \left( y_i - x_i \widehat{\beta} \right)^2$$
is an unbiased estimator of $\sigma^2$; furthermore, it has a Gamma distribution with parameters $N-K$ and $\sigma^2$;
- $\widehat{\sigma}^2$ is conditionally independent of $\widehat{\beta}$.
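The unbiasedness of $\widehat{\sigma}^2$ can also be checked numerically. The following standalone Monte Carlo sketch (with simulated data and hypothetical parameter values) does so:

```python
import numpy as np

# Standalone Monte Carlo check that the adjusted sample variance of the
# residuals is an unbiased estimator of sigma^2 (all names hypothetical).
rng = np.random.default_rng(1)
n, k, sigma2 = 50, 3, 4.0
X0 = np.column_stack([np.ones(n), rng.normal(size=(n, k - 1))])
b0 = np.array([1.0, 2.0, -0.5])

draws = []
for _ in range(10_000):
    y0 = X0 @ b0 + np.sqrt(sigma2) * rng.normal(size=n)
    b_hat = np.linalg.solve(X0.T @ X0, X0.T @ y0)
    e = y0 - X0 @ b_hat
    draws.append(e @ e / (n - k))   # adjusted sample variance

print(np.mean(draws))   # should be close to sigma2 = 4.0
```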
In a test of a restriction on a single coefficient, we test the null hypothesis
$$H_0: \beta_k = q$$
where $\beta_k$ is the $k$-th entry of the vector of coefficients $\beta$ and $q \in \mathbb{R}$.

In other words, our null hypothesis is that the $k$-th coefficient is equal to a specific value.

This hypothesis is tested with the test statistic
$$t = \frac{\widehat{\beta}_k - q}{\sqrt{\widehat{\sigma}^2 \left[ \left( X^\top X \right)^{-1} \right]_{kk}}}$$
where $\left[ \left( X^\top X \right)^{-1} \right]_{kk}$ is the $k$-th diagonal entry of the matrix $\left( X^\top X \right)^{-1}$.

The test statistic $t$ has a standard Student's t distribution with $N-K$ degrees of freedom. For this reason, it is called a t statistic and the test is called a t test.
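Continuing the simulated example above, here is a sketch of how the t statistic and its two-tailed p-value might be computed (the choice of $k$ and $q$ is hypothetical):

```python
import numpy as np
from scipy import stats

# t test of H0: beta_k = q, reusing X, y, beta_hat, N, K from the
# simulated sketch above (k and q are hypothetical).
k, q = 1, 2.0
resid = y - X @ beta_hat
sigma2_hat = resid @ resid / (N - K)          # adjusted sample variance
XtX_inv = np.linalg.inv(X.T @ X)
se_k = np.sqrt(sigma2_hat * XtX_inv[k, k])    # standard error of beta_hat[k]
t_stat = (beta_hat[k] - q) / se_k

# two-tailed p-value from the Student's t distribution with N-K dof
p_value = 2 * stats.t.sf(abs(t_stat), df=N - K)
print(t_stat, p_value)
```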
Under the null hypothesis, $\widehat{\beta}_k$ has a normal distribution with mean $q$ and variance $\sigma^2 \left[ \left( X^\top X \right)^{-1} \right]_{kk}$.

As a consequence, the ratio
$$z = \frac{\widehat{\beta}_k - q}{\sqrt{\sigma^2 \left[ \left( X^\top X \right)^{-1} \right]_{kk}}}$$
has a standard normal distribution (mean $0$ and variance $1$).

We can write
$$t = \frac{z}{\sqrt{\widehat{\sigma}^2 / \sigma^2}}$$

Since $\widehat{\sigma}^2$ has a Gamma distribution with parameters $N-K$ and $\sigma^2$, the ratio
$$\frac{\widehat{\sigma}^2}{\sigma^2}$$
has a Gamma distribution with parameters $N-K$ and $1$.

It is also independent of $z$ because $\widehat{\sigma}^2$ is independent of $\widehat{\beta}_k$.

Therefore, the ratio
$$t = \frac{z}{\sqrt{\widehat{\sigma}^2 / \sigma^2}}$$
has a standard Student's t distribution with $N-K$ degrees of freedom (see the lecture on the Student's t distribution).
The null hypothesis is rejected if $t$ falls outside the acceptance region.

How the acceptance region is determined depends not only on the desired size of the test, but also on whether the test is:
- two-tailed ($\beta_k$ could be smaller or larger than $q$; we do not exclude either of the two possibilities);
- one-tailed (only one of the two things, i.e., either smaller or larger, is possible).

For more details on how to determine the acceptance region, see the glossary entry on critical values; a numerical sketch follows below.
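For instance, assuming a size of 5% and $N-K = 97$ degrees of freedom as in the simulated example, the critical values could be computed as follows:

```python
from scipy import stats

# Critical values for a test of size alpha = 0.05 with N - K = 97
# degrees of freedom (illustrative numbers).
alpha, dof = 0.05, 97

# two-tailed test: accept H0 if -c_two <= t <= c_two
c_two = stats.t.ppf(1 - alpha / 2, df=dof)

# one-tailed test (right tail): accept H0 if t <= c_one
c_one = stats.t.ppf(1 - alpha, df=dof)

print(c_two, c_one)
```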
When testing a set of linear restrictions, we test the null hypothesis
$$H_0: R\beta = r$$
where $R$ is an $L \times K$ matrix and $r$ is an $L \times 1$ vector. $L$ is the number of restrictions.
Example
Suppose that $\beta$ is $2 \times 1$ and that we want to test the hypothesis $\beta_1 + \beta_2 = 1$. We can write it in the form $R\beta = r$ by setting
$$R = \begin{bmatrix} 1 & 1 \end{bmatrix}, \qquad r = 1$$
Example
Suppose that $\beta$ is $3 \times 1$ and that we want to test whether the two restrictions $\beta_1 = \beta_2$ and $\beta_3 = 0$ hold simultaneously. The first restriction can be written as
$$\beta_1 - \beta_2 = 0$$
So we have
$$R = \begin{bmatrix} 1 & -1 & 0 \\ 0 & 0 & 1 \end{bmatrix}, \qquad r = \begin{bmatrix} 0 \\ 0 \end{bmatrix}$$
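The two examples might be coded as follows; the matrices simply transcribe the restrictions above:

```python
import numpy as np

# Example 1: beta is 2x1, restriction beta_1 + beta_2 = 1
R1 = np.array([[1.0, 1.0]])
r1 = np.array([1.0])

# Example 2: beta is 3x1, restrictions beta_1 - beta_2 = 0 and beta_3 = 0
R2 = np.array([[1.0, -1.0, 0.0],
               [0.0,  0.0, 1.0]])
r2 = np.array([0.0, 0.0])
```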
To test the null hypothesis, we use the test statistic
$$F = \frac{1}{L} \, \frac{\left( R\widehat{\beta} - r \right)^\top \left[ R \left( X^\top X \right)^{-1} R^\top \right]^{-1} \left( R\widehat{\beta} - r \right)}{\widehat{\sigma}^2}$$
which has an F distribution with $L$ and $N-K$ degrees of freedom. For this reason, it is called an F statistic and the test is called an F test.
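Continuing the simulated example, here is a sketch of the F statistic for the second set of restrictions, reusing `X`, `y`, `beta_hat`, `sigma2_hat`, `R2`, and `r2` from the previous snippets:

```python
import numpy as np
from scipy import stats

# F test of H0: R beta = r (names reused from the sketches above).
R, r = R2, r2
L = R.shape[0]                                   # number of restrictions

diff = R @ beta_hat - r
middle = np.linalg.inv(R @ np.linalg.inv(X.T @ X) @ R.T)
F_stat = (diff @ middle @ diff) / (L * sigma2_hat)

# one-tailed p-value from the F distribution with L and N-K dof
p_value = stats.f.sf(F_stat, dfn=L, dfd=N - K)
print(F_stat, p_value)
```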
Under the null and conditional on $X$, the vector $R\widehat{\beta} - r$, being a linear transformation of the normal random vector $\widehat{\beta}$, has a multivariate normal distribution with mean $0$ and covariance matrix
$$\sigma^2 R \left( X^\top X \right)^{-1} R^\top$$

Thus, we can write
$$F = \frac{1}{L} \, \frac{\left( R\widehat{\beta} - r \right)^\top \left[ \sigma^2 R \left( X^\top X \right)^{-1} R^\top \right]^{-1} \left( R\widehat{\beta} - r \right)}{\widehat{\sigma}^2 / \sigma^2}$$

Since $R\widehat{\beta} - r$ is multivariate normal, the quadratic form
$$\left( R\widehat{\beta} - r \right)^\top \left[ \sigma^2 R \left( X^\top X \right)^{-1} R^\top \right]^{-1} \left( R\widehat{\beta} - r \right)$$
has a Chi-square distribution with $L$ degrees of freedom (see the lecture on quadratic forms involving normal vectors). Furthermore, since $\widehat{\sigma}^2$ has a Gamma distribution with parameters $N-K$ and $\sigma^2$, the statistic
$$\frac{(N-K) \, \widehat{\sigma}^2}{\sigma^2}$$
has a Chi-square distribution with $N-K$ degrees of freedom (see the lecture on the Gamma distribution). Thus, we can write
$$F = \frac{\left( R\widehat{\beta} - r \right)^\top \left[ \sigma^2 R \left( X^\top X \right)^{-1} R^\top \right]^{-1} \left( R\widehat{\beta} - r \right) / L}{\left[ (N-K) \, \widehat{\sigma}^2 / \sigma^2 \right] / (N-K)}$$

Thus, $F$ is a ratio between two Chi-square variables, each divided by its degrees of freedom. The two variables are independent because the first depends only on $\widehat{\beta}$, the second depends only on $\widehat{\sigma}^2$, and $\widehat{\beta}$ and $\widehat{\sigma}^2$ are independent. As a consequence, $F$ has an F distribution with $L$ and $N-K$ degrees of freedom (see the lecture on the F distribution).
The F test is one-tailed. A critical value in the right tail of the F distribution is chosen so as to achieve the desired size of the test. Then, the null hypothesis is rejected if the F statistic is larger than the critical value.
When you use a statistical package to run a linear regression, you often get a regression output that includes the value of an F statistic. Usually this is obtained by performing an F test of the null hypothesis that all the regression coefficients (except the intercept) are equal to $0$.
As we explained in the lecture on maximum likelihood estimation of regression models, the maximum likelihood estimator of the vector of coefficients of a normal linear regression model is equal to the OLS estimator $\widehat{\beta}$. As a consequence, all the usual tests based on maximum likelihood procedures (e.g., Wald, Lagrange multiplier, likelihood ratio) can be employed to conduct tests of hypothesis about $\beta$.
In this section we explain how to perform hypothesis tests about the coefficients of a linear regression model when the OLS estimator is asymptotically normal.
As we have shown in the lecture on the properties of the OLS estimator, in several cases (i.e., under different sets of assumptions) it can be proved that:
- the OLS estimator $\widehat{\beta}$ is asymptotically normal, that is,
$$\sqrt{N} \left( \widehat{\beta} - \beta \right) \xrightarrow{d} Z$$
where $\xrightarrow{d}$ denotes convergence in distribution (as the sample size $N$ tends to infinity), and $Z$ is a multivariate normal random vector with mean $0$ and covariance matrix $V$;
- the value of the matrix $V$ depends on the set of assumptions made about the regression model;
- it is possible to derive a consistent estimator $\widehat{V}$ of $V$, that is,
$$\widehat{V} \xrightarrow{p} V$$
where $\xrightarrow{p}$ denotes convergence in probability (again as $N$ tends to infinity); the estimator $\widehat{V}$ is an easily computable function of the observed inputs $X$ and outputs $y$ (a sketch of one common choice follows below).
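As an illustration, one common choice of $\widehat{V}$ is White's heteroskedasticity-robust "sandwich" estimator; this is only one of the possible choices (which one is appropriate depends on the assumptions made about the model), and the sketch below reuses the simulated data from the previous snippets:

```python
import numpy as np

# One possible consistent estimator of V: White's heteroskedasticity-
# robust "sandwich" (HC0). This choice is an assumption made here for
# illustration; it is not the only estimator covered by the theory.
resid = y - X @ beta_hat
Q_hat = X.T @ X / N                               # estimate of E[x_i' x_i]
Omega_hat = (X * resid[:, None] ** 2).T @ X / N   # estimate of E[e_i^2 x_i' x_i]
Q_inv = np.linalg.inv(Q_hat)
V_hat = Q_inv @ Omega_hat @ Q_inv   # asymptotic covariance of sqrt(N)(beta_hat - beta)
print(V_hat)
```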
These two properties are used to derive the asymptotic distribution of the test statistics used in hypothesis testing.
In a z test the null hypothesis is a restriction on a single coefficient:
$$H_0: \beta_k = q$$
where $\beta_k$ is the $k$-th entry of the vector of coefficients $\beta$ and $q \in \mathbb{R}$.

The test statistic is
$$z = \frac{\sqrt{N} \left( \widehat{\beta}_k - q \right)}{\sqrt{\widehat{V}_{kk}}}$$
where $\widehat{V}_{kk}$ is the $k$-th diagonal entry of the estimator $\widehat{V}$ of the asymptotic covariance matrix.

The test statistic $z$ converges in distribution to a standard normal distribution as the sample size $N$ increases. For this reason, it is called a z statistic (because the letter z is often used to denote a standard normal distribution) and the test is called a z test.
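Continuing the example, here is a sketch of the z statistic, reusing `beta_hat` and `V_hat` from the snippets above ($k$ and $q$ are hypothetical):

```python
import numpy as np
from scipy import stats

# z test of H0: beta_k = q using the asymptotic covariance estimate V_hat.
k, q = 1, 2.0
z_stat = np.sqrt(N) * (beta_hat[k] - q) / np.sqrt(V_hat[k, k])

# two-tailed p-value from the standard normal distribution
p_value = 2 * stats.norm.sf(abs(z_stat))
print(z_stat, p_value)
```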
We can write the z statistic as
$$z = \frac{\sqrt{N} \left( \widehat{\beta}_k - q \right)}{\sqrt{\widehat{V}_{kk}}}$$

By assumption, the numerator $\sqrt{N} \left( \widehat{\beta}_k - q \right)$ converges in distribution to a normal random variable $Z_k$ with mean $0$ and variance $V_{kk}$.

The estimated variance $\widehat{V}_{kk}$ converges in probability to $V_{kk}$, so that, by the Continuous Mapping theorem, the denominator $\sqrt{\widehat{V}_{kk}}$ converges in probability to $\sqrt{V_{kk}}$.

Thus, by Slutsky's theorem, we have that $z$ converges in distribution to the random variable
$$\frac{Z_k}{\sqrt{V_{kk}}}$$
which is normal with mean $0$ and variance
$$\operatorname{Var}\left[ \frac{Z_k}{\sqrt{V_{kk}}} \right] = \frac{V_{kk}}{V_{kk}} = 1$$

Therefore, the test statistic $z$ converges in distribution to a standard normal random variable.
When $N$ is large, we approximate the actual distribution of $z$ with its asymptotic one (standard normal). We then employ the test statistic $z$ in the usual manner: based on the desired size of the test and on the distribution of $z$, we determine the critical value(s) and the acceptance region.

The test can be either one-tailed or two-tailed. The same comments made for the t test apply here.

The null hypothesis is rejected if $z$ falls outside the acceptance region.
In a Chi-square test, the null hypothesis is a set of linear restrictions
$$H_0: R\beta = r$$
where $R$ is an $L \times K$ matrix and $r$ is an $L \times 1$ vector.

The test statistic is
$$\chi^2 = N \left( R\widehat{\beta} - r \right)^\top \left[ R \widehat{V} R^\top \right]^{-1} \left( R\widehat{\beta} - r \right)$$
which converges to a Chi-square distribution with $L$ degrees of freedom. For this reason, it is called a Chi-square statistic and the test is called a Chi-square test.
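Here is a sketch of the Chi-square statistic, reusing `R2`, `r2`, `beta_hat`, and `V_hat` from the previous snippets:

```python
import numpy as np
from scipy import stats

# Chi-square (Wald) test of H0: R beta = r based on the asymptotic
# distribution (names reused from the sketches above).
R, r = R2, r2
L = R.shape[0]

diff = R @ beta_hat - r
chi2_stat = N * diff @ np.linalg.inv(R @ V_hat @ R.T) @ diff

# one-tailed p-value from the Chi-square distribution with L dof
p_value = stats.chi2.sf(chi2_stat, df=L)
print(chi2_stat, p_value)
```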
We can write the test statistic as
$$\chi^2 = \left[ \sqrt{N} \left( R\widehat{\beta} - r \right) \right]^\top \left[ R \widehat{V} R^\top \right]^{-1} \left[ \sqrt{N} \left( R\widehat{\beta} - r \right) \right]$$

Under the null hypothesis $R\beta = r$, we have
$$\sqrt{N} \left( R\widehat{\beta} - r \right) = R \, \sqrt{N} \left( \widehat{\beta} - \beta \right) \xrightarrow{d} RZ$$

By the assumptions on the convergence of $\widehat{\beta}$ and $\widehat{V}$, and by the Continuous Mapping theorem, we have that
$$\left[ R \widehat{V} R^\top \right]^{-1} \xrightarrow{p} \left[ R V R^\top \right]^{-1}$$

By Slutsky's theorem, we have
$$\chi^2 \xrightarrow{d} \left( RZ \right)^\top \left[ R V R^\top \right]^{-1} \left( RZ \right)$$

But $RZ$ is multivariate normal with mean $0$ and variance
$$\operatorname{Var}\left[ RZ \right] = R V R^\top$$

Thus, by standard results on normal quadratic forms, the quadratic form on the right-hand side has a Chi-square distribution with $L$ degrees of freedom ($L$ is the dimension of the vector $RZ$).
When setting up the test, the actual distribution of $\chi^2$ is approximated by the asymptotic one (Chi-square).

Like the F test, the Chi-square test is usually one-tailed. The desired size of the test is achieved by appropriately choosing a critical value in the right tail of the Chi-square distribution. The null is rejected if the Chi-square statistic is larger than the critical value.
Please cite as:
Taboga, Marco (2021). "Linear regression - Hypothesis testing", Lectures on probability theory and mathematical statistics. Kindle Direct Publishing. Online appendix. https://www.statlect.com/fundamentals-of-statistics/linear-regression-hypothesis-testing.