What is heteroskedasticity and why does it matter?

Heteroskedasticity means the variance of the regression error changes across observations rather than being constant. OLS remains unbiased, but its usual standard errors are biased, so t-tests and confidence intervals are no longer valid. The fix is to use heteroskedasticity-robust (White) standard errors.

What is the White test for heteroskedasticity?

The White test regresses the squared OLS residuals on the regressors, their squares and cross-products. Under the null of homoskedasticity, the sample size times the R-squared of that auxiliary regression has a chi-squared distribution, with degrees of freedom equal to the number of slope terms. A large value rejects homoskedasticity.

When should you use Newey-West standard errors?

Use Newey-West (HAC) standard errors when the errors may be serially correlated, which is common in time-series regressions. They are robust to both serial correlation and heteroskedasticity. If there is no serial correlation, White standard errors are preferable because they are more precise.

Diagnostic Testing in the Linear Regression Model

When the homoskedasticity or no-serial-correlation assumptions fail, OLS stays unbiased but its usual standard errors are wrong. Diagnostic tests such as the White test and the LM test detect these problems, and robust (White or Newey-West) standard errors restore valid inference.

How to read these notes

These notes follow the lecture on diagnostic testing and robust inference. They assume you already know the basics of OLS and hypothesis testing. The focus is on what goes wrong when two of the classical assumptions break, how to test for the breakage, and how to fix the standard errors so that t-tests remain valid.

1. Which assumptions are at stake

The large-sample analysis of OLS rests on a set of assumptions. Two of them concern the structure of the error variance:

The two assumptions under test

LS3 — Homoskedasticity: \(E[u_t^2\,|\,X]=\sigma_0^2\). The error variance is constant.
LS4 — No serial correlation: \(E[u_t u_s\,|\,X]=0\) for all \(t e s\). Errors at different times are uncorrelated.

The key fact to fix in your mind is that violations of LS3 and LS4 do not bias OLS. Unbiasedness comes from linearity (LS1) and exogeneity, \(E[u\,|\,X]=0\) (LS2). Since \(\hat\beta=\beta_0+(X'X)^{-1}X'u\) and the exogeneity assumption makes the second term mean-zero, OLS is unbiased regardless of the error variance structure. What LS3 and LS4 affect is the variance of OLS, and therefore the validity of standard errors, t-tests and confidence intervals.

2. Heteroskedasticity: the variance is not constant

Heteroskedasticity means the conditional variance of the error changes across observations: \(E[u_t^2\,|\,X]=\sigma_t^2(x)\). A simple example in the notes is \(u_t=v_t x_{1t}\) with \(E[v_t^2\,|\,X]=\sigma^2\), which gives \(E[u_t^2\,|\,X]=\sigma^2 x_{1t}^2\) — the error variance grows with a regressor, producing a tell-tale U-shaped pattern when residuals are plotted. A classic empirical case is a regression of credit-card spending on income, where the spread of the residuals widens as income rises.

The damage shows up in the asymptotic variance of OLS, which in general is the "sandwich" form

√T(β̂ − β₀) →^d N( 0, Q⁻¹ΩQ⁻¹ )

where \(Q=E[x_tx_t']\) and \(\Omega=\lim \mathrm{Var}(T^{-1/2}\sum_t x_t u_t)\). Under homoskedasticity \(\Omega=\sigma_0^2 Q\), so the sandwich collapses to the familiar \(\sigma_0^2 Q^{-1}\) and the usual OLS standard errors are correct. Under heteroskedasticity \(\Omega e\sigma_0^2 Q\), the usual formula is biased, and the t-statistic is no longer asymptotically standard normal.

The fix: White (robust) standard errors

White (1980) showed that \(\Omega\) can be estimated directly from the squared residuals, \(\hat\Omega^{W}=T^{-1}\sum_t \hat u_t^2 x_t x_t'\). Plugging this into the sandwich gives the heteroskedasticity-robust variance estimator \(\hat Q^{-1}\hat\Omega^{W}\hat Q^{-1}\), and the resulting t-statistics are asymptotically standard normal whether or not heteroskedasticity is present. Most software produces these with one option.

3. Testing for heteroskedasticity: the White test

To decide whether robust standard errors are needed, we test the null of homoskedasticity directly. The White test works as follows:

Run the original regression by OLS and obtain the residuals \(\hat u_t\).
Regress the squared residuals \(\hat u_t^2\) on the regressors, their squares and their cross-products. For example, with \(x_t=(1,x_{1t})'\), regress \(\hat u_t^2\) on \(x_{1t}\) and \(x_{1t}^2\).
Under the null of homoskedasticity, \(T R^2\) from this auxiliary regression has a \(\chi^2_q\) distribution, where \(q\) is the number of slope coefficients in the auxiliary regression.
Reject homoskedasticity if \(T R^2\) exceeds the critical value.

The intuition is that if the error variance is genuinely constant, nothing about the regressors should help predict the squared residuals, so the auxiliary \(R^2\) should be near zero. A large \(R^2\) indicates the variance depends on the regressors.

4. Weighted least squares: regaining efficiency

Robust standard errors fix inference but leave OLS inefficient — under heteroskedasticity OLS is no longer the best linear unbiased estimator. If we actually know the form of the heteroskedasticity, we can do better with weighted least squares (WLS).

Suppose we know \(\sigma_t(x)\), for example \(\sigma_t=\sigma x_{1t}\) (quadratic heteroskedasticity). Dividing the whole regression through by \(x_{1t}\),

y_t/x_1t = (x_t/x_1t)′β₀ + u_t/x_1t

produces a transformed model whose error \(u_t/x_{1t}\) has constant variance \(\sigma^2\). Running OLS on the transformed variables is WLS, and it is efficient and unbiased. The idea is that WLS gives more weight to the low-variance observations, which carry more reliable information, instead of weighting every point equally as OLS does. When the variance is known only up to parameters, such as \(\sigma_t^2(x)=\exp(\alpha_0'x_t)\), those parameters can be estimated (often by maximum likelihood) and used to construct feasible weights.

5. Serial correlation: errors that move together over time

In time-series regressions the no-serial-correlation assumption LS4 often fails: the error in one period is correlated with the error in the next, so the variance matrix \(E[uu'\,|\,X]\) has non-zero off-diagonal terms. Like heteroskedasticity, serial correlation does not bias OLS but it does bias the usual standard errors, so t-tests are invalid without a robust correction.

A natural idea would be to estimate the off-diagonal terms with products of residuals, but this fails: the cross-product \(\sum_s\sum_t \hat u_s \hat u_t x_s x_t'\) collapses because \(X'\hat u=0\) by construction of OLS. The solution is the Newey-West estimator, which adds weighted residual cross-products at increasing lags:

Ω̂^NW = Ω̂^W + T⁻¹ Σ_h w_h ( cross-products at lag h )

The fix: Newey-West (HAC) standard errors

Newey-West standard errors are robust to both serial correlation and heteroskedasticity (they are "HAC": heteroskedasticity- and autocorrelation-consistent). The weights \(w_h\) — typically Bartlett weights \(w_h=1-h/B\) — decline with the lag distance, and the bandwidth \(B\) must grow slowly with the sample size for consistency.

6. Testing for serial correlation: the LM test

To test whether the error is serially correlated, we use a Lagrange Multiplier (LM) test, mirroring the structure of the White test:

Run the original regression by OLS and obtain the residuals \(\hat u_t\).
Run the auxiliary regression of \(\hat u_t\) on the original regressors and lagged residuals \(\hat u_{t-1},\dots,\hat u_{t-q}\).
Under the null of no serial correlation, \(T R^2\) from this regression has a \(\chi^2_q\) distribution, with \(q\) equal to the number of lagged residual terms.

Including the original regressors in the auxiliary regression matters: it ensures the test is valid even if the regressors are correlated with the lagged errors.

7. Which standard errors should you use?

The choice is a precision-versus-robustness trade-off. The most robust option is not always the best, because robustness costs precision.

Situation	Preferred standard errors	Why
No heteroskedasticity, no serial correlation	Ordinary OLS	Most precise; the classical formula is correct.
Heteroskedasticity, no serial correlation	White	Robust to heteroskedasticity and more precise than Newey-West.
Serial correlation (with or without heteroskedasticity)	Newey-West	The only option that is consistent under serial correlation.

The practical rule that comes out of the tutorials: do not reach for Newey-West standard errors when there is no evidence of serial correlation. If the LM test finds no serial correlation, White standard errors are preferable because they are more precise. Always test before you choose.

Econometrics tuition

Diagnostic testing and robust inference are core exam topics and constant sources of confusion in empirical projects. For 1-1 help with heteroskedasticity, serial correlation, White/Newey-West standard errors or WLS, see econometrics tuition or statistics tuition.

Free videos: the @economaths channel has worked videos on heteroskedasticity, GLS, autocorrelation and Newey-West standard errors.

Diagnostic testing in the linear regression model

How to read these notes

1. Which assumptions are at stake

2. Heteroskedasticity: the variance is not constant

3. Testing for heteroskedasticity: the White test

4. Weighted least squares: regaining efficiency

5. Serial correlation: errors that move together over time

6. Testing for serial correlation: the LM test

7. Which standard errors should you use?

Need help with this topic?

Diagnostic testing in the linear regression model

How to read these notes

1. Which assumptions are at stake

2. Heteroskedasticity: the variance is not constant

3. Testing for heteroskedasticity: the White test

4. Weighted least squares: regaining efficiency

5. Serial correlation: errors that move together over time

6. Testing for serial correlation: the LM test

7. Which standard errors should you use?

Related study notes

Need help with this topic?