Session CLRM Review 4
Session CLRM Review 4
Sisir Debnath
Indian Institute of Technology Delhi
September 3, 2021
Introduction
A sampling experiment:
Draw 30 observations at random from the population. Compute the regression. Repeat
100 times. Display estimates.
Stata Program:
set seed 2038947
mat M = J(100,1,.)
forvalues i = 1(1)100{
insheet using healthcare.csv, comma clear
sample 30
reg hhninc educ
matrix M[‘i’,1]= b[educ]
}
svmat M, names(b educ)
save b educ, replace
#delimit;
twoway (hist b educ, xline(.019963, lc(red) lw(thick)) bin(40))
(kdensity b edu, xtitle(‘‘Coefficient on Education")
ytitle(‘‘Frequency")
legend(label(1 ‘‘Density") label(2
‘‘Distribution of b")));
#delimit cr
1000
Frequency
500 0
Density Distribution of b
Endowment of the stochastic properties of the model upon the least squares
estimator. The estimator is a function of the observed (realized) data.
−1
b = X′X X′y
−1
= X′X X ′ (Xβ + ϵ)
−1 ′
= β + X′X X ϵ
|{z} | {z }
The true parameter Sampling error
−1
b = X′X X′y
−1
= X′X X ′ (Xβ + ϵ)
−1 ′
= β + X′X X ϵ
|{z} | {z }
The true parameter Sampling error
n
= β + ∑ vi ϵi
i=1
|{z}
The true parameter (constant) | {z }
Linear function of the errors
CC′ = (A + D)(A′ + D′ )
= (AA′ + DA′ + AD′ + DD′ )
I = CX
I = (A + D)X
I = AX + DX
−1 ′
I = X′X X X + DX
I = I + DX
DX = 0
−1 ′
DA′ = D X′X X′
=0
= AD′
−1
Therefore, CC′ = (AA′ + DD′ ) = (X ′ X ) + DD′
HSL719 2021-22 IITD Sisir Debnath
Efficiency of b
Var[b0 ] = σ2 CC′
−1
= σ2 X ′ X + DD′
= Var[b] + σ2 DD′
Gauss-Markov Theorem:
In the classical linear regression model with regressor matrix X, the least square
estimator b is the minimum variance linear unbiased estimator of β. For any vector of
constants w, the minimum variance linear unbiased estimator of w′ β is given by w′ b,
where b is the least square estimator.