Session CLRM Review 4

The document discusses the finite sample properties of least squares estimators, contrasting them with asymptotic properties. It presents an application using German health care panel data to illustrate the estimation process and properties such as linearity, unbiasedness, and efficiency of the least squares estimator. Additionally, it introduces the Gauss-Markov Theorem, asserting that the least squares estimator is the minimum variance linear unbiased estimator in classical linear regression models.


Finite Sample Properties of Least Squares

Sisir Debnath
Indian Institute of Technology Delhi

September 3, 2021
Introduction

Estimates and estimators

Properties of an estimator - the sampling distribution

“Finite sample” properties as opposed to “asymptotic” or “large sample” properties

HSL719 2021-22 IITD Sisir Debnath


Application: Health Care Panel Data

German health care usage data.

7,293 individuals.
Varying numbers of periods per individual; 27,326 observations in total.
Data downloaded from the Journal of Applied Econometrics Archive.
The number of observations per individual ranges from 1 to 7 (frequencies: 1=1525, 2=2158, 3=825, 4=926, 5=1051, 6=1000, 7=987).

Variable descriptions:
DOCVIS    number of doctor visits in last three months
HOSPVIS   number of hospital visits in last calendar year
DOCTOR    1(number of doctor visits > 0)
HOSPITAL  1(number of hospital visits > 0)
HSAT      health satisfaction, coded 0 (low) - 10 (high)
PUBLIC    insured in public health insurance = 1; otherwise = 0
ADDON     insured by add-on insurance = 1; otherwise = 0
HHNINC    household nominal monthly net income in German marks / 10000
          (4 observations with income = 0 were dropped)
HHKIDS    children under age 16 in the household = 1; otherwise = 0
EDUC      years of schooling
AGE       age in years
MARRIED   marital status

For now, treat this sample as if it were a cross section, and as if it were the full population.
Application: Health Care Panel Data, Population Regression
. reg hhninc educ

      Source |       SS           df       MS      Number of obs   =    27,326
-------------+----------------------------------   F(1, 27324)     =   2019.63
       Model |  58.8590627         1  58.8590627   Prob > F        =    0.0000
    Residual |  796.318636    27,324   .02914356   R-squared       =    0.0688
-------------+----------------------------------   Adj R-squared   =    0.0688
       Total |  855.177698    27,325  .031296531   Root MSE        =    .17071

------------------------------------------------------------------------------
      hhninc |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
        educ |    .019963   .0004442    44.94   0.000     .0190923    .0208336
       _cons |   .1260903   .0051337    24.56   0.000      .116028    .1361526
------------------------------------------------------------------------------



Application: A Sampling Experiment

A sampling experiment:
Draw 30 observations at random from the population. Compute the regression. Repeat
100 times. Display estimates.

Stata Program:

set seed 2038947
mat M = J(100,1,.)
forvalues i = 1(1)100 {
    insheet using healthcare.csv, comma clear
    sample 30, count
    reg hhninc educ
    matrix M[`i',1] = _b[educ]
}
clear
svmat M, names(b_educ)
save b_educ, replace
#delimit ;
twoway (hist b_educ1, xline(.019963, lc(red) lw(thick)) bin(40))
       (kdensity b_educ1),
       xtitle("Coefficient on Education")
       ytitle("Frequency")
       legend(label(1 "Density") label(2 "Distribution of b")) ;
#delimit cr
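The same Monte Carlo exercise can be sketched in Python. Since the healthcare.csv extract is not reproduced here, this sketch draws from a synthetic stand-in population; all parameters (slope 0.02, intercept 0.126, error scale 0.17) are hypothetical values chosen to mimic the regression above.

```python
import numpy as np

rng = np.random.default_rng(2038947)

# Synthetic stand-in for the population (hypothetical parameters;
# the assumed slope 0.02 plays the role of the population value .019963)
n_pop, beta0, beta1, sigma = 27_326, 0.126, 0.02, 0.17
educ = rng.uniform(7, 18, n_pop)
hhninc = beta0 + beta1 * educ + rng.normal(0, sigma, n_pop)

# Draw 30 observations at random, regress, store the slope; repeat 100 times
slopes = []
for _ in range(100):
    idx = rng.choice(n_pop, size=30, replace=False)
    X = np.column_stack([np.ones(30), educ[idx]])
    coef, *_ = np.linalg.lstsq(X, hhninc[idx], rcond=None)
    slopes.append(coef[1])

slopes = np.asarray(slopes)
print(slopes.mean())  # the 100 estimates scatter around the population slope
```

A histogram of `slopes` reproduces the sampling-distribution picture on the next slide.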



Application: A Sampling Experiment

[Figure: histogram of the 100 estimated coefficients on education with a kernel density overlay; x-axis “Coefficient on Education” (.018 to .022), y-axis “Frequency”; the estimates center on the population value .019963.]


Motivating Least Squares

The sample of data from the population: Data generating process is y = Xβ + ϵ.

The stochastic specification of the regression model: assumptions about the random ϵ.

Endowment of the stochastic properties of the model upon the least squares estimator. The estimator is a function of the observed (realized) data.

b = (X′X)⁻¹X′y
  = (X′X)⁻¹X′(Xβ + ϵ)
  = β + (X′X)⁻¹X′ϵ

where β is the true parameter and (X′X)⁻¹X′ϵ is the sampling error.



Properties of b

Therefore, b is a vector of random variables.

The assumption of nonstochastic regressors, and how it is used at this point:

We do the analysis conditional on an X, then show that the results do not depend on the particular X in hand, so the result must be general, i.e., independent of X.

Properties of the least squares estimator b:

b is linear.
b is unbiased.
b is the most efficient (best) linear unbiased estimator.
b is consistent (we will discuss this property later).



Linearity of b

b = (X′X)⁻¹X′y
  = (X′X)⁻¹X′(Xβ + ϵ)
  = β + (X′X)⁻¹X′ϵ
  = β + Σ_{i=1}^n v_i ϵ_i

where β is the true parameter (a constant), v_i is the i-th column of (X′X)⁻¹X′, and Σ_i v_i ϵ_i is a linear function of the errors.
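A quick numerical check of this decomposition, using a small hypothetical design (all numbers illustrative): b is linear in y, and equals β plus the error-weighted sum with weights v_i.

```python
import numpy as np

rng = np.random.default_rng(1)

# Small hypothetical design for illustration
n = 100
X = np.column_stack([np.ones(n), rng.uniform(0, 10, n)])
beta = np.array([1.0, 0.25])
eps = rng.normal(size=n)
y = X @ beta + eps

A = np.linalg.inv(X.T @ X) @ X.T   # k x n; column i of A is v_i
b = A @ y                          # b is a linear function of y

# b = beta + sum_i v_i * eps_i, i.e. the true parameter plus sampling error
b_from_errors = beta + A @ eps
print(np.allclose(b, b_from_errors))  # True
```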



Unbiasedness of b

Crucial assumption: ϵ is uncorrelated with X; more precisely, E[ϵ|X] = 0.

b = (X′X)⁻¹X′y
  = (X′X)⁻¹X′(Xβ + ϵ)
  = β + (X′X)⁻¹X′ϵ

Now take expectations conditional on X:

E[b|X] = β + E[(X′X)⁻¹X′ϵ | X] = β + (X′X)⁻¹X′E[ϵ|X] = β

Therefore, iterating expectations over X,

E[b] = E_X(E[b|X]) = E_X(β) = β


Efficiency of b
Crucial assumption: ϵ_i has zero mean and is uncorrelated with every other ϵ_j, with Var[ϵ_i|X] = σ², so that Var[ϵ|X] = σ²I.

b = β + (X′X)⁻¹X′ϵ
b − β = (X′X)⁻¹X′ϵ

Var[b|X] = E[(b − β)(b − β)′ | X]
         = E[((X′X)⁻¹X′ϵ)((X′X)⁻¹X′ϵ)′ | X]
         = (X′X)⁻¹X′ E[ϵϵ′|X] X(X′X)⁻¹
         = (X′X)⁻¹X′ (σ²I) X(X′X)⁻¹
         = σ²(X′X)⁻¹X′X(X′X)⁻¹
         = σ²(X′X)⁻¹

Unconditionally,

Var[b] = E(Var[b|X]) + Var(E[b|X])
       = σ²E[(X′X)⁻¹] + Var[β]
       = σ²E[(X′X)⁻¹] + 0
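The conditional variance formula can likewise be checked by simulation: with X held fixed, the empirical covariance of b across replications should match σ²(X′X)⁻¹ (hypothetical design; a NumPy sketch):

```python
import numpy as np

rng = np.random.default_rng(7)

# Hypothetical fixed design
n, sigma = 80, 0.5
X = np.column_stack([np.ones(n), rng.normal(size=n)])
beta = np.array([1.0, 3.0])
A = np.linalg.inv(X.T @ X) @ X.T

# Each row of Y is one sample y = X beta + eps; each row of bs is one b
Y = X @ beta + rng.normal(scale=sigma, size=(50_000, n))
bs = Y @ A.T

empirical = np.cov(bs, rowvar=False)
theoretical = sigma**2 * np.linalg.inv(X.T @ X)
print(np.round(empirical, 5))
print(np.round(theoretical, 5))  # the two matrices agree closely
```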



Efficiency of b
An estimator is efficient if it is the minimum variance unbiased estimator.

How do we know that b is the most efficient estimator of β?

The Cramer-Rao inequality provides one verification of efficiency, since it establishes the lower bound for the variance-covariance matrix of any unbiased estimator. This lower bound is given by the corresponding element of the diagonal of the inverse of the information matrix (or sample information matrix) In(θ), which is defined as

In(θ) = −E[H(θ)]

where H denotes the Hessian matrix, i.e., the matrix of second partial derivatives of the log-likelihood function.

We will come back to this later. For the time being, let's consider a simpler proof.

Let b0 = Cy be another linear unbiased estimator of β.
Unbiasedness immediately implies CX = I (prove this).
C being a matrix of constants, we can write C = A + D, where A = (X′X)⁻¹X′.

Var[b0] = E[(b0 − β)(b0 − β)′]
        = Var[C(Xβ + ϵ)]
        = E[Cϵϵ′C′]
        = σ²CC′
Efficiency of b

CC′ = (A + D)(A′ + D′)
    = AA′ + DA′ + AD′ + DD′

Since CX = I:

I = CX = (A + D)X = AX + DX = (X′X)⁻¹X′X + DX = I + DX

so DX = 0. Hence

DA′ = DX(X′X)⁻¹ = 0 = AD′

Therefore, CC′ = AA′ + DD′ = (X′X)⁻¹ + DD′.
Efficiency of b

Var[b0] = σ²CC′
        = σ²((X′X)⁻¹ + DD′)
        = Var[b] + σ²DD′

Since σ²DD′ is positive semidefinite, every linear unbiased estimator has variance at least as large as that of b: b is the minimum variance linear unbiased estimator.
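The proof can be traced numerically with a concrete alternative. As an illustrative choice (not from the slides), take the "endpoints" slope estimator (y_n − y_1)/(x_n − x_1): it is linear and unbiased, so its weight vector c differs from the OLS weights a by some d with dX = 0, and its variance exceeds the OLS variance by exactly σ²dd′. A sketch with a hypothetical design:

```python
import numpy as np

rng = np.random.default_rng(3)

# Hypothetical design for illustration
n, sigma = 40, 1.0
x = np.sort(rng.uniform(0, 10, n))
X = np.column_stack([np.ones(n), x])

A = np.linalg.inv(X.T @ X) @ X.T
a = A[1]                    # OLS weights for the slope coefficient

# Alternative linear unbiased slope estimator: (y_n - y_1)/(x_n - x_1)
c = np.zeros(n)
c[0], c[-1] = -1 / (x[-1] - x[0]), 1 / (x[-1] - x[0])

d = c - a                   # the "D" of the proof, restricted to the slope row

# dX = 0, so the cross terms DA' and AD' vanish
print(np.allclose(d @ X, 0))                              # True

# Var[b0] = Var[b] + sigma^2 dd'  >=  Var[b]
var_ols = sigma**2 * (a @ a)
var_alt = sigma**2 * (c @ c)
print(np.isclose(var_alt, var_ols + sigma**2 * (d @ d)))  # True
print(var_ols < var_alt)                                  # True: OLS wins
```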



Gauss-Markov Theorem

Gauss-Markov Theorem:

In the classical linear regression model with regressor matrix X, the least squares estimator b is the minimum variance linear unbiased estimator of β. For any vector of constants w, the minimum variance linear unbiased estimator of w′β is given by w′b, where b is the least squares estimator.

