Simple Regression (Continued) : Y Xu Y Xu
Simple Regression (Continued) : Y Xu Y Xu
Y 0 1X u
(stochastic form)
Y 0 1 X u
SRF
The SRF is determined on the basis of a simple assumption: E( u )=0. In which case we
can write the SRF in its deterministic form as:
Y
0
1X
ui 0 .This says nothing about the residuals for any particular observation and needs
i 1
i i
explanatory variable. This also follows from the first order condition.
(3) The point ( XY ) is always on the regression line. This follows from equation
6a:
0 Y
1X
Y.
for the actual values ( Y ). In other words Y
0
(5) The fitted values and the residuals are uncorrelated: (Yu)
u
u
or Y Y
(6) Y Y
Some new concepts
(1) Total sum of squares (TSS or SST)
(2) Explained sum of squares (ESS)
(3) Residual or sum of squares (RSS )
TSS=
(Y
Y ) 2 - the total variation of the actual Y values about their sample mean.
i 1
ESS=
(Y Y )
i
i 1
Empirical Economics
26163
Handout 2
RSS=
i 1
i 1
2
(Yi Y ) 2 = ui -unexplained variation.
Yi Y
*
Yi
u
i Variation in Yi not explained by the regression
Yi Y
i
SRF : Y 0 1 X
It can easily be proven that TSS=ESS+RSS but in order to tell us how well the RHS
variables explain the variation in the LHS variable we compute what is known as the R2.
SRL:
This is known as the coefficient of determination - ESS/TSS or 1-[RSS/TSS].
In our example: 1-[9.5515/393.60]=0.9757
R2 =0.98 means that 98% of variation in Y.
If the SRF fits the data well that the ESS should be much larger than the RSS.
If SRF fits the data poorly that the RSS should be much larger than the ESS.
In the extreme where X explains no variation in Y RSS=TSS and ESS=0.
These are polar cases.
If ESS is relatively larger than RSS the SRF explains a substantial proportion of
the variation in Y.
If RSS is relatively larger than ESS the SRF explains a small proportion of the
variation in Y
We would like the ESS to be bigger RSS
Expected values and variances of the OLS estimators
By utilising the statistical properties of OLS estimation we can obtain expected values
and variance of the OLS estimators.
This means that we are looking at the properties of the distributions of
(they are estimators of the parameters
0 and 1
0 and 1
3 Zero conditional mean: E(uX)=0: the error term is totally independent of the
explanatory variables. That is, given the value of X the mean or expected value of e
is zero.
the factors which are not explicitly included in the model and
therefore subsumed in u do not systematically affect the mean
value of Y- positive us cancel out negative us.
If E(uX)=0 hold then E(YX)=0+1X
4 X is nonstochastic.
Nonstochastic means that the X is fixed. Consider the relationship between
wages (Y) and education (X). We may subdivide education into those with primary
(7), secondary (14) and tertiary (17 or 19 etc).. Each time X is fixed at a certain
level. For each of the fixed Xs there is an associated distribution of Y
If each X is nonstochastic then this assumption [E(uX)=0] is automatically
filled. Since by extension if X is fixed then it is independent of u.
One can use this assumption of a fixed X to explore the covariance of X and u :
Cov(X, u )=0
Since the X are nonstochastic
is an unbiased estimator of .
E ( 1 ) 1
E( 0 ) 0
If we assume that
on X:
E( u X ) 0
This means
2:
Var( u X ) E u E( u )
Var( u X ) E u E( u )
Var( u X ) E u 2
E u
2
Empirical Economics
26163
Handout 2
Hence
E( u 2 ) 2
However we cannot observe these values what we have are the estimates. Hence we use
an estimator of which is . Also
for the error variance becomes
2
u i2
i 1
n2
E( u ) 0 and
E( X u ) 0 .
And the standard error of the regression (or root mean square error) is defined as
n
u i2
i 1
n2
and
2 and
( ) and
the estimators (and hence their standard errors). That is, we use the
to derive standard errors of our estimators
and
1 ,
denoted as
se( 1 ) repectively.
(1) The larger the error variance-2 the larger will be the variance of the estimator.
(2) The larger the variability in X the smaller the variance in
(3)We can also find the standard deviation of
1 : / TSSX
1.
=
var(1 ) 2 / TSSX
We do not have
/s
[ s SSTX ].
and NOT
1 ; se( 1 )= / s
is an estimator of sd( 1 )= / s .
se(
1 )
Empirical Economics
26163
Handout 2
(Xi X ) 2
i
1
1/ 2
or se(
1 )
var(
1 )
(Xi X ) 2
i
1
se( 0 ) :
n
i1
i1
i1
i1
2n 1 X 2 / (Xi X ) 2 and
var( 0 )
2n 1 X 2 / (Xi X ) 2
se( 0 )
Note: Importance of the minimum variance property.
Another ideal property of an estimator is that of minimum variance. That is in the class
of all estimators the var(
1 ) and var(
0 ) must be the lowest. If this is achieved then
the estimators are said to be efficient. Therefore in addition to being unbiased
estimators should also be efficient. An efficient estimator is a reliable estimator.
Empirical Economics
26163
Handout 2
E(Y X) 0 1X
X1
X2
X3
Empirical Economics
26163
Handout 2
Heretoskedasticity
Figure 2.9 Woolridge.
E(Y X) 0 1X
X
X1
X2
X3
Not all the Y values corresponding to the various Xs are equally reliable in the
sense reliability is being judged by how closely the Y values are distributed
around the means.
If we pick an individual from a group with a smaller variance it will be more likely
that the corresponding Y for this individual will be closer to the mean than if we
pick an individual from a group with a bigger variance.
To summarise: given the value of X the variance of ut is the same for all observations.
That is the conditional variances of u t are identical.
2 E (u 2 ) var(u)
1.
2.
3.
The square root is the standard deviation of the error. A larger means
the distribution of the unobserve-ables affecting the dependent variable is
more spread out.
This has implication for the variance of the slope parameters.
Empirical Economics
26163
Handout 2
~
1
Xi Yi
t 1
(12)
Xi2
t 1
is the estimator.
Yi Xi nY X
i1
n
Xi2
i1
nX
Empirical Economics
26163
Handout 2
Summary
We can now specify the 2-variable linear regression model by listing its important
assumptions:
1. Linear in parameter: Y=0+1X+u
2. X is nonstochastic value is fixed. (this means that each independent variable is
controlled by the researcher, who can change its value in accordance with the
experimental objectives).This assumption implies assumption 3- that is, if the Xs are
fixed it must by extension be independent of the error.
3. Zero conditional mean: E(uX)=0
4. The error has a zero expected value E(u)=0.
5. The error has a constant variance for all observations, that is, var(uX)=E(u2)=2 (the
property of homoskedasticity)
6. The random variable ui are statistically independent, thus E(ui,uj)=0. This means that
the errors associated with any two observations are independent (reflects an absence of
serial correlation).
7. The error term is normally distributed (return to this later on)
8. Other assumptions
The X values must not all be the same- variation is necessary to use regression
analysis as a research tool.
The number of observation is greater than the numbers of parameters to be
estimated. Otherwise the parameters could not be estimated- from a single
observation there is no way to obtain two parameters (2 unknowns and one
equation)
The regression model is correctly specified. That is there is no specification
bias.
Empirical Economics
26163
Handout 2