Chapter 3
Chapter 3
Dimitrios Asteriou
and
Stephen G Hall
Applied Econometrics 4rd edition
SIMPLE REGRESSION
Learning Objectives
• Compute the equation of a simple regression line from a
sample of data, and interpret the slope and intercept of
the equation.
• A full understanding of the simple OLS method of
estimation and discussion of the properties of estimated
coefficients.
• Computation of a standard error of the estimate and
interpretation of its meaning and its use in Hypothesis
Testing.
• Understanding and interpretation of the R2
Applied Econometrics 4rd edition
Introduction
Regression Models
• Deterministic Regression Model: Y=b0+b1X
• Probabilistic Regression Model: Y=β0+β1X+u
Regression Models
Probabilistic Regression Model: Yt=a+βXt+u
• Why probabilistic?
E(Yt)=a+βXt
Yt=E(Yt) +ut
Yt=a+βXt+u
Applied Econometrics 4rd edition
Regression Models
Why probabilistic?
X
300
250
200
X
150
100
50
0
60 80 100 120 140 160 180
Applied Econometrics 4rd edition
• By eye
• Connecting the first with the last observation
• Take the average of the first two and the
average of the two last and connect
• Apply Ordinary Least Squares
Applied Econometrics 4rd edition
Yˆ b0 b1 X
where : b0 = the sample intercept
b1 = the sample slope
Yˆ = the predicted value of Y
Applied Econometrics 4rd edition
Y X
b Y b X n b n
0 1 1
Applied Econometrics 4rd edition
SSXY X X Y Y XY
X Y
n
2
SSXX X X
2
X
2
X
n
SSXY
b1 SSXX
Y X
b Y b X n b n
0 1 1
Applied Econometrics 4rd edition
Why OLS?
1. Elimination of the effect of the sign (positive and
negative residuals offset each other)
2. More weight is given to larger residuals and so we
work harder to reduce the very large errors
3. It chooses a method that follow some numerical and
statistical properties (remember unbiasedness,
efficiency etc)
Applied Econometrics 4rd edition
Unbiasedness
Efficiency
BLUEness
Consistency
Applied Econometrics 4rd edition
C2=B2*A2
D2=B2*B2
A22=SUM(A2:A21)
B22=SUM(B2:B21)
and so on!
Excel
Applied Calculations
Econometrics 4rd edition
b0=(C22-(A22*B22)/20)/(D22-((B22ˆ2)/20))=0.601888903
b1=AVERAGE(A2:A21)-G2*AVERAGE(B2:B21)=15.116408
Applied Econometrics 4rd edition
Excel Results
250
200
X
150
Linear (X)
100
50
0
60 80 100 120 140 160 180
Applied Econometrics 4rd edition
ls y c x (press ‘enter’)
TSS=ESS+RSS
R2 = ESS/TSS or R2 = 1 – (RSS/TSS)
28
Applied Econometrics 4rd edition
S
H 1: 1 0
b
S
where: S e
H 0: 1 0
b
SSXX
SSE
S
H 1: 1 0 n2
e
SSXX
2 X
2
H 0: 1 0 X
n
the hypothesized slope
H 1: 1 0 1
df n 2
Applied Econometrics 4rd edition