
Lecture 7: The Multiple Regression Model - Part III

Dr. Keith Wong

The Chinese University of Hong Kong

SEEM 3570: Stochastic Models


Introduction

Having discussed the classical normal linear regression model, we now revisit some of its assumptions and explore situations in which they are violated.
We also seek improvements (if any) should violations occur.
We focus on revisiting the following two assumptions:
No linear relationship exists between two or more of the independent variables.
The error variances are identical and constant (homoscedasticity).



Multi-collinearity

If perfect collinearity exists, the regression estimators â, b̂1, …, b̂k are not well-defined.
Intuitively, recall that the coefficient bi measures the change in Y when xi is shifted by one unit, with all other variables held constant.
However, if a linear relationship exists between two or more of the independent variables, it is impossible to change the value of one of them without changing the value(s) of some of the rest.
Hence the previous interpretation is no longer valid.



Multi-collinearity
To illustrate, suppose R = a + b1 Sd + b2 Sw + ε, where R is the average exam score, Sd is the average study hours per day, and Sw is the average study hours per week.
Obviously, average study hours per day and per week possess an exact linear relationship: 7 Sd = Sw.
There is no way to vary one of them while holding the other constant.
Mathematically, recall that

β̂ = (XᵀX)⁻¹(XᵀY),

where Y = (Y1, …, Yn)ᵀ and X is the n × (k + 1) design matrix whose i-th row is (1, x1i, …, xki).

If an exact linear relationship exists among the independent variables, the determinant of XᵀX is zero.
Hence, in calculating β̂, the inverse (XᵀX)⁻¹ is not well defined.
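To see the singularity numerically, here is a minimal Python sketch with simulated (hypothetical) study-hours data; the variable names simply mirror the example above and are not from the lecture.

```python
import numpy as np

# Hypothetical data mirroring the example: Sw = 7 * Sd exactly.
rng = np.random.default_rng(0)
n = 20
Sd = rng.uniform(1, 5, size=n)                 # average study hours per day
Sw = 7 * Sd                                    # average study hours per week
R = 50 + 8 * Sd + rng.normal(0, 5, size=n)     # simulated exam scores

# Design matrix with an intercept column.
X = np.column_stack([np.ones(n), Sd, Sw])

print(np.linalg.matrix_rank(X))                # 2, not 3: one column is redundant
print(np.linalg.det(X.T @ X))                  # ~0 up to floating-point error
# Consequently (X'X)^{-1}, and hence beta_hat, is not well defined;
# np.linalg.inv(X.T @ X) would be numerically meaningless here.
```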
Multi-collinearity

In practice, we face a more challenging situation: independent variables with a high degree of multi-collinearity.
There is no clear-cut way to detect it in general.
Least squares estimates are still possible!
Interpretation remains difficult.
The distributions of β̂ are quite sensitive to the correlation between the independent variables, and also to the magnitude of the standard errors sâ, sb̂1, …, sb̂k.
Large standard errors are likely to result.



Multi-collinearity

For illustration purposes, a three-variable regression (two independent variables) gives

sb̂u² = s² / (Sxu xu (1 − r²)),   u = 1, 2,
Cov(b̂1, b̂2) = −s² r / ((1 − r²) √(Sx1 x1 Sx2 x2)),

where r = Sx1 x2 / √(Sx1 x1 Sx2 x2) is the simple correlation between x1 and x2.

When r → 1 or −1, sb̂u² gets very large.
In hypothesis tests, it then tends to be more difficult to reject the null hypothesis.
Remark: However, it can happen that the overall model is still significant (F-test).
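The variance inflation can be seen in a small simulation. The following is an illustrative sketch (simulated data, made-up coefficients), computing the OLS standard errors directly from s²(XᵀX)⁻¹:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 200

def slope_std_errors(r):
    """OLS standard errors of b1, b2 when corr(x1, x2) is approximately r."""
    x1 = rng.normal(size=n)
    x2 = r * x1 + np.sqrt(1 - r**2) * rng.normal(size=n)    # correlation ~ r
    y = 1.0 + 2.0 * x1 + 3.0 * x2 + rng.normal(size=n)
    X = np.column_stack([np.ones(n), x1, x2])
    beta = np.linalg.solve(X.T @ X, X.T @ y)
    resid = y - X @ beta
    s2 = resid @ resid / (n - 3)                            # s^2 with n - k - 1 dof
    cov = s2 * np.linalg.inv(X.T @ X)                       # estimated Cov(beta_hat)
    return np.sqrt(np.diag(cov))[1:]

for r in (0.0, 0.9, 0.99, 0.999):
    print(r, slope_std_errors(r))   # standard errors grow roughly like 1/sqrt(1 - r^2)
```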



Heteroscedasticity

Recall the assumption of homoscedasticity, i.e. the variances of the error terms ε1, …, εn are identically equal to σ².
This may be inappropriate in some situations.
For example, consider family income vs. expenditure.
It is not surprising that low-income families have rather steady spending patterns, while high-income families have rather volatile spending patterns.
Hence the error variances associated with high-income families should be higher than those associated with low-income families.
If we apply the regression model to this situation, the least squares estimators are still unbiased, but no longer efficient (i.e. not of minimum variance among unbiased estimators).
This motivates us to consider techniques for handling non-identical error variances.



Heteroscedasticity

In general, there are many approaches.
For simplicity, we now consider a particular setting: the error variances vary directly with an independent variable.
Further assume the σi² are known.
Consider the following two-variable regression model:

Yi = a + b xi + εi   for i = 1, …, n.

Here, we assume that σi² = E[εi²] = C xi², where C > 0 is a constant.
Remark: this model may fit the income-expenditure example.
Idea: transform the model into one with identical error variances.



Heteroscedasticity

Upon dividing both sides of the model by xi, we have

Yi / xi = a / xi + b + εi / xi   for i = 1, …, n,

or, equivalently,

Yi* = a* + b* xi* + εi*   for i = 1, …, n,

where Yi* = Yi / xi, a* = b, b* = a, xi* = 1 / xi, and εi* = εi / xi.
Then Var(εi*) = E[(εi / xi)²] = (1 / xi²) · C xi² = C ← constant variance!
Least squares estimation of the parameters can now be applied to the transformed model.
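As a sanity check, here is a minimal sketch with simulated data satisfying σi² = C xi² (C = 0.5 is an arbitrary choice); it fits the transformed model by ordinary least squares and recovers a and b, keeping in mind the role swap a* = b, b* = a:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 100
x = rng.uniform(1, 10, size=n)
eps = rng.normal(scale=np.sqrt(0.5) * x)       # Var(eps_i) = 0.5 * x_i^2
y = 3.0 + 2.0 * x + eps                        # true a = 3, b = 2

# Transformed model: Y_i/x_i = b + a*(1/x_i) + eps_i/x_i has constant error variance.
y_star = y / x
x_star = 1.0 / x
Xs = np.column_stack([np.ones(n), x_star])
a_star, b_star = np.linalg.lstsq(Xs, y_star, rcond=None)[0]

b_hat, a_hat = a_star, b_star                  # undo the role swap: a* = b, b* = a
print(a_hat, b_hat)                            # roughly 3 and 2
```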



Heteroscedasticity - Test

An informal but useful way is to examine the pattern of the residuals, e.g. a plot of the squared residuals ε̂i² against time for a time-series model (see the sketch below).
For the specific alternative hypothesis that σi² = C xi², the Goldfeld-Quandt test can be used.
Idea:
Calculate two regression lines, one using the data thought to be associated with low-variance errors, and the other using the data thought to be associated with high-variance errors.
If the residual variances associated with each regression line are approximately equal, the homoscedasticity assumption cannot be rejected.
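As an illustration of the informal residual check, the following sketch uses simulated data (plotted here against xi rather than time) and shows the squared residuals fanning out when the errors are heteroscedastic:

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(3)
x = rng.uniform(1, 10, size=100)
y = 3.0 + 2.0 * x + rng.normal(scale=x)        # error std dev grows with x

X = np.column_stack([np.ones(len(x)), x])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
resid = y - X @ beta

plt.scatter(x, resid**2)                       # squared residuals spread out as x grows
plt.xlabel("x_i")
plt.ylabel("squared residual")
plt.show()
```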



Heteroscedasticity - Goldfeld-Quandt Test

The procedure is outlined as follows:

1. Order the data by the magnitude of the independent variable xi.
2. Omit the middle d observations. d might be chosen, for example, to be one-fifth of the total sample size.
3. Fit two separate regressions, the first for the portion of the data associated with low values of xi, and the second for the portion associated with high values of xi. Each model involves (n − d)/2 observations and (n − d)/2 − 2 degrees of freedom.
4. Calculate the residual sum of squares associated with each regression: ESS_low for the model with low values of xi, and ESS_high for that with high values of xi.
5. Given that the error process is normally distributed, ESS_high / ESS_low ∼ F((n − d − 4)/2, (n − d − 4)/2). We can reject the null hypothesis at a chosen level of significance if ESS_high / ESS_low is greater than the critical value of the F distribution.

Remark: The test can be applied to the multiple regression model with k independent variables; the degrees of freedom for the F distribution are then (n − d − 2k − 2)/2.
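Below is a minimal implementation sketch of this procedure for the two-variable model, run on simulated data; the drop fraction d ≈ n/5 follows the suggestion in step 2, and statsmodels also ships a het_goldfeldquandt diagnostic if a library routine is preferred.

```python
import numpy as np
from scipy import stats

def goldfeld_quandt(y, x, drop_frac=0.2):
    """Goldfeld-Quandt test for y = a + b*x + eps against variance increasing with x."""
    n = len(y)
    order = np.argsort(x)                          # 1. order the data by x
    d = int(round(drop_frac * n))                  # 2. omit the middle d observations
    m = (n - d) // 2
    low, high = order[:m], order[-m:]

    def ess(idx):                                  # 3./4. fit a line, return residual sum of squares
        X = np.column_stack([np.ones(len(idx)), x[idx]])
        beta, *_ = np.linalg.lstsq(X, y[idx], rcond=None)
        resid = y[idx] - X @ beta
        return resid @ resid

    f_stat = ess(high) / ess(low)                  # 5. ESS_high / ESS_low
    dof = m - 2                                    # (n - d)/2 - 2 degrees of freedom each
    p_value = stats.f.sf(f_stat, dof, dof)
    return f_stat, p_value

# Simulated heteroscedastic data: Var(eps_i) proportional to x_i^2.
rng = np.random.default_rng(4)
x = rng.uniform(1, 10, size=120)
y = 3.0 + 2.0 * x + rng.normal(scale=x)
print(goldfeld_quandt(y, x))                       # large F and small p-value -> reject homoscedasticity
```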



Suggested Readings

Chapters 4 and 6 of Robert S. Pindyck and Daniel L. Rubinfeld, Econometric Models and Economic Forecasts (4th Edition), McGraw-Hill, Inc., 1997.

