0% found this document useful (0 votes)

9 views32 pages

311 Slide ch6

Chapter 6 of Wooldridge’s textbook discusses relaxing the assumption of linearity in regression models, introducing log-level, level-log, and log-log models to account for nonlinear relationships. It also covers the use of quadratic and interaction terms to analyze varying marginal effects and decision-making based on residual analysis. Examples illustrate how these models can be applied in real estate to understand price changes relative to factors like age and area.

Uploaded by

zhihanyu3

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views32 pages

311 Slide ch6

Uploaded by

zhihanyu3

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Chapter 6 of Wooldridge’s textbook

1
Big Picture

In this lecture you learn

1. how to relax the assumption of linearity

2. models with log term, squared term, and interaction term

3. decision-making based on residual analysis

2
Assumption of Linearity

1. Consider a simple regression

y = β0 + β1 x + u (1)

2. This regression is linear in the sense that it assumes constant marginal effect of x on y :
dy
= β1 = constant (2)
dx
So when x changes, y changes at a constant rate

3. In the graph, the linear model can be represented by a straight line

4. For example, if y is house price and x is age, then the linear model assumes the
depreciation rate is constant

5. In reality the relationship between y and x can be nonlinear or the marginal effect can be
varying. That is the motivation for nonlinear models such as the log-level model

3
Log-level model

1. Consider an exponential growth model

y = eβ0 +β1 x+u (3)

For instance, y can be the number of confirmed cases of coronavirus and x is time

2. We get the log-level model after we take natural log

log(y) = β0 + β1 x + u (log-level model) (4)

3. We can show the marginal effect of log-level model is not constant

dy
= β1 eβ0 +β1 x+u = β1 y 6= constant (5)
dx
4. (critical thinking) Can we use log-level model when y takes negative values?

5. (critical thinking) Which model is easier to estimate, (3) or (4)?

4
An Approximation

1. There is an approximation when A and B are close

A−B
≈ log(A) − log(B) (6)
B
A−B
2. Proof (optional): for x ≈ 0 we have log(1 + x) ≈ x. We can prove (6) by letting x ≡ B
and applying the property of log function:
A−B
≡x
B
≈ log(1 + x)

A−B B A−B
= log 1 + = log +
B B B

A
= log
B
= log(A) − log(B)

3. In short, 100 times log difference approximates percentage change

5
Log-Level Model and Percentage Change

Consider taking derivative of (4) with respect to x :

d log(y)
β1 = (7)
dx
Thus when dx = 1 it follows that

β1 = d log(y) ⇒ 100β1 = percentage change of y (8)

In short, 100 times β1 in the log-level model gives the percentage change of y when x
changes by one unit

6
Example 1

7
Example 1

1. We use House data

2. We regress rprice onto age

3. The coefficient of age is -337. So age rising by one year is associated with price
decreasing by 337 dollar

4. We wonder whether 337 dollar is a big or small change. To put that number into
perspective, we divide it by the average house price 83721. The ratio times 100 equals
0.4031 percent

5. Alternatively, we get a similar percentage change using the log-level model

8
Example 1—continued

9
In Class Discussion

Consider a new level-log model

y = β0 + β1 log(x) + u (Level-log model)

dy
1. Find the marginal effect dx

2. Is the marginal effect constant?

3. How to tell which model to use, log-level or level-log? (Hint: compare the graphs of
y = ex and y = log(x))

10
Log-log Model

If we take logs of both dependent and independent variables, we get the log-log model that
indicates elasticity

log(y) = β0 + β1 log(x) + u (log-log model) (9)

where β1 measures the percentage change of y when x changes by one percent (not one unit).
d log(y) 100d log(y) percent change of y
β1 = = = = elasticity (10)
d log(x) 100d log(x) percent change of x
In short β1 measures elasticity

11
Example 2

1. We generate the log value using stata log function and gen command

2. We then fit the log-log model

3. The coefficient of log area is 0.7845. So area rising by one percent is associated with
price increasing by 0.7845 percent (less than one percent)

4. The price-area relationship is inelastic

12
Example 2

13
Model with Quadratic (Squared) Term

1. Another way to account for non-linearity (non-constant marginal effect) is using a

quadratic model:

y = β0 + β1 x + β2 x 2 + u (Quadratic Model) (11)

2. The marginal effect depends on x, so is non-constant

dy
= β1 + 2β2 x (12)
dx
3. (True or False) β1 measures the marginal effect

4. Compared to models using log values, the quadratic model can allow for a turning point.

5. There is a minimum if β2 > 0 and maximum if β2 < 0. By setting (12) to zero we locate
the turning point at
turning point β1
x =− (13)
2β2
14
Testing Constant Marginal Effect

1. From (12) it is evident that testing

H0 : β2 = 0 (14)

is the same as testing the marginal effect is constant

2. We reject the null hypothesis if the t statistic of β2 exceeds 1.96 in absolute value, or its
p-value is less than 0.05

3. We can start with a quadratic model. If it turns out that β2 is insignificant, we just drop
the squared term x2 and run a linear model. This is the general-to-specific modeling
strategy

15
Example 3

16
Example 3

1. We first regress house price onto age without a squared term

2. The fitted line is a downward-sloping straight line (why?)

3. Obviously the linear model does a bad job predicting those old houses with age greater
than 100

4. Most old houses lying above the fitted line implies systematic prediction errors and
existence of a turning point or nonlinear relationship

17
Example 3—continued

18
Example 3—continued

1. We next regress house price onto age and its squared term

2. The fitted line is a parabola (cup) facing upward

3. We get a better fit because now those old houses scatter around the new fitted line

19
Example 3—continued

20
Example 3—continued

1. The coefficient of squared term age2 is 8.29, positive and statistically significant at 5%
level

2. t statistic of β2 is 10.62 > 1.96, rejecting the hypothesis (14) of constant marginal effect

3. We use formula (13) to show that the minimum (turning point) is located where age =
87.87.

4. Before the turning point, the house price falls when a house gets older. After the turning
point, the house price starts to rise.

21
In Class Discussion

Consider the relationship between rprice and area

1. Can a house be too big in the sense that there is a turn point, after which rprice starts to
fall when area keeps rising?

2. What is the sign you expect for the squared area?

3. How to find the optimal area that maximizes the house price?

22
Model with Interaction Term

1. We can include an interaction term (product of two regressors) to allow the marginal
effect of one regressor to depend on the other regressor

y = β0 + β1 x1 + β2 x1 x2 + u (Model with Interaction Term) (15)

2. The marginal effect of x1 depends on x2 (called interaction effect)

dy
= β1 + β2 x2 (16)
dx1

3. Testing the hypothesis of no interaction effect amounts to testing

H0 : β2 = 0 (17)

23
Example 4

24
Example 4

1. A house is young if its age is less than 18, the average age. Otherwise a house is old

2. We run two regressions separately using young and old houses

3. We find that for young houses the marginal effect of baths on rprice is 28580, greater
than the marginal effect of 21032 of old houses

4. In short, we find evidence supporting the interaction effect (i.e., age matters for the
marginal effect of baths on rprice)

25
Example 4—continued

26
Example 4—continued

1. We generate the interaction term

2. The coefficient of interaction term is -30.46

3. That number is negative, so age affects the marginal effect of baths negatively

4. That number is significant at 10% level (according to the p-value), so the interaction
effect exists

5. From (16) we know that for a brand-new house (age=0), one more bathroom is
associated with 29350 price increase. For an one-year old house (age=1), the marginal
effect is 29350-30 = 29320

27
In Class Exercise

1. We want to know whether the depreciation rate of an aging house depends on the
number of bathrooms

2. Please specify a proper model and run a regression to find the answer.

3. Is the interaction effect statistically significant?

4. Which house depreciates faster when it gets old, the one with one bathroom or two
bathrooms?

28
Prediction and Residual Analysis

1. Consider a simple model

y = β0 + β1 x + u

2. The fitted or predicted value (denoted by ŷ) for given x = c is computed as

ŷ = β̂0 + β̂1 c (18)

where β̂ is the estimated coefficient

3. The prediction error is called residual (denoted by û)

û = y − ŷ (19)

4. A model over-predicts when y < ŷ or û < 0; otherwise the model under-predicts

5. ŷ is part of y explained by the model, while û captures the unexplained part

29
Example 5

30
Example 5

1. A real-estate investor wants to find most under-valued houses (bargain), whose actual
prices are less than the predicted price

2. We obtain the fitted value and residual using stata predict command

3. We sort houses based on residuals

4. We list the five houses with most negative residuals

5. The best bargain is a house sold at the price of 76804. The predicted price from the
regression is 161742

31
Application of Regression Analysis

1. How to run a regression to help IRS to detect potential tax-cheaters?

2. How to design a cellphone app that uses regression to report the calorie you burn given
the number of steps you walk?

Cook P. Fundamentals of HTML, SVG, CSS and JavaScript For Data Visual. 2022
No ratings yet
Cook P. Fundamentals of HTML, SVG, CSS and JavaScript For Data Visual. 2022
87 pages
Assignment 1 - S3975055 - Vu Lam Le - OMGT1039
No ratings yet
Assignment 1 - S3975055 - Vu Lam Le - OMGT1039
6 pages
MGCR 271: Assignment #4 Fall 2019
No ratings yet
MGCR 271: Assignment #4 Fall 2019
10 pages
How To Apply Initial Stress Using INISTATE
No ratings yet
How To Apply Initial Stress Using INISTATE
4 pages
Chapter 6
No ratings yet
Chapter 6
23 pages
ECO 401 Econometrics: SI 2021 Week 5, 12 October
No ratings yet
ECO 401 Econometrics: SI 2021 Week 5, 12 October
31 pages
Lecture 11
No ratings yet
Lecture 11
62 pages
PS06 2023
No ratings yet
PS06 2023
2 pages
Lecture 5
No ratings yet
Lecture 5
36 pages
12-Econometrics-Linear Regression
No ratings yet
12-Econometrics-Linear Regression
18 pages
Tute6Answers ECON339
No ratings yet
Tute6Answers ECON339
5 pages
MIFI 564 - UNIT 1 - New
No ratings yet
MIFI 564 - UNIT 1 - New
53 pages
Linear Regression Program So Far
No ratings yet
Linear Regression Program So Far
33 pages
Regression Analysis
No ratings yet
Regression Analysis
52 pages
Dummy Variables Regressions
No ratings yet
Dummy Variables Regressions
32 pages
Regression
No ratings yet
Regression
72 pages
Basic Eco No Metrics - Assignment 2
No ratings yet
Basic Eco No Metrics - Assignment 2
9 pages
Stock Watson 3U ExerciseSolutions Chapter8 Instructors
No ratings yet
Stock Watson 3U ExerciseSolutions Chapter8 Instructors
14 pages
Sessions 18 19 - Regression - SLR MLR
No ratings yet
Sessions 18 19 - Regression - SLR MLR
70 pages
Multiple Regression Real Estate Example PDF
No ratings yet
Multiple Regression Real Estate Example PDF
6 pages
Logarithmic Functional Form
No ratings yet
Logarithmic Functional Form
20 pages
Stats 101 - Class 03
No ratings yet
Stats 101 - Class 03
94 pages
CUHK STAT5102 Ch1
No ratings yet
CUHK STAT5102 Ch1
54 pages
Lecture Plan 12 - 16!1!1
No ratings yet
Lecture Plan 12 - 16!1!1
7 pages
Estimating Demand: Regression Analysis
No ratings yet
Estimating Demand: Regression Analysis
29 pages
Econometrics - Lecture 1
No ratings yet
Econometrics - Lecture 1
28 pages
Econ 306 HW 3
No ratings yet
Econ 306 HW 3
7 pages
Econometric Methods
No ratings yet
Econometric Methods
4 pages
Chapter 4: Economic Analysis
No ratings yet
Chapter 4: Economic Analysis
18 pages
Estimating Demand: Learn How To Interpret The Results of Regression Analysis Based On Demand Data
No ratings yet
Estimating Demand: Learn How To Interpret The Results of Regression Analysis Based On Demand Data
18 pages
CH - 05 - Further Issues - TQT
No ratings yet
CH - 05 - Further Issues - TQT
35 pages
Mehak Fatima QRM Exam
No ratings yet
Mehak Fatima QRM Exam
4 pages
Selvanathan 7e - 17
No ratings yet
Selvanathan 7e - 17
93 pages
AI Lec 3
No ratings yet
AI Lec 3
36 pages
Tutorial 1-13 Answer Intermediate Macro
No ratings yet
Tutorial 1-13 Answer Intermediate Macro
40 pages
Regression: Introduction: Basic Idea: Use Data To Identify Among Variables and Use These Relationships To Make
No ratings yet
Regression: Introduction: Basic Idea: Use Data To Identify Among Variables and Use These Relationships To Make
23 pages
ECON326 Midterm
No ratings yet
ECON326 Midterm
5 pages
Introductory Econometrics: Regression Functional Form, Model Selection, Prediction
No ratings yet
Introductory Econometrics: Regression Functional Form, Model Selection, Prediction
32 pages
Tutorial Answers
No ratings yet
Tutorial Answers
5 pages
Introductory Econometrics A Modern Approach 6th Edition Wooldridge Solutions Manual 1
100% (78)
Introductory Econometrics A Modern Approach 6th Edition Wooldridge Solutions Manual 1
8 pages
Lecture 4.3 Regression-1
No ratings yet
Lecture 4.3 Regression-1
30 pages
Unit No. 2
No ratings yet
Unit No. 2
30 pages
Chapter 2 Regression and Forecasting
No ratings yet
Chapter 2 Regression and Forecasting
88 pages
ECN225sol4 PDF
0% (1)
ECN225sol4 PDF
5 pages
Lecture 6
No ratings yet
Lecture 6
11 pages
Lec 11
No ratings yet
Lec 11
4 pages
11 - Econometrics - Linear Regression
No ratings yet
11 - Econometrics - Linear Regression
20 pages
Undergraduate Econometric
No ratings yet
Undergraduate Econometric
15 pages
Simple Linear Regression
100% (1)
Simple Linear Regression
50 pages
CH 06
No ratings yet
CH 06
22 pages
Principles of Econometrics, 5th Ed. (R. Carter Hill, William E. Griffiths Etc.) (Z-Lib - Org) - 345-353
No ratings yet
Principles of Econometrics, 5th Ed. (R. Carter Hill, William E. Griffiths Etc.) (Z-Lib - Org) - 345-353
9 pages
Tut Sol Week12
No ratings yet
Tut Sol Week12
8 pages
Econ 3044: Introduction To Econometrics Chapter-4: MLR: Further Issues and Dummy Variables
No ratings yet
Econ 3044: Introduction To Econometrics Chapter-4: MLR: Further Issues and Dummy Variables
43 pages
Introduction To Linear Regression and Correlation Analysis: Objectives
100% (1)
Introduction To Linear Regression and Correlation Analysis: Objectives
33 pages
Linear Regression
No ratings yet
Linear Regression
20 pages
pricei = β + β sqfti + β agei + β baths + e: Question 1 (7 marks)
No ratings yet
pricei = β + β sqfti + β agei + β baths + e: Question 1 (7 marks)
11 pages
AI Lab7
No ratings yet
AI Lab7
13 pages
Statistics For Business Analysis: Learning Objectives
No ratings yet
Statistics For Business Analysis: Learning Objectives
37 pages
回归结果
No ratings yet
回归结果
1 page
参考文献
No ratings yet
参考文献
1 page
作业
No ratings yet
作业
9 pages
CH 02 Wooldridge 5e ppt20250307
No ratings yet
CH 02 Wooldridge 5e ppt20250307
51 pages
Network Engineer - Praneesha Martha
No ratings yet
Network Engineer - Praneesha Martha
4 pages
Lista de Accesorios Nueva
No ratings yet
Lista de Accesorios Nueva
11 pages
1 Application For Site Approval of Industrial Accelerator Radiation Processing Facility Iarpf
No ratings yet
1 Application For Site Approval of Industrial Accelerator Radiation Processing Facility Iarpf
3 pages
Solar Energy
No ratings yet
Solar Energy
41 pages
Presentasi Bulldozer D6N LGP
No ratings yet
Presentasi Bulldozer D6N LGP
28 pages
Healthcare Generative AI Hackathon
No ratings yet
Healthcare Generative AI Hackathon
12 pages
Production Planning & Controlling: Sybba Sem 4 Chapter1
No ratings yet
Production Planning & Controlling: Sybba Sem 4 Chapter1
18 pages
Assigning Items To Catalogs - TEST
No ratings yet
Assigning Items To Catalogs - TEST
10 pages
VHB Exhaust Only Hood
No ratings yet
VHB Exhaust Only Hood
3 pages
F3 Fixture
No ratings yet
F3 Fixture
2 pages
IEC 61850 Process Bus
No ratings yet
IEC 61850 Process Bus
3 pages
First Grade of Primary
No ratings yet
First Grade of Primary
3 pages
SM PDF
No ratings yet
SM PDF
417 pages
Iso 14001 Static 16x9
100% (1)
Iso 14001 Static 16x9
13 pages
Qualcomm 213
No ratings yet
Qualcomm 213
28 pages
Wpq-105-03 Gmaw 3g Jose A. Rivas
No ratings yet
Wpq-105-03 Gmaw 3g Jose A. Rivas
1 page
Audio Technica ATH-M20x
No ratings yet
Audio Technica ATH-M20x
1 page
Case Study
No ratings yet
Case Study
11 pages
63Y Set-Up EN XX
No ratings yet
63Y Set-Up EN XX
12 pages
Social Entrepreneurship: Assignment 1: Social Enterprise and Entrepreneur Desicrew Solutions and Saloni Malhotra
No ratings yet
Social Entrepreneurship: Assignment 1: Social Enterprise and Entrepreneur Desicrew Solutions and Saloni Malhotra
3 pages
Color Video Doorphone Kit: 1byone Products Inc
No ratings yet
Color Video Doorphone Kit: 1byone Products Inc
19 pages
Colleges List
No ratings yet
Colleges List
28 pages
MPC 509
No ratings yet
MPC 509
22 pages
Abrir 02L085006 Service+Manual+VCF85
100% (3)
Abrir 02L085006 Service+Manual+VCF85
85 pages
Nomenclature of IC Engines
No ratings yet
Nomenclature of IC Engines
3 pages
Tybsc-It Sem5 SPM Apr19
No ratings yet
Tybsc-It Sem5 SPM Apr19
2 pages
Kollmorgen AKM - Servomotor
No ratings yet
Kollmorgen AKM - Servomotor
44 pages
PyQt Tutorial
No ratings yet
PyQt Tutorial
11 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

311 Slide ch6

Uploaded by

311 Slide ch6

Uploaded by

Chapter 6 of Wooldridge’s textbook

In this lecture you learn

1. how to relax the assumption of linearity

2. models with log term, squared term, and interaction term

3. decision-making based on residual analysis

1. Consider a simple regression

3. In the graph, the linear model can be represented by a straight line

1. Consider an exponential growth model

y = eβ0 +β1 x+u (3)

2. We get the log-level model after we take natural log

log(y) = β0 + β1 x + u (log-level model) (4)

3. We can show the marginal effect of log-level model is not constant

5. (critical thinking) Which model is easier to estimate, (3) or (4)?

1. There is an approximation when A and B are close

3. In short, 100 times log difference approximates percentage change

Consider taking derivative of (4) with respect to x :

β1 = d log(y) ⇒ 100β1 = percentage change of y (8)

1. We use House data

2. We regress rprice onto age

5. Alternatively, we get a similar percentage change using the log-level model

Consider a new level-log model

y = β0 + β1 log(x) + u (Level-log model)

2. Is the marginal effect constant?

log(y) = β0 + β1 log(x) + u (log-log model) (9)

2. We then fit the log-log model

4. The price-area relationship is inelastic

1. Another way to account for non-linearity (non-constant marginal effect) is using a

y = β0 + β1 x + β2 x 2 + u (Quadratic Model) (11)

2. The marginal effect depends on x, so is non-constant

1. From (12) it is evident that testing

is the same as testing the marginal effect is constant

1. We first regress house price onto age without a squared term

2. The fitted line is a downward-sloping straight line (why?)

2. The fitted line is a parabola (cup) facing upward

Consider the relationship between rprice and area

2. What is the sign you expect for the squared area?

y = β0 + β1 x1 + β2 x1 x2 + u (Model with Interaction Term) (15)

2. The marginal effect of x1 depends on x2 (called interaction effect)

3. Testing the hypothesis of no interaction effect amounts to testing

2. We run two regressions separately using young and old houses

1. We generate the interaction term

2. The coefficient of interaction term is -30.46

3. Is the interaction effect statistically significant?

1. Consider a simple model

2. The fitted or predicted value (denoted by ŷ) for given x = c is computed as

ŷ = β̂0 + β̂1 c (18)

where β̂ is the estimated coefficient

3. The prediction error is called residual (denoted by û)

4. A model over-predicts when y < ŷ or û < 0; otherwise the model under-predicts

5. ŷ is part of y explained by the model, while û captures the unexplained part

3. We sort houses based on residuals

4. We list the five houses with most negative residuals

1. How to run a regression to help IRS to detect potential tax-cheaters?

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.