Econometrics2Notes (2)

The document discusses the use of dummy variables in regression analysis, emphasizing the importance of avoiding the dummy variable trap by using m-1 dummy variables. It also covers various statistical models for binary outcomes, including Linear Probability Model, Logit Model, and Probit Model, highlighting their properties and limitations. Additionally, it addresses time series models, stochastic processes, stationarity, unit root tests, and cointegration, providing essential definitions and equations for each concept.

Uploaded by

rokaya.ashraf2021


1 Dummy Variables and the Dummy Variable Trap
1.1 Notes from the Lecture
1. If a qualitative variable has m categories, introduce only m − 1 dummy
variables when the model includes an intercept. Using all m dummies together
with an intercept creates perfect collinearity, known as the dummy variable
trap.

2. The category for which no dummy variable is assigned is called the
benchmark category or the base category, and all comparisons
are made in relation to it.

3. The intercept value represents the mean value of the base category.

4. The coefficients of dummy variables represent differential intercepts,
showing how much the mean of the given category differs from the mean
of the benchmark category.

1.2 Additional Notes from the Book

5. The choice of the benchmark category is up to the researcher and does
not change the overall conclusion of the model.

6. If you introduce a dummy for each category, you must omit the intercept
term. For example:

$$Y_i = \beta_1 D_{1i} + \beta_2 D_{2i} + \beta_3 D_{3i} + u_i$$

Here, the coefficients of the dummy variables represent the mean values
of each category.

7. Including the intercept and m − 1 dummy variables is generally preferred
as it simplifies hypothesis testing and interpretation.
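The points above can be illustrated with a small sketch. The wage figures and category labels below are made up for illustration. The sketch relies on the fact that OLS on a full set of category dummies reproduces the group means, so the intercept and differential intercepts can be read off directly.

```python
# Hypothetical hourly wages for three categories; the numbers are invented.
wages = {
    "A": [10.0, 12.0, 14.0],   # chosen as the base (benchmark) category
    "B": [15.0, 17.0],
    "C": [8.0, 10.0, 9.0],
}

def mean(values):
    return sum(values) / len(values)

group_means = {g: mean(v) for g, v in wages.items()}

# Intercept plus m - 1 = 2 dummies (A omitted): the OLS intercept equals the
# mean of A, and each dummy coefficient equals that group's mean minus A's.
base = "A"
intercept = group_means[base]
differential = {g: group_means[g] - intercept for g in wages if g != base}

# The trap: with an intercept AND all m dummies, the dummy columns add up to
# the intercept column in every row, so the regressors are perfectly collinear.
rows = [(1, int(g == "A"), int(g == "B"), int(g == "C"))
        for g, values in wages.items() for _ in values]
trap = all(const == d_a + d_b + d_c for const, d_a, d_b, d_c in rows)

print(intercept, differential, trap)
```

Choosing a different base category changes the intercept and the differentials, but not the implied group means.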

1.3 Interaction Effects Using Dummy Variables

To account for interaction between qualitative variables, the model can
include interaction terms. For instance:

$$Y_i = \alpha_1 + \alpha_2 D_{2i} + \alpha_3 D_{3i} + \alpha_4 (D_{2i} D_{3i}) + \beta X_i + u_i$$

where:

• $Y_i$ is the dependent variable (e.g., hourly wage).

• $D_{2i}$ is a dummy variable for gender (1 if female, 0 otherwise).

• $D_{3i}$ is a dummy variable for race (1 if nonwhite/non-Hispanic, 0 otherwise).

• $X_i$ is a quantitative regressor (e.g., years of education).

This interaction model allows for non-additive effects between qualitative
variables, capturing more nuanced relationships (e.g., the effect of being
female may differ based on race).
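As a worked illustration of the interaction model, the four group-specific conditional means can be written out. This is a sketch derived from the equation, assuming $E(u_i) = 0$ and holding $X_i$ fixed.

```latex
\begin{aligned}
E(Y_i \mid D_{2i}=0,\, D_{3i}=0,\, X_i) &= \alpha_1 + \beta X_i \\
E(Y_i \mid D_{2i}=1,\, D_{3i}=0,\, X_i) &= \alpha_1 + \alpha_2 + \beta X_i \\
E(Y_i \mid D_{2i}=0,\, D_{3i}=1,\, X_i) &= \alpha_1 + \alpha_3 + \beta X_i \\
E(Y_i \mid D_{2i}=1,\, D_{3i}=1,\, X_i) &= \alpha_1 + \alpha_2 + \alpha_3 + \alpha_4 + \beta X_i
\end{aligned}
```

The differential for $D_{2i} = 1$ is therefore $\alpha_2$ when $D_{3i} = 0$ and $\alpha_2 + \alpha_4$ when $D_{3i} = 1$. If $\alpha_4 = 0$, the two qualitative effects are purely additive.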

2 Linear Probability Model (LPM)

The LPM is a binary response model: a regression model in which the
dependent variable is binary. The model is given by:

$$Y_i = \beta_1 + \beta_2 X_i + u_i \tag{1}$$

where $Y_i$ takes the observed values 1 or 0. The model is estimated by OLS,
and the fitted value is interpreted as the estimated probability that $Y = 1$.

Properties of LPM
1. $E(Y_i \mid X_i) = \beta_1 + \beta_2 X_i$, which is the conditional probability
$P(Y_i = 1 \mid X_i)$.

2. Disadvantages of LPM:

• Non-normality of the error term ui (has a remedy).

• Heteroskedasticity in ui (has a remedy).
• Predicted probabilities can fall outside the range [0,1] (does not
have a remedy).
• R2 may not be a meaningful measure of fit.
• The probability cannot be linearly related to all the independent
variables for all their possible values (the partial effect of any
explanatory variable is constant, which is a strong assumption).
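The out-of-range problem listed above can be seen in a minimal sketch. The data are invented, and `ols_simple` is a hypothetical helper implementing the textbook one-regressor OLS formulas.

```python
# Made-up data: x is a quantitative regressor, y is a 0/1 outcome.
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [0, 0, 0, 1, 0, 1, 1, 1]

def ols_simple(x, y):
    """Return (intercept, slope) of the OLS line of y on x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

b1, b2 = ols_simple(x, y)

def predicted_probability(x_value):
    """LPM fitted value, read as an estimate of P(Y = 1 | X)."""
    return b1 + b2 * x_value

# Fitted "probabilities" at small and large X fall outside [0, 1].
for x_value in (0, 4, 10):
    print(x_value, round(predicted_probability(x_value), 3))
```

Because the fitted line is unbounded, any sufficiently small or large X produces a value below 0 or above 1.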

3 Logit Model
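The logit model uses the logistic distribution function for modeling binary
outcomes:

$$P_i = \frac{e^{Z_i}}{1 + e^{Z_i}}, \qquad Z_i = \beta_1 + \beta_2 X_i \tag{2}$$

The log of the odds (the logit) is linear in $X_i$:

$$L_i = \ln\left(\frac{P_i}{1 - P_i}\right) = Z_i = \beta_1 + \beta_2 X_i \tag{3}$$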

Properties of Logit Model

1. Probabilities are bounded between 0 and 1.

2. Nonlinear in probabilities but linear in log-odds.

3. Slope coefficient $\beta_i$ represents the change in log-odds for a unit change
in $X_i$.

4. Logit model accommodates multiple regressors.

Odds ratio: The factor by which the odds change for a one-unit increase
in a predictor variable:

$$e^{\beta}$$

Proportional change in odds: The relative change in the odds for a one-unit
increase in the predictor:

$$e^{\beta} - 1$$

The marginal effect measures the change in probability resulting from a small
change in a predictor variable, evaluated at a given probability $P$. Formula:

$$\text{Marginal Effect} = P(1 - P) \times \beta$$

To approximate how probability changes with a change in X:

$$\text{Change in Probability} \approx \text{Marginal Effect} \times \text{Change in } X$$
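The logit quantities above can be checked numerically. This is a minimal sketch: the coefficients are hypothetical, chosen so that Z = 0 at X = 4, and the function names are illustrative.

```python
import math

def logit_probability(b1, b2, x):
    """P = e^Z / (1 + e^Z) with Z = b1 + b2 * x, written in the equivalent form 1 / (1 + e^-Z)."""
    z = b1 + b2 * x
    return 1.0 / (1.0 + math.exp(-z))

def odds(p):
    return p / (1.0 - p)

def marginal_effect(p, beta):
    """Slope of the probability with respect to X at probability p."""
    return p * (1.0 - p) * beta

b1, b2 = -2.0, 0.5                      # hypothetical coefficients
p4 = logit_probability(b1, b2, 4.0)     # Z = 0, so P = 0.5
p5 = logit_probability(b1, b2, 5.0)

odds_ratio = odds(p5) / odds(p4)        # equals e^{b2} for a one-unit increase
me = marginal_effect(p4, b2)            # 0.5 * 0.5 * 0.5 = 0.125

print(round(odds_ratio, 4), round(math.exp(b2), 4))
print(round(me, 4), round(p5 - p4, 4))  # linear approximation vs. actual change
```

The odds ratio is the same at every X, while the marginal effect depends on P, so the linear approximation is only close for small changes in X.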

Form of Y | Form of X | Interpretation
Y         | X         | When X changes by 1 unit, Y will change by β1 units.
ln Y      | ln X      | When X changes by 1%, Y will change by β1 % (elasticity).
Y         | ln X      | When X changes by 1%, Y will change by β1/100 units.
ln Y      | X         | When X changes by 1 unit, Y will change by (β1 × 100)%.

Table 1: Interpretation of Regression Coefficients from the Logit sheet

4 Probit Model
The probit model assumes a cumulative normal distribution for binary
outcomes:

$$P(Y = 1 \mid X) = \Phi(\beta_1 + \beta_2 X) \tag{4}$$

where $\Phi$ is the cumulative distribution function (CDF) of the standard
normal distribution, evaluated at $z = \beta_1 + \beta_2 X$:

$$\Phi(z) = \int_{-\infty}^{z} \frac{1}{\sqrt{2\pi}} \, e^{-t^2/2} \, dt \tag{5}$$

Properties of Probit Model

1. Probabilities are bounded between 0 and 1.

2. Nonlinear relationship between X and probabilities.

3. Typically used when normality of the underlying distribution is assumed.
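A minimal sketch of the probit probability follows, using the standard identity between the normal CDF and the error function. The marginal-effect formula (normal density times the coefficient) is a standard result that the notes do not state, so it is an addition here. The coefficients are hypothetical.

```python
import math

def std_normal_cdf(z):
    """Phi(z), written with the error function from the standard library."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def std_normal_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def probit_probability(b1, b2, x):
    return std_normal_cdf(b1 + b2 * x)

def probit_marginal_effect(b1, b2, x):
    """Derivative of Phi(b1 + b2 * x) with respect to x."""
    return std_normal_pdf(b1 + b2 * x) * b2

b1, b2 = -2.0, 0.5                     # hypothetical coefficients
print(probit_probability(b1, b2, 4.0))
print(round(probit_marginal_effect(b1, b2, 4.0), 4))
```

As in the logit model, the marginal effect varies with X because the density is evaluated at a different point for each observation.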

Model | LPM | Logit | Probit
Direction + Significance | Coefficients indicate direction and significance of X's effect on P(Y = 1|X). | Coefficients indicate direction and significance, but not directly the marginal effect on P(Y = 1|X). | Coefficients indicate direction and significance, but not directly the marginal effect.
Marginal Effect | Each coefficient represents the marginal effect of X on P(Y = 1|X). | The marginal effect must be computed since coefficients show the effect on ln(odds): ln(odds) = Z, odds = e^Z. | The marginal effect must be computed using the cumulative standard normal distribution.

Table 2: Comparison of LPM, Logit, and Probit Models

A disadvantage of the Logit and Probit models is that the parameters are
not easily interpreted.

Summary of Time Series Models and Their Properties
White Noise (WN)
• Definition: A purely random process with constant mean and variance
and no autocorrelation.

• Equation: $u_t \sim \text{IIDN}(0, \sigma^2)$.

• Properties:

– Mean is constant over time.

– Variance is constant over time.
– ACF is zero at all lags except lag 0.

Autoregressive (AR) Model

• Definition: The AR model expresses the current value of a time series
as a linear function of its past values and a stochastic error term.

• Equation: $Y_t = \phi_1 Y_{t-1} + \phi_2 Y_{t-2} + \dots + \phi_p Y_{t-p} + u_t$.

• Properties:

– Stationarity depends on the roots of the characteristic equation
being outside the unit circle.
– The autocorrelation function (ACF) decays gradually.
– The partial autocorrelation function (PACF) cuts off after lag p.
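The root condition can be checked directly for an AR(2). This is a minimal sketch, assuming the characteristic equation is written as $1 - \phi_1 z - \phi_2 z^2 = 0$; the function names and example coefficients are illustrative.

```python
import cmath

def ar2_roots(phi1, phi2):
    """Roots of 1 - phi1*z - phi2*z**2 = 0 (phi2 must be nonzero)."""
    # Rewrite as phi2*z**2 + phi1*z - 1 = 0 and apply the quadratic formula.
    disc = cmath.sqrt(phi1 ** 2 + 4.0 * phi2)
    return ((-phi1 + disc) / (2.0 * phi2), (-phi1 - disc) / (2.0 * phi2))

def is_stationary_ar2(phi1, phi2):
    """Stationary when every root lies outside the unit circle (modulus > 1)."""
    return all(abs(root) > 1.0 for root in ar2_roots(phi1, phi2))

print(is_stationary_ar2(0.5, 0.3))    # both roots outside the unit circle
print(is_stationary_ar2(0.5, 0.6))    # one root inside the unit circle
print(is_stationary_ar2(1.0, -0.5))   # complex roots with modulus sqrt(2)
```

Using `cmath.sqrt` keeps the check valid when the roots are complex, which is the case that produces damped cycles in the ACF.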

Moving Average (MA) Model

• Definition: The MA model expresses the current value of a time series
as a linear function of past error terms.

• Equation: $Y_t = u_t + \theta_1 u_{t-1} + \theta_2 u_{t-2} + \dots + \theta_q u_{t-q}$.

• Properties:

– Always stationary.
– The ACF cuts off after lag q.
– The PACF decays gradually.

Autoregressive Moving Average (ARMA) Model
• Definition: Combines the AR and MA models to explain a time series
using both past values and past error terms.

• Equation: $Y_t = \phi_1 Y_{t-1} + \dots + \phi_p Y_{t-p} + u_t + \theta_1 u_{t-1} + \dots + \theta_q u_{t-q}$.

• Properties:

– Stationarity depends on the AR part.

– ACF and PACF both exhibit mixed patterns (do not abruptly cut
off).
– Suitable for stationary time series.

Vector Autoregressive (VAR) Model

• Definition: A generalization of the AR model to multiple time series,
where each variable is explained by its own lags and the lags of other
variables.

• Equation: $Y_t = \Phi_1 Y_{t-1} + \dots + \Phi_p Y_{t-p} + u_t$.

• Properties:

– Captures the dynamic relationship between multiple time series.

– Requires stationarity of all variables.
– ACF and PACF are calculated for each variable in the system.

Notes on Stochastic Processes

Stochastic Processes
A stochastic process is a collection of random variables ordered in time.

Stationary Stochastic Process

A process is stationary if:

• Mean and variance are constant over time.

• Covariance depends only on the lag (distance) between two time periods,
not the actual time.

Key Properties:
• Mean Reversion: The series tends to return to its mean over time.
• Constant Fluctuations: Variance remains stable, indicating consistent
amplitude of fluctuations.
Importance of Stationarity:
• Nonstationary series are specific to the observed time period and
unsuitable for generalization or forecasting.

White Noise Process

A process is purely random (white noise) if:
• Mean = 0
• Variance = $\sigma^2$ (constant)
• Serially uncorrelated: $u_t \sim \text{IIDN}(0, \sigma^2)$, meaning the terms are
independently and identically distributed with a normal distribution.

Random Walk Models

Random Walk Without Drift:
• Equation: $Y_t = Y_{t-1} + u_t$, where $u_t$ is white noise.
• Properties:
– Mean: $E(Y_t) = Y_0$
– Variance: $\text{Var}(Y_t) = t\sigma^2$ (depends on time → nonstationary).
– Persistence of Shocks: Random shocks accumulate and do not
dissipate.
– First Difference: $\Delta Y_t = u_t$, which is stationary.
Random Walk With Drift:
• Equation: $Y_t = \delta + Y_{t-1} + u_t$
• $\delta$: Drift parameter, representing a deterministic trend.
• Properties:
– Mean: $E(Y_t) = Y_0 + t \cdot \delta$
– Variance: $\text{Var}(Y_t) = t\sigma^2$ (nonstationary).
– First Difference: $\Delta Y_t = \delta + u_t$, which is stationary.
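The mean and variance results for both random walks follow from repeated substitution. The derivation below is a sketch that assumes a fixed starting value $Y_0$ and shocks $u_i$ that are uncorrelated, with mean 0 and variance $\sigma^2$.

```latex
\begin{aligned}
Y_t &= Y_{t-1} + u_t = Y_{t-2} + u_{t-1} + u_t = \dots = Y_0 + \sum_{i=1}^{t} u_i \\
E(Y_t) &= Y_0 + \sum_{i=1}^{t} E(u_i) = Y_0 \\
\operatorname{Var}(Y_t) &= \sum_{i=1}^{t} \operatorname{Var}(u_i) = t\sigma^2 \\[4pt]
\text{With drift:}\quad Y_t &= Y_0 + t\delta + \sum_{i=1}^{t} u_i
\;\Rightarrow\; E(Y_t) = Y_0 + t\delta, \quad \operatorname{Var}(Y_t) = t\sigma^2
\end{aligned}
```

Every past shock enters $Y_t$ with weight one, which is why shocks persist rather than die out.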

Key Concepts
• Random walks (with or without drift) are nonstationary processes.
• Random walks exhibit stochastic trends.

Types of Trends
Deterministic Trend:
• Predictable and constant over time.
• Equation: $Y_t = \beta_1 + \beta_2 t + u_t$
• Subtracting the trend ($\beta_1 + \beta_2 t$) results in a stationary series. This
process is called detrending.
Stochastic Trend:
• Unpredictable and nonstationary.
• Found in random walks with or without drift.

Integrated Processes
A stochastic process requiring differencing d times to achieve stationarity is
said to be integrated of order d, denoted as I(d).
• Example:
– I(0): Stationary time series.
– I(1): Requires first differencing.
– I(2): Requires second differencing.
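The orders of integration listed above can be illustrated by cumulating and differencing a short sequence. This is a minimal sketch: the fixed list `shocks` stands in for a white-noise draw and is invented for illustration.

```python
from itertools import accumulate

def difference(series, d=1):
    """Apply the first-difference operator d times."""
    for _ in range(d):
        series = [b - a for a, b in zip(series, series[1:])]
    return series

shocks = [0.5, -1.0, 0.3, 0.8, -0.2, 0.1]   # stand-in for an I(0) series
i1 = list(accumulate(shocks))                # cumulated once: random-walk-like, I(1)
i2 = list(accumulate(i1))                    # cumulated twice: I(2)

# One difference of the I(1) series and two differences of the I(2) series
# both recover the original shocks (up to floating-point rounding).
print([round(v, 10) for v in difference(i1, 1)])
print([round(v, 10) for v in difference(i2, 2)])
```

Each difference shortens the series by one observation, so a series that is I(d) loses d observations before it becomes stationary.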

Autocorrelation Function (ACF) and Correlogram

Autocorrelation Coefficient $\rho_k$:
• Measures correlation between observations separated by k lags.
• Formula: $\rho_k = \dfrac{\text{Covariance at lag } k}{\text{Variance}}$

• Range: $-1 \le \rho_k \le 1$.
Correlogram:
• A plot of ρk against k (lags).
• High and slowly decaying ρk : Indicates nonstationarity.
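The sample counterpart of $\rho_k$ can be sketched as follows. The divisor n for both the covariance and the variance is one common convention and is an assumption here, as are the two example series.

```python
def sample_acf(series, max_lag):
    """Sample autocorrelations at lags 0..max_lag (covariance at lag k / variance)."""
    n = len(series)
    mean = sum(series) / n
    dev = [y - mean for y in series]
    gamma0 = sum(d * d for d in dev) / n
    acf = []
    for k in range(max_lag + 1):
        gamma_k = sum(dev[t] * dev[t + k] for t in range(n - k)) / n
        acf.append(gamma_k / gamma0)
    return acf

alternating = [1.0, -1.0] * 10          # strong negative lag-1 correlation
trending = [float(t) for t in range(20)]  # trend: high, slowly decaying ACF

print([round(r, 3) for r in sample_acf(alternating, 2)])
print([round(r, 3) for r in sample_acf(trending, 3)])
```

Plotting the returned values against the lag gives the correlogram described above.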

Testing Autocorrelation Significance
Box–Pierce Q Statistic:

• Tests whether all autocorrelations up to lag m are zero.

• Approximation: $Q \sim \chi^2(m)$.

Ljung–Box (LB) Statistic:

• Variant of Q statistic with better small-sample properties.

• $LB \sim \chi^2(m)$.
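The notes do not give the formulas for the two statistics. The sketch below uses the standard definitions, $Q = n \sum_{k=1}^{m} \hat{\rho}_k^2$ and $LB = n(n+2) \sum_{k=1}^{m} \hat{\rho}_k^2 / (n-k)$, with invented autocorrelation values.

```python
def box_pierce(acf, n):
    """Q statistic; acf holds sample autocorrelations at lags 1..m, n is the sample size."""
    return n * sum(r * r for r in acf)

def ljung_box(acf, n):
    """LB statistic: each squared autocorrelation is weighted by 1 / (n - k)."""
    return n * (n + 2) * sum(r * r / (n - k) for k, r in enumerate(acf, start=1))

rho_hat = [0.5, 0.2]     # hypothetical autocorrelations at lags 1 and 2
n_obs = 100

q_stat = box_pierce(rho_hat, n_obs)
lb_stat = ljung_box(rho_hat, n_obs)
print(round(q_stat, 4), round(lb_stat, 4))
```

Both values are compared with a chi-square critical value with m degrees of freedom (here m = 2). A value above the critical value rejects the null that all autocorrelations up to lag m are zero.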

Notes on Unit Root Test and Stationarity

Unit Root Test
• Definition: The unit root test checks if a time series is stationary or
nonstationary.

• Key Concept: Start with the equation:

$$Y_t = \rho Y_{t-1} + u_t, \qquad -1 \le \rho \le 1$$

If $\rho = 1$, the equation becomes a random walk model without drift,
which is nonstationary.

• Reformulation: Subtract $Y_{t-1}$ from both sides:

$$Y_t - Y_{t-1} = (\rho - 1) Y_{t-1} + u_t$$

Let $\delta = (\rho - 1)$, then:

$$\Delta Y_t = \delta Y_{t-1} + u_t$$

– Null Hypothesis ($H_0$): $\delta = 0$ (time series has a unit root and is
nonstationary).
– Alternative Hypothesis ($H_1$): $\delta < 0$ (time series is stationary).

Dickey-Fuller (DF) Test
• If $H_0$ is true ($\delta = 0$), the t-statistic for the coefficient of $Y_{t-1}$ in the
regression does not follow the usual t distribution. It follows the $\tau$ (tau)
distribution, whose critical values are available in specialized tables.

• Variations of the DF Test:

1. Random walk without drift:

$$\Delta Y_t = \delta Y_{t-1} + u_t$$

2. Random walk with drift:

$$\Delta Y_t = \beta_1 + \delta Y_{t-1} + u_t$$

3. Random walk with drift and trend:

$$\Delta Y_t = \beta_1 + \beta_2 t + \delta Y_{t-1} + u_t$$
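The first variation (no drift, no trend) can be sketched with the one-regressor OLS formulas. The helper name, the simulated AR(1) series, and its parameters are assumptions for illustration. The resulting statistic must be compared with Dickey–Fuller critical values, not the usual t table.

```python
import math
import random

def dickey_fuller_tau(series):
    """Estimate delta in dY_t = delta * Y_{t-1} + u_t and return (delta, tau)."""
    lagged = series[:-1]
    diffs = [b - a for a, b in zip(series, series[1:])]
    sxx = sum(y * y for y in lagged)
    delta = sum(y * d for y, d in zip(lagged, diffs)) / sxx
    residuals = [d - delta * y for y, d in zip(lagged, diffs)]
    sigma2 = sum(e * e for e in residuals) / (len(diffs) - 1)  # one parameter estimated
    return delta, delta / math.sqrt(sigma2 / sxx)

# Simulated stationary AR(1) with rho = 0.5, so the true delta is -0.5.
rng = random.Random(0)
stationary = [0.0]
for _ in range(299):
    stationary.append(0.5 * stationary[-1] + rng.gauss(0.0, 1.0))

delta_hat, tau = dickey_fuller_tau(stationary)
print(round(delta_hat, 3), round(tau, 2))
```

A strongly negative tau, as here, is evidence against the unit-root null. The drift and trend variations add a constant and a time index to the same regression.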

Augmented Dickey-Fuller (ADF) Test

• Accounts for serial correlation in the error term by adding lagged
differences of $Y_t$:

$$\Delta Y_t = \beta_1 + \beta_2 t + \delta Y_{t-1} + \sum_{i=1}^{m} \alpha_i \Delta Y_{t-i} + \epsilon_t$$

• The number of lagged terms (m) is determined empirically to ensure $\epsilon_t$
is white noise.

• Null Hypothesis ($H_0$): $\delta = 0$ (unit root present).

F-Test for Joint Significance

• Test if both $\beta_1 = \beta_2 = 0$ (i.e., random walk without drift or trend):

1. Compare the restricted model (without intercept or trend) with
the unrestricted model.
2. Use specialized F-distribution critical values for inference.

The F-statistic is given by the equation:

$$F = \frac{\text{Explained Variation per Degree of Freedom}}{\text{Unexplained Variation per Degree of Freedom}} = \frac{SSR/(k-1)}{SSE/(n-k)} \tag{6}$$

where:

• SSR: Sum of Squares for Regression

• SSE: Sum of Squares for Error

• k: Number of parameters in the model

• n: Number of observations
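Equation (6) translates directly into a small function. The input values below are invented for illustration.

```python
def f_statistic(ssr, sse, n, k):
    """F = [SSR / (k - 1)] / [SSE / (n - k)], as in equation (6)."""
    explained_per_df = ssr / (k - 1)
    unexplained_per_df = sse / (n - k)
    return explained_per_df / unexplained_per_df

# Hypothetical example: SSR = 80, SSE = 20, n = 23 observations, k = 3 parameters.
f_value = f_statistic(ssr=80.0, sse=20.0, n=23, k=3)
print(f_value)   # (80 / 2) / (20 / 20) = 40.0
```

In the unit-root setting, the computed value is compared with the specialized critical values mentioned above, not with the standard F table.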

Difference-Stationary Processes (DSP)

• A time series with a unit root can be made stationary by taking first
differences:

$$\Delta Y_t = Y_t - Y_{t-1}$$

Trend-Stationary Processes (TSP)

• A TSP is stationary around a deterministic trend.

• To make a TSP stationary:

1. Regress Yt on time (t).

2. Use the residuals, which are stationary.
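The two detrending steps can be sketched with the one-regressor OLS formulas. The series below is invented: a linear trend $5 + 2t$ plus a small alternating disturbance.

```python
def detrend(series):
    """Regress Y_t on t = 1..n and return (b1, b2, residuals)."""
    n = len(series)
    t = list(range(1, n + 1))
    t_bar = sum(t) / n
    y_bar = sum(series) / n
    sxy = sum((ti - t_bar) * (yi - y_bar) for ti, yi in zip(t, series))
    sxx = sum((ti - t_bar) ** 2 for ti in t)
    b2 = sxy / sxx
    b1 = y_bar - b2 * t_bar
    residuals = [yi - (b1 + b2 * ti) for ti, yi in zip(t, series)]
    return b1, b2, residuals

disturbance = [0.5, -0.5] * 4                      # stand-in for stationary noise
y = [5.0 + 2.0 * t + e for t, e in zip(range(1, 9), disturbance)]

b1, b2, residuals = detrend(y)
print(round(b1, 3), round(b2, 3))
print([round(e, 3) for e in residuals])
```

The residuals are the detrended series. This is appropriate only if the series really is trend stationary; applying it to a difference-stationary series leaves the stochastic trend in place.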

Notes on Cointegration and Error Correction Mechanism
Cointegration
If two time series X and Y are nonstationary, and we can find a linear
combination of them that is stationary, the series are cointegrated, and a
regression of one on the other will not be spurious.
The Engle-Granger test is a single-equation, residual-based test applied to
time series of the same integration order (typically I(1)). The Johansen test
is a system-based alternative that can detect more than one cointegrating
relationship among several series.

Engle–Granger (EG) or Augmented Engle–Granger (AEG) Test
• The DF or ADF unit root tests can be applied by estimating a regression
of the form:

$$Y_t = \beta_1 + \beta_2 X_t + u_t,$$

obtaining the residuals $\hat{u}_t$, and applying the DF or ADF tests to $\hat{u}_t$.

• Since the residuals are based on the estimated cointegrating parameter
$\beta_2$, the standard DF and ADF critical values are not quite appropriate.
Engle and Granger have provided critical values for these tests.

• Modern software packages often include these critical values in their
outputs.
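The two steps of the EG procedure can be sketched on simulated data. Everything here is an assumption for illustration: the helper names, the simulated series (X is a random walk and Y equals 2 + 0.5 X plus stationary noise), and the use of the simplest DF regression without constant on the residuals.

```python
import math
import random

def ols_simple(x, y):
    """Return (intercept, slope) of the OLS line of y on x."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

def df_tau(series):
    """(delta, tau) from the regression d(series)_t = delta * series_{t-1} + e_t."""
    lagged = series[:-1]
    diffs = [b - a for a, b in zip(series, series[1:])]
    sxx = sum(v * v for v in lagged)
    delta = sum(v * d for v, d in zip(lagged, diffs)) / sxx
    resid = [d - delta * v for v, d in zip(lagged, diffs)]
    sigma2 = sum(e * e for e in resid) / (len(diffs) - 1)
    return delta, delta / math.sqrt(sigma2 / sxx)

rng = random.Random(42)
x = [0.0]
for _ in range(399):
    x.append(x[-1] + rng.gauss(0.0, 1.0))                 # X is I(1)
y = [2.0 + 0.5 * xi + rng.gauss(0.0, 0.5) for xi in x]    # Y shares X's stochastic trend

# Step 1: cointegrating regression. Step 2: unit root test on its residuals.
b1_hat, b2_hat = ols_simple(x, y)
u_hat = [yi - b1_hat - b2_hat * xi for xi, yi in zip(x, y)]
delta_hat, tau = df_tau(u_hat)
print(round(b2_hat, 3), round(delta_hat, 3), round(tau, 2))
```

The tau value is judged against Engle–Granger critical values, since the residuals depend on the estimated $\beta_2$. In applied work a library routine would normally replace these hand-written helpers.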

Cointegrating Regression Durbin–Watson (CRDW) Test

• A quicker method to test for cointegration is the CRDW test.

• Critical values for this test were provided by Sargan and Bhargava.

Error Correction Mechanism (ECM)

• If two variables, such as PCE (Personal Consumption Expenditure)
and PDI (Personal Disposable Income), are cointegrated, there exists
a long-term equilibrium relationship between them.

• In the short run, disequilibrium may occur. The error term in the
cointegrating equation can be treated as the equilibrium error.

• The ECM, first introduced by Sargan and later popularized by Engle
and Granger, corrects for disequilibrium:

$$\Delta Y_t = \alpha (Y_{t-1} - \beta_1 - \beta_2 X_{t-1}) + \gamma \Delta X_t + \epsilon_t.$$

• The Granger representation theorem states that if two variables are
cointegrated, their relationship can be expressed as an ECM.

Vector Autoregressive (VAR) Methodology

• The VAR methodology resembles simultaneous-equation modeling but
with differences:

– Each endogenous variable is explained by its own lagged values
and the lagged values of all other endogenous variables in the
model.
– Typically, there are no exogenous variables in the model.

Summary and Conclusions

1. Regression analysis based on time series data implicitly assumes that
the underlying time series are stationary. The classical t tests, F tests,
etc., are based on this assumption.

2. In practice, most economic time series are nonstationary.

3. A stochastic process is said to be weakly stationary if its mean,
variance, and autocovariances are constant over time (i.e., they are
time-invariant).

4. At the informal level, weak stationarity can be tested by the correlogram
of a time series, which is a graph of autocorrelation at various
lags. For stationary time series, the correlogram tapers off quickly,
whereas for nonstationary time series it dies off gradually. For a purely
random series, the autocorrelations at all lags 1 and greater are zero.

5. At the formal level, stationarity can be checked by finding out if the
time series contains a unit root. The Dickey–Fuller (DF) and augmented
Dickey–Fuller (ADF) tests can be used for this purpose.

6. An economic time series can be trend stationary (TS) or difference
stationary (DS). A TS time series has a deterministic trend, whereas
a DS time series has a variable, or stochastic, trend. The common
practice of including the time or trend variable in a regression model
to detrend the data is justifiable only for TS time series. The DF and
ADF tests can be applied to determine whether a time series is TS or
DS.

7. Regression of one time series variable on one or more time series
variables often can give nonsensical or spurious results. This phenomenon
is known as spurious regression. One way to guard against it is to find
out if the time series are cointegrated.

8. Cointegration means that despite being individually nonstationary, a
linear combination of two or more time series can be stationary. The
Engle–Granger (EG), Augmented Engle–Granger (AEG), and Cointegrating
Regression Durbin–Watson (CRDW) tests can be used to find
out if two or more time series are cointegrated.

9. Cointegration of two (or more) time series suggests that there is a
long-run, or equilibrium, relationship between them.

10. The error correction mechanism (ECM) developed by Engle and Granger
is a means of reconciling the short-run behavior of an economic variable
with its long-run behavior.

11. The field of time series econometrics is evolving. The established results
and tests are in some cases tentative, and a lot more work remains. An
important question that needs an answer is why some economic time
series are stationary and some are nonstationary.
