Econometrics Year 3 Eco
Econometrics Year 3 Eco
UNIVERSITY ULK
DEPARTMENT OF ECONOMICS
Econometrics II
Prepared by
Out of numerous arrow shots, you may be lucky to hit the bull’s eye
only a few times, most times the arrows will be spread randomly
around the bull’s eye (depending on your skill, of course)
Chap1: Time Series Concepts
1.4 White noise
In time series terms, white noise is described as a series of
identically distributed random variables (IID). The Statistical
properties of a white nose series are as follow:
-A constant zero mean over time,
- A constant variance over time,
- A constant auto-covariance over time.
Chap1: Time Series Concepts
before one pursues formal tests, it is always advisable to plot the time
series under study, Such an intuitive feel is the starting point of more
formal tests of stationarity.
2. Autocorrelation Function (ACF) and Correlogram
Consider, for instance, the correlogram of the GDP time series given in
. How do we decide whether the correlation coefficient of 0.638
at lag 10 (quarters) is statistically significant? The statistical significance of any
ρˆk can be judged by its standard error.
in large samples the sample autocorrelation coefficients are normally
distributed with zero mean and variance equal to one over the sample
size. Since we have 88 observations, the variance is 1/88 = 0.01136 and the
standard error is √0.01136 = 0.1066. Then following the properties of the
standard normal distribution, the 95% confidence interval for any (population)
ρk is:
ρˆk ± 1.96(0.1066)
In other words,
Prob (ρˆk − 0.2089 ≤ ρk ≤ ˆρk + 0.2089) = 0.95
If the interval includes the value of zero, we do not reject the
hypothesis that the true ρk is zero, but if this interval does not
include 0, we reject the hypothesis that the true ρk is zero.
Applying this to the estimated value of ρˆ10 = 0.638, the reader can
verify that the 95% confidence interval for true ρ10 is (0.638 ±
0.2089) or (0.4291, 0.8469).
Obviously, this interval does not include the value of zero,
suggesting that we are 95% confident that the true ρ10 is significantly
different from zero.
As you can check, even at lag 20 the estimated ρ20 is statistically
significant at the 5% level.
Q statistic
Solution
A 95% confidence interval can be constructed for each coefficient using the
Confidence Interval: CI=±1.96 × 1 √T=(−0.196, +0.196), because T = 100.
The null hypothesis that a given coefficient is zero is rejected if the
coefficient lies outside the confidence interval.
Using autocorrelation coefficient in the table above, the null hypothesis is
rejected at the 5% level only for the first autocorrelation coefficient is
significantly different from zero, because other autocorrelation coefficients
lies inside the confidence interval.
Exercise 2
Let us consider the consumer price index (P) in Rwanda and use EVIEWS
to test significance of autocorrelation coefficients.
ans
A 95% confidence interval is this case is CI=±1.96 × 1/ √T=(−0.26, +0.26),
because T = 56.
Lag 1 2 3 4 5
Autocorre 0.953 0.901 0.849 0.799 0.748
lation
coefficient
The null hypothesis that a given coefficient is zero is rejected if the coefficient
lies outside the confidence interval. Using autocorrelation coefficient in the
table above, the null hypothesis is rejected at the 5% level for all
autocorrelation coefficients. In the first part of the correlogram of the
consumer Price Index (P), we can see that each autocorrelation coefficient lies
out the confidence interval.
Returning to the GDP example given in Figure , the value of the LB
statistic up to lag 25 is about 891.25. The probability of obtaining such an
LB value under the null hypothesis that the sum of 25 squared estimated
autocorrelation coefficients is zero is practically zero, as the last column of
that figures shows. Therefore, the conclusion is that the GDP time series is
nonstationary, therefore reinforcing that the GDP series may be nonstationary.
In exercise you are asked to confirm that the other four U.S. economic time
series are also nonstationary.
THE UNIT ROOT TEST
A test of stationarity (or non stationarity) that has become widely
popular over the past several years is the unit root test. We will
first explain it, then illustrate it and then consider some limitations
of this test.
We start with Yt = ρYt−1 + ut −1≤ρ≤1
where ut is a white noise error term.
Deduct both side by Yt−1 ,this will be
Yt − Yt−1 = ρYt−1 − Yt−1 + ut = (ρ − 1)Yt−1 + ut
which can be alternatively written as:
where δ = (ρ − 1) and
d is the first-difference operator
We test the (null) hypothesis that δ = 0. If δ = 0, then ρ = 1, that is we
have a unit root, meaning the time series under consideration is
nonstationary.
eq1)
eq2)
eq3)
For ADF(Augmented Dickey Fuller) and PP(Phillips Perron) the H0 is that the series
have a unit root(not stationary)
while
for KPSS(Kwiatkowski-Phillips-Schmidt-Shin) the H0 is that the series is stationary
After first difference , Now we have a much different pattern of ACF and
PACF. The ACFs at lags 1, 8, and 12 seem statistically different from zero
One way of accomplishing this is to consider the ACF and PACF and the
associated correlograms of a selected number of ARMA processes, such as
AR(1), AR(2), MA(1), MA(2),ARMA(1, 1), ARIMA(2, 2), and so on. Since
each of these stochastic processes exhibits typical patterns of ACF and
PACF, if the time series under study fits one of these patterns we can
identify the time series with that process. Of course, we will have to apply
diagnostic tests to find out if the chosen ARMA model is reasonably
accurate.
FORECASTING
The values of δ, α1, α8, and α12 are already known from the estimated
regression . Therefore,we can easily obtain the forecast value of Y1992−I. The
numerical estimate
of this forecast value is
2.ESTIMATION OF THE ARIMA MODEL in
Eviews
The partial autocorrelations with spikes at lag 1, 8, and 12 seem statistically
significant but the rest are not; if the partial correlation coefficient were significant
only at lag 1, we could have identified this as an AR(1) model. Let us therefore
assume that the process that generated the (first-differenced) GDP is at the most an
AR(12) process. Of course, we do not have to include all the AR terms up to 12,
for from the partial correlogram we know that only the AR terms at lag 1, 8, and12
are significant
Let Y*t denote the first differences of U.S. GDP. Then our tentatively
identified AR model is
- How do we find the ‘correct’ model? In other words, what are the criteria in
choosing a model for empirical analysis?
- How do we detect specification errors? In other words, what are some of the
diagnostic tools that one can use?
- What are the remedies of the specification errors once detected? What are the
benefits?
According to Hendry and Richard (1983), a model chosen for empirical analysis
should satisfy the following criteria:
- Data admissibility
- Be consistent with theory; that is, it must make economic sense.
- Have weakly exogenous explanatory variables/ regressors; that is, the regressors
must be uncorrelated with the error term.
- The model must exhibit parameter constancy; that is, the value of parameters
should be stable. In the absence of parameter constancy, predictions will not be
reliable.
- The model must exhibit data coherency; that is, the residual estimated from the
model must be purely random (technically, white noise).
-The model must include all rival models in the sense that it is capable of explaining
their results.
In short, other models cannot be an improvement over the chosen model
2.2 Types of specification errors
By summary, in developing an empirical model, one is likely to
commit one or more of the following specification errors:
1. Omission of relevant variable(s).
2. Inclusion of unnecessary variable(s)
3. Adopting the wrong functional form.
4. Errors of measurement
5. Incorrect specification of the stochastic error term.
It is fruitful to distinguish between the model specification errors and
model mis-specification errors.
The first four types of error discussed above are essentially in the nature
of the model specification errors in that we have in mind a “true” model
but somehow we do not estimate the correct model. The
last type of error is in the nature of mis-specification errors.
2.3 Consequences of model specification errors
two time periods depends only on the distance or gap or lag between the two
time periods and not the actual time at which the covariance is computed. In
time series literature, such a stochastic process is known as a weakly
stationary process.
3.1.2 Non stationary stochastic process
In time series models in econometrics, a linear stochastic
process has a unit root if 1 is a root of the process's characteristic
equation.
The process will be non-stationary with the following properties:
Keeping in mind that the terms non stationarity, random walk,
(2) random walk with drift (i.e., a constant term is present) and
random walk with drift and
Ho: c(x)=c(n)=0
Residual diagnostics
Ho: There is no serial autocorrelation
Date( Questions:
t) Y1t Y2t X1t Y1t-1
1. Determine the identifiability
1992 -30 -14 -26 20
1993 -4 10 5 -30 conditions of the model.
1994 -19 -19 -36 -4 2. Estimate the equations using OLS
1995 -6 -11 -6 -19 and show how the estimates are not
1996 -9 6 -13 -6 Best Linear Unbiased Estimators
1997 11 -12 25 -9 (BLUE).
1998 9 10 5 11 3. Estimate the coefficients using the
1999 19 11 32 9
2000 29 19 14 19
2SLS.
answer
1. The identifiability conditions of the
model
2. Estimation using OLS
a. Equation 1:
Go to quick –estimate equation –y1t=c(1)*(y2t+x1t)
Y1t = 0.5166666667*(Y2t+X1t)
b. Equation 2: