Appliedeconometrics PDF
Appliedeconometrics PDF
net/publication/269113492
CITATIONS READS
3 59,890
2 authors, including:
Ali Göksu
University of Eurasia
30 PUBLICATIONS 126 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Working on organization of the International Conference on Economic and Social Studies 2017 (ICESoS 2017) View project
CALL FOR PAPERS: International Journal of Applied Statistics and Econometrics View project
All content following this page was uploaded by Ali Göksu on 10 October 2015.
IBU Publications
2 Applied Econometrics with Eviews Applications
APPLIED ECONOMETRICS
With Eviews Applications
Authors:
Uğur ERGÜN Ali GÖKSU
ugur.ergun@ibu.edu.ba ali.goksu@ibu.edu.ba
Publisher:
International Burch University
Editor in Chief:
Prof.Dr. Mehmet Uzunoğlu
Reviewed by:
Assoc. Prof. Dr. Ercan BALDEMİR, Muğla Sıtkı Koçman University
Assoc. Prof. Dr. Safet KOZAREVIĆ, Faculty of Economics, University of Tuzla
Printed by:
Disclaimer: While every effort has been made to ensure the accuracy of the information, contained in
this publication, International Burch University will not assume liability for writing and any use made of
the proceedings, and the presentation of the participating organizations concerning the legal status of
any country, territory, or area, or of its authorities, or concerning the delimitation of its frontiers or
boundaries.
-------------------------------------------------
CIP - Katalogizacija u publikaciji
Nacionalna i univerzitetska biblioteka
Bosne i Hercegovine, Sarajevo
330.43(075.8)
ERGUN, Uğur
Applied econometrics : with eviews applications
/ Uğur Ergun, Ali Göksu. - Sarajevo :
International Burch University, 2013. - 272 str. :
graf. prikazi ; 24 cm. - (IBU Publications)
ISBN 978-9958-834-29-5
1. Göksu, Ali
COBISS.BH-ID 20792838
--------------------------------------------------------
3
APPLIED ECONOMETRICS
With Eviews Applications
IBU Publications
Sarajevo, 2013
4 Applied Econometrics with Eviews Applications
5
PREFACE
This book is designed as auxiliary source for the students who are
taking Applied Econometrics course. It is intended to clarify basic
econometrics methods with examples especially for Finance
students.
In this book, two main dimensions have been configured. In the first
chapters some basic information regarding scientific research is given
in order to polish student’s ability to understand following chapters
better. Following chapters are organized to provide more detailed
information about some specific methods and their applications in
finance by examples.
LIST OF ABBREVIATIONS
TABLE OF CONTENTS
PREFACE ......................................................................................................... 5
LIST OF ABBREVIATIONS ................................................................................. 7
TABLE OF CONTENTS ...................................................................................... 9
CHAPTER 1
SCIENTIFIC RESEARCH
DEFINITION ................................................................................................... 21
AIMS OF THE SCIENTIFIC RESEARCH ............................................................ 21
RESEARCH METHODS & METHODOLOGY .................................................... 22
Research Methods.................................................................................... 22
Research Methodology ............................................................................ 22
The Scientific Method .............................................................................. 23
CRITERIA OF A GOOD RESEARCH.................................................................. 23
POSSIBLE USERS OF RESEARCH OUTCOME .................................................. 25
RESEARCH TYPES .......................................................................................... 25
Based on the Research Purpose ............................................................... 25
Descriptive Research ............................................................................ 25
Exploratory Research ............................................................................ 26
10 Applied Econometrics with Eviews Applications
CHAPTER 2
MODELS
ECONOMIC MODEL ...................................................................................... 37
TYPES OF ECONOMIC MODELS..................................................................... 38
Visual Models ........................................................................................... 38
Mathematical Models .............................................................................. 38
Empirical Models ...................................................................................... 38
Static Model ............................................................................................. 39
Dynamic Models ....................................................................................... 39
WHICH THEORY IS APPROPRIATE? ............................................................... 40
ECONOMETRIC MODELS .............................................................................. 40
Attributes of a Good Econometric Model ................................................ 42
Structural Equation Modeling (SEM) ........................................................ 42
11
CHAPTER 3
DATA
QUALITATIVE DATA ...................................................................................... 49
In-Depth Interviews .................................................................................. 49
Direct Observation ................................................................................... 50
Written Documents .................................................................................. 50
Qualitative Methods................................................................................. 50
Participant Observation ........................................................................... 51
Direct Observation ................................................................................... 51
Unstructured Interviewing ....................................................................... 52
Case Studies.............................................................................................. 52
QUANTITATIVE VERSUS QUALITATIVE DATA ........................................... 53
WHY ARE QUANTITATIVE AND QUALITATIVE DATA IMPORTANT? .............. 53
QUANTITATIVE DATA TYPES ......................................................................... 53
Primary Data............................................................................................. 53
Secondary Data ........................................................................................ 54
Experimental Data .................................................................................... 54
Observational data ................................................................................... 54
Time Series ............................................................................................... 55
Panel Data ................................................................................................ 56
12 Applied Econometrics with Eviews Applications
CHAPTER 4
REGRESSION
DEFINITION ................................................................................................... 63
SIMPLE REGRESSION MODELS...................................................................... 63
ESTIMATION OF THE LINEAR REGRESSION MODEL ..................................... 65
Ordinary Least Squares Estimation Method (OLS) ................................... 65
Multivariate Regression Model ................................................................ 68
CLASSICAL LINEAR REGRESSION MODEL (CLRM) ASSUMPTIONS ................ 70
STATISTICAL PROPERTIES OF OLS ................................................................. 72
Linearity .................................................................................................... 72
Unbiasedness ........................................................................................... 75
Efficiency .................................................................................................. 75
Normality .................................................................................................. 75
GAUSS-MARKOV THEOREM ......................................................................... 77
COEFFICIENT OF DETERMINATION, GOODNESS OF FIT ............................... 78
13
CHAPTER 5
CLASSICAL LINEAR REGRESSION MODEL (CLRM)
LINEARITY ..................................................................................................... 83
Modeling Nonlinearities (with OLS) ......................................................... 84
Testing Linearity ....................................................................................... 84
The Ways to Correct Linearity Problem: .................................................. 88
HETEROSCEDASTICITY .................................................................................. 89
Consequences .......................................................................................... 90
Detection .................................................................................................. 90
Which Test to Apply? ............................................................................. 101
The Ways to Correct Heteroscedasticity Problem ................................. 102
AUTOCORRELATION ................................................................................... 102
Consequences of Autocorrelation in the Residuals ............................... 103
First Order Autocorrelation .................................................................... 103
Detection ................................................................................................ 104
The Ways to Correct Autocorrelation Problem ...................................... 111
MULTICOLLINEARITY .................................................................................. 111
Consequences ........................................................................................ 111
Causes of Multicollinearity ..................................................................... 112
Detection ................................................................................................ 113
The Ways to Handle Multicollinearity .................................................... 114
EXOGENEITY ............................................................................................... 116
NORMALITY ................................................................................................ 117
CONSISTENCY ............................................................................................. 121
14 Applied Econometrics with Eviews Applications
CHAPTER 6
HYPOTHESIS TESTING
DEFINITIONS ............................................................................................... 123
F-Test: A JOINT TEST OF SIGNIFICANCE ...................................................... 131
CHAPTER 7
SPECIFICATIONS
CHOOSING INDEPENDENT VARIABLES ....................................................... 135
Omitting a Relevant Variable ................................................................. 136
Detection ............................................................................................ 137
Wald Test (Coefficient Restrictions) ................................................... 137
Omitted Variables Test ....................................................................... 140
Solution ............................................................................................... 142
Including an Irrelevant Variable in the Regression equation ................. 142
How to Choose Correct Variables? ..................................................... 143
Redundant Variables Test................................................................... 143
CHOOSING A FUNCTIONAL FORM .............................................................. 145
Functional Forms: ................................................................................... 145
The Log-log Regression Model (Double Log) ...................................... 145
Cobb-Douglas Production Function .................................................... 145
Lin-Log Model (semi log) .................................................................... 146
Log-ln Model ....................................................................................... 146
Quadratic Forms ................................................................................. 146
Inverse Form ....................................................................................... 146
Intercept Dummy Independent Variable ............................................ 146
Slope Dummy Independent Variable .................................................. 147
Lags..................................................................................................... 147
Mixed Functional Forms ......................................................................... 147
15
CHAPTER 8
PARAMETER STABILITY
CHOW BREAKPOINT TEST FOR STRUCTURAL BREAK ................................. 158
CHOW FORECAST TEST FOR STRUCTURAL BREAK...................................... 159
CUSUM TEST ............................................................................................... 160
CUSUM OF SQUARES TEST ......................................................................... 161
ONE STEP AHEAD FORECAST TEST ............................................................. 162
N STEP FORECAST TEST .............................................................................. 163
RECURSIVE COEFFICIENT ESTIMATES ......................................................... 164
WALD TEST FOR STRUCTURAL CHANGE ..................................................... 165
16 Applied Econometrics with Eviews Applications
CHAPTER 9
INTERPRETING THE LINEAR REGRESSION MODELS
SIMPLE LINEAR REGRESSION MODEL ......................................................... 167
MULTIPLE LINEAR REGRESSION MODEL .................................................... 168
REGRESSOR SELECTION .............................................................................. 174
CHAPTER 10
ENDOGENEITY
SOURCES OF ENDOGENEITY ....................................................................... 176
Autocorrelation with Lagged Dependent Variable ................................ 176
Omitted Variable Bias............................................................................. 177
Measurement Error ................................................................................ 177
Simultaneity............................................................................................ 178
ENDOGENEITY TEST .................................................................................... 179
SOLUTION ................................................................................................... 181
Ad hoc Approaches................................................................................. 181
Instrumental Variables Estimation ......................................................... 181
Two Stage Least Squares ........................................................................ 185
Weak Instrument.................................................................................... 186
Generalized Method of Moment ........................................................... 186
CHAPTER 11
REGRESSIONS WITH DUMMY VARIABLES
A DUMMY INDEPENDENT VARIABLE .......................................................... 189
INTERPRETATION........................................................................................ 190
QUALITATIVE VARIABLE WITH MORE THAN TWO CATEGORIES
(POLYTOMOUS FACTORS) .......................................................................... 191
CONSTRUCTING INTERACTION REGRESSORS ............................................. 193
17
CHAPTER 12
TIME SERIES
STATIC MODELS .......................................................................................... 199
DYNAMIC MODELS ..................................................................................... 200
TIME SERIES ................................................................................................ 200
Time Series Regression with Lag ............................................................ 200
Lag Selection........................................................................................... 202
Finite Distributed Lag Models ................................................................ 205
Autoregressive Modeling (AR) ............................................................... 205
Stationarity ............................................................................................. 206
Random Walk ......................................................................................... 208
Seasonality ............................................................................................. 210
Steps for Testing in the AR(p) with Deterministic Tren .......................... 211
Lagged Dependent Variables.................................................................. 213
Moving Average ..................................................................................... 214
18 Applied Econometrics with Eviews Applications
CHAPTER 13
VOLATILITY
ARCH MODEL .............................................................................................. 249
DIAGNOSTIC CHECKING .............................................................................. 256
VOLATILITY SPILLOVER, GARCH (1,1) MODEL ............................................ 258
19
CHAPTER 14
GRANGER CAUSALITY
CAUSALITY IN BOTH DIRECTIONS ............................................................... 265
GRANGER CAUSALITY IN COINTEGRATED VARIABLES................................ 265
CHAPTER 15
VECTOR AUTOREGRESSION MODEL (VAR)
DEFINITION ................................................................................................. 267
LAG LENGTH SELECTION ............................................................................. 270
FORECASTING WITH “VAR” ........................................................................ 270
VECTOR AUTOREGRESSIONS WITH COINTEGRATED VARIABLES ............... 272
CHAPTER 1
SCIENTIFIC RESEARCH
DEFINITION
Scientific research is a systematic, controlled, empirical, amoral,
public and critical investigation of natural phenomena guided by
theory and hypotheses about the presumed relations among such
phenomena1.
Research Methodology
2
Fox, J.H.(1958)
24 Applied Econometrics with Eviews Applications
3
Bellenger D.N & A. Greenberg, B.A. (1978)
Scientific Research 25
RESEARCH TYPES4
Based on the Research Purpose
Descriptive Research
4
Kothari, C.R. & Garg, G.D. (2004)
26 Applied Econometrics with Eviews Applications
Example 1.1
- What are the feelings of Academics faced with redundancy?
- What kind of ethical behaviors do students expect at the
universities?
Exploratory Research5
5
Collis, J & Hussey, R. (2009).
Scientific Research 27
Example 1.2
- Why do some cities have higher traffic accident rate than
others?
- Why are some children becoming piddling?
28 Applied Econometrics with Eviews Applications
Example 1.3
- Examining the Turkey balance of payment deficit movement
during 2008 global financial crisis.
- Examining trends of real and nominal value of New Turkish Lira
against Euro after 2000.
Predictive Research
6
Collis, J & Hussey, R. (2009)
Scientific Research 29
Example 1.4
- What type of marketing strategy can improve the sales?
- How would an increase in interest rates affect inflation rate?
- During financial crises, what would happen to sales of
personnel computer?
Qualitative Research8
8
Kothari, C.R. (2004).
Scientific Research 31
Example 1.5
- What is the role of the experience in participation on public
health education sessions for smokers and drug users?
- How should the nurses handle patients who refuse to follow
instructions?
Quantitative Research
Example 1.6
- To examine the interaction between inflation and balance of
payment.
- To examine the relationship between usage and need of
nursing services in the rural areas compared to urban areas.
Applied Research
Example 1.7
- The researcher collects information to test the effectiveness of
traffic policies and fines in minimizing traffic causalities.
Basic Research
Example 1.8
- The researcher collects information to understand further the
relationship between socioeconomic status and the intention to
obey to traffic rules.
Deductive Research
Example 1.9
- To test the theories of motivation in different cultures and
sectors.
Inductive Research
Example 1.10
- To investigate the relationship between production level and
shift time in a factory in order to test the theory stating that
production levels vary with working time length.
CONCEPTUAL RESEARCH
Conceptual research is related to some abstract idea(s) or theory. It is
generally used by philosophers and thinkers to develop new concepts
or to reinterpret existing ones.
EMPIRICAL RESEARCH
Empirical research relies on experience or observation alone, often
without seeking for system and theory. It is data-based research,
resulting in conclusions which are capable of being verified by
observation or experiment, thus it can be named as experimental
type of research.
HISTORICAL RESEARCH9
Historical research utilizes historical sources such as documents,
remains, etc. in order to study events or ideas from the past,
including the philosophes of persons and groups at any remote point
of time. The purpose is to make people aware of what has happened
in the past offering the opportunity to learn from past failures and
successes, and apply them to present problems, make predictions
based on that and test hypothesis concerning relationships or trends.
Example 1.11
- To investigate the family trees
- To investigate the role of Ottoman Empire in the First
World War 1
9
Kumar, R. (2008).
36 Applied Econometrics with Eviews Applications
Models 37
CHAPTER 2
MODELS
ECONOMIC MODEL
Economic modeling is one of the most important parts of a research
and the economic theory generally. The researcher organizes and
designs his/her ideas through economic modeling. The model helps
the researcher to rationally and logically support the hypothesis or
answer research questions defined in the beginning of the research.
The researcher follows certain steps in this model and delivers
scientific findings in the end. As a result, he/she produces logical and
beneficial results for the society in general.
Example 2.1
- Aggregate Supply – Aggregate Demand (AS/AD) Model,
- Loanable Funds Model
- HMC Macro Sim- simulation model,
- IS/LM Model
38 Applied Econometrics with Eviews Applications
Example 2.2
Supply-and-Demand model
General Equilibrium model
Mathematical Models
Example 2.3
To examine the demand elasticity for luxury cars in low
income countries
Empirical Models
Example 2.4
To investigate the changes in income when investment changes
one percent
2.2.4 Simulation Models
Example 2.5
The equations of the model are programmed in a software
programming language
Static Model
Example 2.6
To investigate the impact of class size on the average test
score of the students
Dynamic Models
Example 2.7
To examine the role of Interest rate on Inflation rate
movements over time
ECONOMETRIC MODELS
Econometrics is a science and art of using economic theory and
statistical techniques to analyze economic data.10 An econometric
model should be constructed based on economic theory, experience
or critical thinking. Econometrics finds and explains the relationship
between economic variables based on outcomes of statistical
techniques using the available data. It clarifies the interaction and
mediation between data, economic theory and statistical techniques.
10
Stock, J.H & Watson, M.W. (2008).
Models 41
Example 2.8
The impact of previous days’ stock returns on the current stock
return
Example 2.9
To examine the degree of relationship between export and
inflation over time
Example 2.10
To investigate the impact of student-teacher ratio on student
exam results
Example 2.11
To examine the relations between inflation and
unemployment in different countries
42 Applied Econometrics with Eviews Applications
Y= C + I + X – M
Q = f (L, K)
In different notation;
Stochastic Models
Deterministic Models
Analytical Framework
LITERATURE REVIEW
DATA COLLECTION
a- Daily data
- Stock market indices* and Exchange rates*
b- Monthly data
- Stock market indices*, Exchange rates*, Interest rates*,
Export and Import volume**, GDP per capita**, FDI**, M3**
* Turkey, EU member countries (27), non-EU member
European countries (5),the US and Japan
** Turkey
APPLICATIONS
Financial linkages between Turkey and EU &US & Japan
(external linkages)
Interrelationship among ISE, welfare and monetary policy
(internal linkages)
DIAGNOSTIC TESTING
ECONOMETRIC ANALYSIS
REPORT
PRESENTATION AND
PUBLICATION
Methodological Framework
Determine Research
objectives and questions
Research Design
Data Collection
Analyzing data
Interpretation
Stationary
Test,
ADF and PP
CHAPTER 3
DATA
Data are the piece of information or knowlege which are used for
reasoning or calculation as measurement.
QUALITATIVE DATA
Qualitative data is extremely varied in nature. It includes virtually any
information that can be captured but is not numerical in nature. Here
are some of the major categories or sources of qualitative data:
In-Depth Interviews
Example 3.1
Which business leaders or role models do you respect?
How do you handle conflict on a team project?
How do you see the efficiency of current economic policies?
What sectors should government support? Why?
Direct Observation
Example 3.2
To analyze the student’s reaction to the male and female
teachers
To analyze the impact of family income on their children’s
behavior in the public schools
Written Documents
Example 3.3
To analyze the ethical sensitiveness in the novels published
between two world wars
Qualitative Methods
Participant Observation
Direct Observation
Structured Interviewing
this interview, the questions are clear and fully understood by the
respondents and, ach respondent answer same set of question in
order to obtain an objective result minimizing interviwer bias.
Unstructured Interviewing
Case Studies11
11
http://www.socialresearchmethods.net
Data 53
Primary data are those that the researcher collects himself. The
researcher is expected to have accessed primary data while using
54 Applied Econometrics with Eviews Applications
Secondary Data
Experimental Data12
Observational data
Researchers who use observational data can obtain data from lab
scientists, make reliable conclusions and recognize the limitations of
data that are gathered in more natural settings.
12
http://en.wikipedia.org/wiki/Experimental_data
Data 55
Time Series13
13
http://www.abs.gov.au
56 Applied Econometrics with Eviews Applications
Example 3.4
Panel Data14
Panel data allows the researcher to control variables which are not
possible to be observed or measured such as cultural factors or
differences in business practices across companies, variables that
change over time but not across entities, and other. New variables
can be included at different levels of analysis (i.e. students, schools,
districts, states). It is suitable for multilevel or hierarchical modeling.
14
Baltagi, B.H. (2008).
Data 57
Example 3.5
Example 3.6
If the data exhibits different regimes whether they should have been
modeled with different specification or not, it should be tested
through the Chow breakpoint test as follows:
∑
∑
15
Vogelvang, B. (2005).
Data 59
Example 3.7
Table 3.4 Eviews Output for Chow Breakpoint Test with One
Breakpoint
Chow Breakpoint Test: 1970Q1
Null Hypothesis: No breaks at specified breakpoints
Varying regressors: All equation variables
Equation Sample: 1954Q1 1994Q4
Table 3.4 Eviews Output for Chow Breakpoint Test with Multiple
Breakpoint
Chow Breakpoint Test: 1970Q1 1982Q2 1988Q3
Null Hypothesis: No breaks at specified breakpoints
Varying regressors: All equation variables
Equation Sample: 1954Q1 1994Q4
DATA TRANSFORMATION
Log Transformation
How to Choose?
Differencing
Percentage Change
It is calculated by subtracting old one from new one and dividing the
difference by old one. It is used for the give emphasis return.
Base Year
A base year is used for comparison of the two or more time series
data levels. The arbitrary level of 100 is selected so that percentage
changes (either rising or falling) over year can be easily depicted. Any
year can be chosen as base year, but generally recent years are
chosen and the other observations are adjusted based on the base
year.
62 Applied Econometrics with Eviews Applications
Regression 63
CHAPTER 4
REGRESSION
DEFINITION
Regression is an econometric technique for estimating the
relationships among variables. It helps to analyze how the typical
value of the dependent variable changes when any one of the
independent variables changes, while the other are held constant
(ceteris paribus). Linear regression estimates how much y changes
when x changes one unit. The main purpose of linear regression
analysis is to assess associations between dependent and
independent variables.
(4.1)
Where;
Example 4.1
(4.2)
implies degree and direction of the relationship between Income
and Consumption.
y
𝛽
𝛽 𝑥𝑖
𝑥 𝑦
e1
e2
𝑥 𝑦 x
(4.3)
̂ ̂ ̂ ̂ (4.4)
̂ (4.5)
(̂ ̂ ̂ ) ∑ (4.6)
∑ ̅ ̅
̂ (4.7)
∑ ̅
̂ ̅ ̅ (4.8)
Example 4.2
8.2
8.0
7.8
INCOME
7.6
7.4
7.2
7.0
6.8
6.8 7.0 7.2 7.4 7.6 7.8 8.0 8.2 8.4
CONSUMPTION
Figure 4.2 Eviews Output for Scatter Plot between Income and
Consumption
Example 4.3:
18,000
16,000
14,000
12,000
LOTSIZE
10,000
8,000
6,000
4,000
2,000
0
0 40,000 80,000 120,000 160,000 200,000
PRICE
Figure 4.3 Eviews Output for Regression Line
68 Applied Econometrics with Eviews Applications
(4.9)
Error term, ei, includes all the independent variables which are not
defined in the regression because of the data unavailability or
measurement error. We assume that effect of error term on
dependent variable is marginal.
Example 4.4
(4.10)
= 7634.897
| | (4.11)
, where and
| (4.12)
( | ) (4.13)
| (4.14)
(4.15)
The estimators are linear, that is, they are linear functions of the
dependent variable, . Linear estimators are easy to understand and
deal with compared to nonlinear estimators. Linearity assumption
can be expressed as follows:
(4.16)
Regression 73
Reset-type test (Ramsey, 1969) is the most common test for testing
the linearity assumption. This testing procedure involves the
estimation of the following (auxiliary) regression:
̂ ̂ (4.17)
,
.
̂ (4.18)
∑ ̂ ∑ ̂
∑ ̂
(4.19)
(4.20)
when linearity does not hold, the OLS estimators are biased and
inconsistent. In other words, estimation and testing results are
invalid and the model should be restructured.
Example 4.5
200,000
160,000
120,000
PRICE
80,000
40,000
0
0 40,000 80,000 120,000 160,000 200,000
PRICEF
Unbiasedness
(̂) (4.21)
Efficiency
Normality
⁄ ́ (4.22)
16
a measure for the degree of symmetry in the variable distribution
17
a measure for the degree of peakedness/flatness in the variable distribution.
76 Applied Econometrics with Eviews Applications
[ ̂ ̂ ] (4.23)
̂ [ ∑ ̂ ∑ ̂ ] (4.24)
̂ [ ∑ ̂ ∑ ̂ ] (4.25)
Example 4.6
80
Series: Residuals
70 Sample 1 546
Observations 546
60
Mean -9.39e-12
50 Median -805.5036
Maximum 85311.59
40 Minimum -52757.51
Std. Dev. 18198.08
30 Skewness 0.893173
Kurtosis 5.649682
20
Jarque-Bera 232.3196
10 Probability 0.000000
0
-40000 -20000 0 20000 40000 60000 80000
Scatter plot shows whether the series are distributed normally or not.
It skewed the left side and shows leptokurtic behavior.
Regression 77
14
Series: Residuals
12 Sample 1954Q1 1994Q4
Observations 164
10
Mean -1.77e-15
Median 0.000850
8 Maximum 0.032516
Minimum -0.046830
6 Std. Dev. 0.016056
Skewness -0.224505
4 Kurtosis 2.604619
Jarque-Bera 2.445893
2
Probability 0.294362
0
-0.0375 -0.0250 -0.0125 0.0000 0.0125 0.0250
GAUSS-MARKOV THEOREM
Under the assumed above mentioned conditions, OLS estimators are
BLUE (best linear unbiased estimators). This is the essence of the
well-known Gauss–Markov theorem which provides a theoretical
justification for the method of least squares. With the added
assumption of normality, the OLS estimators are best unbiased
estimators (BUE) in the entire class of unbiased estimators, whether
linear or not. With normality assumption, CLRM is known as the
normal classical linear regression model (NCLRM).
∑
̂ (4.26)
78 Applied Econometrics with Eviews Applications
that is, the residual sum of squares (RSS) divided by (N-K), which is
called the degrees of freedom (df), N being the sample size and K
being the number of regression parameters estimated, an intercept
and (K – 1) slope coefficients. ̂ is called the standard error of the
regression (SER) or root mean square.
Example 4.7
∑
√̂ √
TSS=ESS+RSS
∑ ̂ ̂
∑ ̂
(4.30)
variance estimates for the degrees of freedom. This gives the value of
adjusted R2, or ̅ as follows:
∑ ̂ ̂ ∑
̅
̂
∑
∑ ̂
[ ] (4.31)
Example 4.8
̅ [ ]
(4.32)
Example 4.9
CHAPTER 5
LINEARITY
Linearity is the situation in which the relationship between the
dependent variable and the independent variables is linear. It is the
main assumption of the OLS estimation method. If linearity exists, the
effect of an exogenous variable on the endogenous variable is the
same for all values of the other exogenous variables in the model
(that is, the slope of the population regression function is constant,
so that the effect on y of a unit change in x does not depend on the
value of one or more exogenous variable). In this sense, non-linearity
has important consequences for the interpretation of the estimation
results.
Testing Linearity
Example 5.1
18,000
16,000
14,000
12,000
LOTSIZE
10,000
8,000
6,000
4,000
2,000
0
-60,000 -20,000 0 20,000 60,000 100,000
RESID01
Figure 5.1 Eviews Output for Lot Size Versus Predicted Residual
18,000
16,000
14,000
12,000
LOTSIZE
10,000
8,000
6,000
4,000
2,000
0
-60,000 -20,000 0 20,000 60,000 100,000
RESID01
Figure 5.2 Eviews Output for Kernel Density, Lot Size Versus
Predicted Residual
86 Applied Econometrics with Eviews Applications
Example 5.2
.48
.44
.40
CONSF
.36
.32
.28
.24
-.08 -.06 -.04 -.02 .00 .02 .04 .06 .08
RESID01
Figure 5.3 Eviews Output for Ice-Cream Consumption Predicted
Versus Residual
̂ ̂ ̂ ̂ ̂ ̂ ̂ (5.1)
=0 (5.2)
Classıcal Lınear Regressıon Model (CLRM) 87
̂ ̂
(5.3)
̂
Example 5.3
Value df Probability
F-statistic 0.339738 (3, 538) 0.7966
Likelihood ratio 1.033391 3 0.7932
F-test summary:
Sum of Sq. df Mean Squares
Test SSR 3.41E+08 3 1.14E+08
Restricted SSR 1.80E+11 541 3.34E+08
Unrestricted SSR 1.80E+11 538 3.35E+08
Unrestricted SSR 1.80E+11 538 3.35E+08
LR test summary:
Value df
Restricted LogL -6129.993 541
Unrestricted LogL -6129.476 538
88 Applied Econometrics with Eviews Applications
The reset F statistic is equal to 0.339 and the corresponding p-value is 0.7966. There
is no evidence to reject the null hypothesis of linearity at the 5% significance level.
HETEROSCEDASTICITY
for all observations. That is, the variance of the error
term is constant (Homoscedasticity) over the sample period. If the
error terms do not have constant variance, they are said to be
heteroscedastic.
Consequences
OLS estimates are no longer BLUE. That is, among all of the unbiased
estimators, OLS does not provide the estimate with the smallest
variance. Depending on the nature of the heteroscedasticity,
significance tests can be too high or too low. The standard errors are
biased when heteroscedasticity is present. This in turn leads to bias in
test statistics and confidence interval.
Detection
Example 5.4
60,000
50,000
40,000
30,000
20,000
10,000
-10,000
71 72 73 74 75 76 77 78 79 80 81 82 83 84 85
CONSUMF RESID01
Figure 5.4 Eviews Output for Residuals Plotted Against Fitted Values
Classıcal Lınear Regressıon Model (CLRM) 91
40
30
20
10
-10
-20
250 500 750 1000 1250
WAGEF RESID01
Figure 5.5 Eviews Output for Residuals Plotted Against Fitted Values
(5.4)
(5.5)
92 Applied Econometrics with Eviews Applications
Example 5.5
Example 5.6
Test Equation:
Dependent Variable: RESID^2
Method: Least Squares
Sample: 1 1472
Included observations: 1472
(5.6)
(5.7)
Classıcal Lınear Regressıon Model (CLRM) 95
Example 5.7
Table 5.4 Eviews Output for the Ramsey´s Test for Heteroscedasticity
Ramsey RESET Test
Equation: UNTITLED
Specification: CONS INCOME PRICE TEMP TIME C
Omitted Variables: Powers of fitted values from 2 to 3
Value df Probability
F-statistic 5.331333 (2, 23) 0.0125
Likelihood ratio 11.42686 2 0.0033
F-test summary:
Mean
Sum of Sq. df Squares
Test SSR 0.010957 2 0.005478
Restricted SSR 0.034590 25 0.001384
Unrestricted SSR 0.023634 23 0.001028
Unrestricted SSR 0.023634 23 0.001028
LR test summary:
Value df
Restricted LogL 58.91245 25
Unrestricted LogL 64.62588 23
(5.8)
Example 5.8
Test Equation:
Dependent Variable: RESID^2
Method: Least Squares
Sample (adjusted): 2 30
Included observations: 29 after adjustments
GQ = R = RSS2/RSS1 (5.9)
̂ ̂ ̂ ̂ (5.11)
(5.12)
, (5.13)
Example 5.9
Test Equation:
Dependent Variable: ARESID
Method: Least Squares
Sample: 1 30
Included observations: 30
Example 5.10
Test Equation:
Dependent Variable: LRESID2
Method: Least Squares
Sample: 1 30
Included observations: 30
AUTOCORRELATION
Autocorrelation can only occur in the models that include time series
data and it means that either the model is specified with an
insufficient number of lagged variables or not all the relevant
explanatory variables are specified in the model.
The error term catch the influence of the not included variables
affecting dependent variable. Persistence effect of the excluded
variables causes positive autocorrelation. If those excluded variables
are observable and includable in the model, autocorrelation test
result is an indication of a misspecification model.
̀ (5.14)
(5.15)
There are various autocorrelation tests. All tests have same null
hypothesis of absence of autocorrelation in the disturbance term.
The tests have different alternative hypothesis because of differences
of the order of the autocorrelation.
For example;
(5.16)
With
Detection
i- Graphical method
18
Verbeek, M. (2004).
Classıcal Lınear Regressıon Model (CLRM) 105
RESID01
2,000
1,500
1,000
500
-500
-1,000
-1,500
50 100 150 200 250 300 350 400 450 500 550
Figure 5.6 Eviews Output for Residual Plot
RESIDUAL
100
80
60
40
20
-20
-40
-60
-80
1960 1965 1970 1975 1980 1985 1990 1995
(5.17)
(5.18)
Example 5.11
Test Equation:
Dependent Variable: RESID
Method: Least Squares
Included observations: 30
Presample missing value lagged residuals set to zero.
∑ ̂ (5.20)
Example 5.1219
Table 5.9 Eviews Output for the Box-Pierce and the Ljung-Box Tests (Q test)
Included observations: 30
19
Vogelvang,B. (2005)
Classıcal Lınear Regressıon Model (CLRM) 109
This is the most known autocorrelation test. This test is used for first
order autocorrelation.
This test can be used if the explanatory variables are exogenous and
a constant term has been included in the model. DW –statistics is not
used, if lagged dependent variables are present as explanatory
variables in the model. It can be employed if the explanatory
variables are exogenous and the model includes intercept. DW-
statistics should be used if all the conditions are satisfied. Otherwise
it is more informative to use Breusch-Godfrey test in the research
paper.
∑
∑
(5.21)
Table 5.10 Lower and Upper Bounds for 5% Critical Values of the DW Test20
Number of regressors including intercept
Number of K=3 K=5 K=7 K=9
Observations dL dU dL dU dL dU dL dU
25 1.206 1.550 1.038 1.767 0.868 2.012 0.702 2.280
50 1.462 1.628 1.378 1.721 1.291 1.822 1.201 1.930
75 1.571 1.680 1.515 1.739 1.458 1.801 1.399 1.867
100 1.634 1.715 1.592 1.758 1.550 1.803 1.506 1.850
200 1.748 1.789 1.728 1.810 1.707 1.831 1.686 1.852
20
Savin, N.E. & White, K.J. (1977)
Classıcal Lınear Regressıon Model (CLRM) 111
MULTICOLLINEARITY
Multicollinearity is a data problem. Collinearity between variables is
always present. It becomes a problem and violation of the classical
assumptions if the correlations among the independent variables are
very strong. It can affect accuracy of the parameter estimates.
Multicollinearity misleadingly increases the standard errors. Thus, it
makes some variables statistically insignificant while they should be
otherwise significant. It is like two or more people singing loudly at
the same time. One cannot discern who is singing how. The influence
of the independents offset each other. In multicollinearity, the
parameter estimates become inaccurate.
Consequences
The precision of estimation falls because of very big errors that may
be highly correlated errors and very large sampling variances of the
coefficients. Large standard errors can be caused by things besides
multicollinearity. If two independent variables are highly and
112 Applied Econometrics with Eviews Applications
Causes of Multicollinearity
Detection
If VIF = 0, No multicollinearity
If VIF ≥ 0, There is multicollinearity
Example 5.14
Coefficientsa
Model Collinearity
Statistics
Tolerance VIF
1 Nbedroom .854 1.171
Nbath .841 1.189
lot size .955 1.047
a. Dependent Variable: Nstories
Coefficientsa
Model Collinearity
Statistics
Tolerance VIF
1 Nbedroom .820 1.220
lot size .976 1.024
Nstories .833 1.200
a. Dependent Variable: Nbath
116 Applied Econometrics with Eviews Applications
Coefficientsa
Model Collinearity
Statistics
Tolerance VIF
1 lot size .962 1.040
Nstories .895 1.118
Nbath .867 1.153
a. Dependent Variable: Nbedroom
EXOGENEITY
There are two types of variables in macro-econometric models:
endogenous and exogenous. Endogenous variables are explained by
the equations, either the stochastic or the identities. Exogenous
variables are determined outside of the model not explained within
the model. They are taken as given.
| (5.22)
(5.23)
, (5.24)
Classıcal Lınear Regressıon Model (CLRM) 117
Together with the price, it is obvious that the quality has also
significant impact on sales quantity. Also, there is very strong positive
relationship between price of sish kebab and the quality. Therefore
are highly correlated.
NORMALITY
Normality test was introduced by Fisher (1948), but, a standard
Jarque-Bera (JB, 1980) test is being used widely. It is expressed in
terms of the third and fourth moments of the disturbances, as
follows:
( ) (5.25)
S: measure of skewness
(5.26)
K: measure of Kurtosis
(5.27)
Example 5.15
600
Series: Residuals
Sample 1 1472
500 Observations 1472
8
Series: Residuals
7 Sample 1 30
Observations 30
6
Mean 2.04e-17
5 Median 0.003040
Maximum 0.076192
4 Minimum -0.060326
Std. Dev. 0.034537
3 Skewness 0.065254
Kurtosis 2.988689
2
Jarque-Bera 0.021450
1 Probability 0.989332
0
-0.06 -0.04 -0.02 0.00 0.02 0.04 0.06 0.08
Example 5.16
WAGE MALE
30 2.4
2.0
20
Quantiles of Normal
Quantiles of Normal
1.6
10
1.2
0
0.8
-10 0.4
0 10 20 30 40 50 0 1 2
EXPER EDUC
60 8
7
40
Quantiles of Normal
Quantiles of Normal
5
20
4
3
0
2
-20 1
0 10 20 30 40 50 0 1 2 3 4 5 6
100
80
Empirical Quantiles
60
40
20
0
.24 .28 .32 .36 .40 .44 .48 .52 .56
Quantiles of CONS
INCOME PRICE
TEMP TIME
Figure 5.11 Eviews Output for QQ and PP Plots
Example 5.17
LNWAGE LNEXPER
1.6 1.2
Distance to points above median
0.8 0.6
0.4
0.4
0.2
0.0 0.0
0.0 0.4 0.8 1.2 1.6 0.0 0.5 1.0 1.5 2.0 2.5 3.0
LNEDUC MALE
.6 1
Distance to points above median
.5
.4
.3 0
.2
.1
.0 -1
0.0 0.2 0.4 0.6 0.8 1.0 1.2 0 1 2
Figure 5.12 Eviews Output for QQ and PP Plots with Logged Data
240
Series: Residuals
Sample 1 1472
200 Observations 1472
40 Jarque-Bera 443.8339
Probability 0.000000
0
-1.5 -1.0 -0.5 0.0 0.5 1.0
CONSISTENCY
Consistency is a large sample property. If there are more
observations, the probability that the estimator is some positive
number far from the true value becomes smaller. A minimum
requirement for an estimator to be useful appears to be that it is
consistent. OLS estimator is consistent if it holds that:
∑ ̀ (5.28)
{ } (5.29)
It says that the error term is mean zero and uncorrelated with any of
the explanatory, independent variables. The consistency property
implies that the estimator ̂ itself does not have an asymptotic
distribution. Sufficient condition for a consistent estimator is that the
bias and the variance should both tend to be zero as the sample size
increases. The specification of the linear regression model is as
follows:
(5.29)
Deterministic assumptions:
CHAPTER 6
HYPOTHESIS TESTING
DEFINITIONS
Statistical significance: Is a result that is not likely to occur
randomly, but rather is likely to be attribuTable to a specific
cause.
Confidence interval
tcritical 0 tcritical
Figure 6.1 Confidence Interval two Sided
21
the z-test does not involve degrees of freedom since it assumes that the population
variance is known
22
significance level + confidence level = 1
Hypothesıs Testıng 125
One-sided hypothesis test puts all of the significance level "at one
end of a distribution.
One-sided test: ,
critical region
acceptance region p = α, reject null
1- α, accept null
0 tcritical
Figure 6.2 Confidence Interval one Sided
critical region
p = α/2, reject null
acceptance region
1- α/2, accept null
-tcritical tcritical
Figure 6.3 Acceptance Region Two Sided
The p-value is the smallest value that would lead to rejection of the
null hypothesis. It is also called the significance probability.
Generally, the smaller the p-value, the more likely is it that one can
reject the null hypothesis. A p−value of 0.003 means that there is
only a 3 in a thousand chance of reaching the wrong conclusion. The
126 Applied Econometrics with Eviews Applications
smaller the p- value, the stronger the evidence against the null
hypothesis.
| | (6.2)
(6.3)
√
(6.4)
Hypothesıs Testıng 127
Do not reject the null hypothesis, in which case one concludes that
there is insufficient evidence to indicate that the alternative
hypothesis is true.
(6.5)
Example 6.1
√
√
Step four: Draw a graph included the test statistics value, the
critical value and the critical region(s) or compare the
P-value with the significance level α. And then make
a conclusion of the hypothesis.
Hypothesıs Testıng 129
2.66 15.068
Figure 6.4 Critical Region
Table 6.1 Critical Values for One Sided and Two Sided t-Test
(for infinite sample)
Significance Levels (%) Critical values
One-sided two-sided
1 2.58 2.58
5 1.96 1.96
10 1.64 1.64
Example 6.2
(6.6)
(6.7)
(6.8)
Large values for the test statistics imply rejection of null hypothesis.
Critical values for the F-test are attached.
(6.9)
Example 6.3
In the second step we add price of tea as a regressor and report the
results in Table 6.3
(6.10)
If the test statistics based on F does not reject the null hypothesis, we
can conclude that the model performs rather poorly, a model with
only one intercept term would not be statistically worse. If the test
statistics reject the null hypothesis, we cannot conclude that the
model is good.
Example 6.424:
24
Verbeek, M. (2004)
134 Applied Econometrics with Eviews Applications
Specıfıcatıons 135
CHAPTER 7
SPECIFICATIONS
(7.1)
, (7.2)
(7.3)
(7.4)
Gauss- Markov theorem is not valid, and OLS is not BLUE, estimate of
regression coefficient is biased. OLS overestimate the coefficient of
independent variable .
̂ (7.5)
Specıfıcatıons 137
Detection
(7.6)
138 Applied Econometrics with Eviews Applications
R : q x k matrix,
r : a q vector
Wald statistics:
, (7.7)
̅ ̅
(7.8)
Example 7.1
c(2)+c(3) = 1
There is only one restriction. Two test statistics are identical with the
p-values of both statistics indicates that the null hypothesis of
constant returns to scale can be rejected.
Example 7.2
Solution
(7.9)
(7.10)
(7.11)
iii- Has ̅
If all the conditions above are satisfied, then variable belongs to the
regression equation. Following techniques are strongly
recommended to be employed for specification bias problem:
Example 7.3
Functional Forms:
(7.12)
When
, (7.13)
, (7.14)
(7.15)
146 Applied Econometrics with Eviews Applications
(7.16)
Log-ln Model
(7.17)
Quadratic Forms
, i=1,…..,n (7.18)
Inverse Form
(7.19)
is large.
(7.20)
(7.21)
(7.22)
Lags
(7.23)
(7.24)
Theory/Testing Methodology
Testing-Down Approach
Testing-up Approach
When using the testing-up approach, begin with a specific model and
test-up to a more general model. To test for nonlinear terms and
interaction terms, a specific model that does not include one or more
of these terms may be appropriate. For example, the specific model
might not include nonlinear terms such as X2 and/or lnX, or
interaction terms such as X*A. We then use a Lagrange multiplier
test to test whether these terms should be added to the model.
150 Applied Econometrics with Eviews Applications
A number of other criteria and statistical tests are also used to test
for functional form. Some of these are the following:
i- Adjusted R2
SPECIFICATION TESTS
Residual Tests
Example 7.4
Example 7.5
16
Series: Residuals
14 Sample 1954Q1 1994Q4
Observations 164
12
Mean 4.12e-14
10 Median 2.84e-13
Maximum 1.73e-11
8 Minimum -1.23e-11
Std. Dev. 6.60e-12
6
Skewness 0.355963
Kurtosis 2.721785
4
Jarque-Bera 3.992331
2
Probability 0.135855
0
-1.0e-11 -2.8e-27 1.0e-11
Example 7.6
Test Equation:
Dependent Variable: RESID
Method: Least Squares
Sample: 1954Q1 1994Q4
Included observations: 164
Presample missing value lagged residuals set to zero.
ARCH-LM Test
This is LM test for ARCH (Engle, 1982) in the residuals. Ignoring ARCH
effect can cause inefficiency in estimation. The null hypothesis is that
154 Applied Econometrics with Eviews Applications
, (7.25)
e is the residual.
Example 7.7
Test Equation:
Dependent Variable: RESID^2
Method: Least Squares
Sample (adjusted): 1955Q1 1994Q4
Included observations: 160 after adjustments
, (7.26)
(7.27)
Example 7.8
Test Equation:
Dependent Variable: RESID^2
Method: Least Squares
Sample: 1954Q1 1994Q4
Included observations: 164
Collinear test regressors dropped from specification
156 Applied Econometrics with Eviews Applications
CHAPTER 8
PARAMETER STABILITY
̅ ̅
(8.1)
Example 8.1
̅ ̅
(8.2)
Example 8.2
CUSUM TEST
The CUSUM test statistic (Brown, Durbin, and Evans, 1975) is based
on cumulative sums of scaled recursive residuals and is plotting the
cumulative sum together with the 5% critical lines against time
(Vogelvang, 133). If the cumulative sum goes outside of the 5%
critical lines, then the test shows parameter instability. The CUSUM
test is based on the following recursive residual test statistics defined
as follows:
∑ (8.3)
Parameter Stabılıty 161
w : recursive residual
s : the standard error of the regression fitted to all T observations.
The expectations of the CUSUM statistics are zero under the null
hypothesis of constant parameters.
Example 8.3
40
30
20
10
-10
-20
-30
-40
1955 1960 1965 1970 1975 1980 1985 1990
CUSUM 5% Significance
∑ ∑ (8.4)
162 Applied Econometrics with Eviews Applications
[ ] (8.5)
1.0
0.8
0.6
0.4
0.2
0.0
-0.2
1955 1960 1965 1970 1975 1980 1985 1990
means that the recursive residual go outside the two standard error
bounds at this point of observation.
Example 8.5
.000 -4
.025
-8
.050
.075
.100
.125
.150
1955 1960 1965 1970 1975 1980 1985 1990
One-Step Probability
Recursive Residuals
Figure 8.3 Ewievs Result for One Step Ahead Forecast Test
Example 8.6
.000 -4
.025
-8
.050
.075
.100
.125
.150
1955 1960 1965 1970 1975 1980 1985 1990
N-Step Probability
Recursive Residuals
Example 8.7
8 160
6
120
4
80
2
40
0
0
-2
-4 -40
60 65 70 75 80 85 90 60 65 70 75 80 85 90
4 400
2
0
0
-2 -400
-4
-800
-6
-8 -1,200
60 65 70 75 80 85 90 60 65 70 75 80 85 90
(8.6)
Example 8.8
Test Equation:
Dependent Variable: INFL
Method: Least Squares
Sample: 1954Q1 1994Q4
Included observations: 164
CHAPTER 9
{ | }
(9.2)
168 Applied Econometrics with Eviews Applications
Example 9.1
Example 9.2
Table 9.2 shows the OLS estimation result for regression and includes
three independent variables.
In multivariate regression model,
estimates are interpreted under ceteris paribus condition. It means
that one unit change in temperature causes 0.003458 unit increase in
ice-cream consumption when the other variables price and income
hold constant.
(9.4)
{ | }
(9.5)
Example 9.3
(9.6)
The effect of changing age on income between male and female can
be interpreted as follows:
{ | }
(9.7)
Example 9.4
Table 9.4 shows the estimation result for the regression equation
specified in Eq.9.7. Now it is possible to interpret marginal effect of a
change in experience on wage when the other independent variables,
schooling and sex are held constant.
172 Applied Econometrics with Eviews Applications
(9.8)
(9.9)
Example 9.5
(9.10)
(9.11)
(9.12)
If we estimate the first model while, in fact, the second model is the
correct one, then it will increase the variance of the estimators for
the relevant regressors. Thus we will get higher variance and less
reliable estimates.
REGRESSOR SELECTION
To overcome the problem of including irrelevant regressors and
omitting relevant regressors in the regression model following
methods can be used together or separately:
If two regressors are omitted from the model, joint test should be
used rather than two tests separately.
Best option is to start with the optimum model and test, whether the
restrictions imposed by the model are correct, and whether the
restrictions not imposed by the model should be imposed.
CHAPTER 10
ENDOGENEITY
(10.1)
(10.2)
(10.3)
SOURCES OF ENDOGENEITY
Autocorrelation with Lagged Dependent Variable
(10.4)
(10.5)
(10.6)
(10.7)
(10.8)
, (10.9)
(10.10)
Measurement Error
, (10.11)
178 Applied Econometrics with Eviews Applications
, (10.12)
(10.13)
(10.14)
(10.15)
Simultaneity
(10.17)
Where is real per capita investment? This equation says that the
total consumption plus total investment should equal total income in
a closed economy without government intervention.
Endogeneıty 179
(10.18)
(10.19)
ENDOGENEITY TEST
Is there evidence that correlation between the potentially
endogenous variables and the error term is strong enough to result in
substantively biased estimates?
i- Run the first stage regressions using OLS and save the
residuals. This is called reduced form equation
(10.20)
SOLUTION
Ad hoc Approaches
Advantages:
Rigor and transparency;
(10.21)
(10.22)
∑ ̅ ̅
̂ (10.23)
∑ ̅ ̅
Example 10.1
The intrinsic ability may also influence the wages you obtain. As a
result a positive estimate on return on college degree may be
attributed.
(10.24)
(20.25)
(10.26)
(10.27)
25
Verbeek, M. (2004)
184 Applied Econometrics with Eviews Applications
In the first step, estimate a regression explaining x2i from x1i and z2i,
and save the residuals, save . This is the reduced form equation.
Next, add the residuals to the model of interest and estimate the
following regression by OLS.
(10.28)
- In the first step the reduced form is estimated by OLS (that is:
a regression of the endogenous regressors upon all
instruments).
26
Verbeek, M. (2004)
186 Applied Econometrics with Eviews Applications
Weak Instrument
Simple OLS can produce better results than TSLS estimator if there is
weak instrument issue. Whether the instruments are strongly
correlated enough with the potentially endogenous variables should
be checked in the first step.
To test for instrument relevance, make sure to run the first stage
regressions of the potentially endogenous variables on all of the
exogenous variables. The properties of the IV estimator can be very
poor, and the estimator can be severely biased, if the instruments
exhibit only weak correlation with the endogenous regressor(s). Even
if the sample size is large the normal distribution may provide a very
poor approximation to the true distribution of the IV estimator.
27
Cochrane (2001).
188 Applied Econometrics with Eviews Applications
Regressıons wıth Dummy Varıables 189
CHAPTER 11
(11.1)
(11.2)
(11.3)
𝐷 𝑦𝑖 𝛽 𝛽 𝛽 𝑥𝑖
y
𝐷 𝑦𝑖 𝛽 𝛽 𝑥𝑖
x
Figure 11.1 Dummy Independent Variable Regression Line
INTERPRETATION
We can say that β is a measure of how much Y tends to change when
X is changed by one unit. But, with the present dummy explanatory
Regressıons wıth Dummy Varıables 191
(11.4)
28
Verbeek, M. (2004)
192 Applied Econometrics with Eviews Applications
with the same experience and number of schooling, black people will
always get $0,326 lower wage than the white people with same
personal characteristics.
Example 11.2
(11.5)
Example 11.3
(11.6)
Dependent variable:
Type D1 D2
Blue collar (bc) 0 0
Professional (prof) 1 0
White collar (wc) 0 1
(11.7)
For Professional:
(11.8)
(11.9)
(11.10)
(11.11)
(11.12)
Example 11.4
TYPES OF VARIABLES29
Continuous or Quantitative Variables
29
http://www.unesco.org
196 Applied Econometrics with Eviews Applications
- Nominal variables
- Ordinal variables
- Dummy variables
- Preference variables
- Multiple response variables
Nominal Variables
Gender:
1. Male 2.Female
Marital Status:
1. Single 2. Married 3. Divorcee 4. Widower
Ordinal Variables
For instance;
1. The economic status of families in the society might be 'upper
lower' is lower than 'middle', but 'how much higher' is not
known.
2. A question in a questionnaire related with the time
involvement of employees in the social activities to measure
commitment and satisfaction. The respondents show their
involvement by selecting one of the following answers:
1 = Very low
2 = Low
3 = Medium
4 = High
5 = Very high
Categorical Variables
Preference Variables
Multiple response variables can take more than one value. For
instance, a survey question regarding the purpose of using computers
in research. The respondents could score more than one category.
Tıme Serıes 199
CHAPTER 12
TIME SERIES
STATIC MODELS
Suppose that we have time series data for two variables dated
contemporaneously. A static model can be written as follows:
(12.1)
Example 12.1
(12.2)
DYNAMIC MODELS
One way to model the dynamic relationships is to include lagged
values of regressors on the right hand side of the regression
equation; this is the basis of the distributed- lag model. The
distributed-lag model takes the form:
(12.3)
TIME SERIES
Time Series Regression with Lag
The researcher which is using time series data faces two problems
which do not exist in the cross sectional data:
Example 12.230
Table 12.1 Eviews Output for Time Series Regression with Lags
Dependent Variable: Y
Method: Least Squares
Sample (adjusted): 2005M07 2010M02
Included observations: 56 after adjustments
The result for time series analysis with lag is reported in Table 12.1.
Inflation in Turkey is effected by its own past values up to three lags.
The inflation four months ago does not have any impact on this
month´s inflation level.
30
Koop, G. (2005).
202 Applied Econometrics with Eviews Applications
Lag Selection
If the lag length is not clear, there are many different approaches to
lag length selection in the econometrics literature.
Step1. Choose the maximum possible lag length, qmax, that seems
reasonable to you.
(12.4)
(12.5)
Example 12.3
We also should drop the explanatory variable xt-5 from the model and
re-estimate with lag length equal to 3, giving the result in Table 12.4.
(12.6)
Where gfrt is the general fertility rate and pet is the real dollar value
of the personnel tax exemption. The objective is to see whether, in
aggregate, the decision to have children is linked to tax value of
having a child. The decision would not be based on the immediate
impact from the changes in the personnel exemption.
Example 12.5
(12.7)
(12.8)
Stationarity
ii- If y has a unit root, then its autocorrelations with past values
is close to one and does not change as lag length increases.
Example 12.6
t-Statistic Prob.*
t-Statistic Prob.*
Unit root test result given in Table 12.6 indicate that Croatian stock
market index is stationary in first difference (I(1)).
Random Walk
(12.9)
(12.10)
(12.11)
ρ = 0 implies that the AR(p) time series y contains a unit root; if -2 < ρ
< 0, then the series is stationary. In Eq. 12.11, ρ = 0 clarifies the logic
of unit root series. If ρ = 0 then the term yt-1 will drop out of the
equation and only terms involving Δy or its lags will be present in the
regression. “If a unit root is present, then the data can be differenced
to obtain stationarity”.
(12.12)
Example 12.7
Seasonality
For instance, with quarterly data, you can create the dummy
variables:
(1) D1 = 1 if an observation is from the first quarter (= 0 otherwise);
Tıme Serıes 211
Example 12.831
(12.13)
(12.14)
Example 12.9
Table 12.9 Eviews Output for Testing in The AR(P) with Deterministic Trend Model
Dependent Variable: CROBEX Method: Least Squares
Sample (adjusted): 1998M06 2012M12 Included observations: 175 after adjustments
Variable Coefficient Std. Error t-Statistic Prob.
C -9496.055 7984.226 -1.189352 0.2360
DATUM 0.013075 0.010929 1.196377 0.2333
CROBEX(-1) 0.958370 0.016081 59.59740 0.0000
DCROBEX(-1) 0.159776 0.075370 2.119878 0.0355
DCROBEX(-2) 0.087136 0.076015 1.146303 0.2533
DCROBEX(-3) 0.118046 0.074944 1.575127 0.1172
DCROBEX(-4) -0.066389 0.074777 -0.887828 0.3759
DCROBEX(-5) 0.183187 0.074444 2.460729 0.0149
DCROBEX(-6) -0.141791 0.075692 -1.873271 0.0628
DCROBEX(-7) 0.146991 0.075739 1.940763 0.0540
DCROBEX(-8) 0.210142 0.076206 2.757566 0.0065
R-squared 0.976308 Mean dependent var 1828.422
Adjusted R-squared 0.974864 S.D. dependent var 1093.873
S.E. of regression 173.4277 Akaike info criterion 13.21019
Sum squared resid 4932654. Schwarz criterion 13.40912
Log likelihood -1144.892 Hannan-Quinn criter. 13.29088
F-statistic 675.8239 Durbin-Watson stat 2.008453
Prob(F-statistic) 0.000000
Tıme Serıes 213
Result reported in Table 12.9 shows that the model does not include
deterministic trend.
(12.15)
Example 12.10
Moving Average
(12.16)
The simple moving average model says that observations that deviate
two or more periods are not correlated.
∑ (12.17)
Example 12.11
Table 12.10 Eviews Output for Moving Average Method
Dependent Variable: STATEUR
Method: Least Squares
Sample (adjusted): 3 4877
Included observations: 4875 after adjustments
Autocorrelation Function
{ }
{ }
(12.18)
(12.19)
(12.20)
(12.21)
(12.22)
Example 12.12
Table 12.12 Eviews Output for Autocorrelation Function
Sample: 1997M09 2012M12
Included observations: 184
32
Verbeek, M. (2004)
Tıme Serıes 217
MA(q) : (12.23)
AR(p): (12.24)
ARMA(p,q) :
(12.25)
(12.26)
(12.27)
(12.28)
218 Applied Econometrics with Eviews Applications
Example 12.13
ARIMA (p,q)
Stationarity of a stochastic process requires that the variances and
autocovariances are finite and independent of time.33
33
Verbeek, M. (2004)
Tıme Serıes 219
(12.29)
(12.30)
( ) (12.31)
The long run multiplier does not measure the effect of this type of
change. The “marginal effect” interpretation of regression
coefficients can be used for such temporary changes. Previously, we
were interested in the effect of increasing safety training in one
particular month on accident losses. But we did not discuss the effect
of increasing safety training permanently. The long run multiplier for
the ARDL( p, q) model is:
(12.32)
34
For instance, the effect of computer purchases on sales
222 Applied Econometrics with Eviews Applications
Example 12.14
Diagnostic Tests
*******************************************************************************
* Test Statistics * LM Version * F Version *
*******************************************************************************
* * * *
* A:Serial Correlation*CHSQ( 12)= 27.9926[.006]*F( 12, 113)= 2.5344[.005]*
* * * *
* B:Functional Form *CHSQ( 1)= .039857[.842]*F( 1, 124)= .037452[.847]*
* * * *
* C:Normality *CHSQ( 2)= 12.4107[.002]* Not applicable *
* * * *
* D:Heteroscedasticity*CHSQ( 1)= 42.0608[.000]*F( 1, 130)= 60.7955[.000]*
*******************************************************************************
A:Lagrange multiplier test of residual serial correlation
B:Ramsey's RESET test using the square of the fitted values
C:Based on a test of skewness and kurtosis of residuals
D:Based on the regression of squared residuals on squared fitted values
Tıme Serıes 223
(12.33)
The long run multiplier for the ARDL (p,q) model is:
Example 1535
(12.34)
If have unit roots then all the usual regression results might
be misleading and incorrect. This is called Spurious regression
problem. You should never run a regression of if the variables
have unit roots with the exception of cointegration method.
COINTEGRATION
Bivariate Cointegration (Engle-Granger Cointegration)
(12.35)
Written in this way, it is clear that the errors are a linear combination
of y and x. However, x and y both exhibit non-stationary unit root
behavior such that you would expect the error to also exhibit non-
stationary behavior. The error usually have a unit root. Statistically, it
is this unit root in the error term that causes the spurious regression
problem. However, it is possible that the unit roots in y and x “cancel
each other out” and that the resulting error is stationary. In this
35
KOOP, G. (2005)
226 Applied Econometrics with Eviews Applications
(12.36)
36
If it were included it could mean the errors could be growing steadily over time. This
would violate the idea of cointegration.
228 Applied Econometrics with Eviews Applications
Example 12.14
t-Statistic Prob.*
t-Statistic Prob.*
shows that there exists long run relationship between France and
Greece stock market indices. Furthermore, the long run multiplier is
1.299. This indicates that, in the long run, an increase in the France
stock market index by one unit would cause an increase in the
Greece stock market index 1.299 units.
B. GREECE
Null Hypothesis: GREECE has a unit root
Exogenous: Constant, Linear Trend
Lag Length: 1 (Automatic based on SIC, MAXLAG=31)
t-Statistic Prob.*
t-Statistic Prob.*
Exogenous: Constant
Lag Length: 0 (Automatic based on SIC, MAXLAG=30)
t-Statistic Prob.*
(12.37)
where et-1 is the error obtained from the regression model with y and
x,
(12.38)
Note that, if we know et-1, then the ECM is just a regression model.
That is, Δyt is the dependent variable and et-1 and Δxt are explanatory
variables.
In sum, the ECM has both long run and short run properties, such as:
- The errors in the model are not directly observed. How they
can be used as an explanatory variable in a regression?
Note that y and x should have unit roots and be cointegrated before
proceeding to ECM. So far we have discussed the simplest error
correction model. The ECM may also have lags and deterministic
trend like ARDL (p, q) model. Incorporating these features into the
ECM yields:
(12.39)
(12.40)
is intercept;
v is error term
After estimating the model reported in the Table 12.15, short run
coefficient has been found 0.41 and the coefficient of error term
236 Applied Econometrics with Eviews Applications
has been found 0.15 percent meaning that system corrects its
previous period disequilibrium at a speed of 0.15 percent monthly.
The sign of is found negative and significant indicating the non-
existence of long run equilibrium relationship between Greece and
France stock market indices.
Example 12.15
Diagnostic Checking
Test Equation:
Dependent Variable: RESID
Method: Least Squares
Date: 06/05/13 Time: 18:58
Sample: 12/31/1990 12/29/2006
Included observations: 4175
Presample missing value lagged residuals set to zero.
2,000
Series: Residuals
Sample 12/31/1990 12/29/2006
1,600 Observations 4175
Mean -6.18e-17
1,200 Median -0.506729
Maximum 350.3563
Minimum -413.0776
800 Std. Dev. 42.23462
Skewness -0.111649
Kurtosis 19.47798
400
Jarque-Bera 47242.53
Probability 0.000000
0
-400 -300 -200 -100 0 100 200 300
Dependent and Independent variables have unit roots but are not
cointegrated
Even though the time series have unit root, the Engle–Granger test
may indicate no cointegration. In such cases, the series may not be
trending together and may not have an equilibrium relationship.
Hence, a regression of y on x should not be run due to the spurious
regression problem. The presence of such characteristics suggests
that basic model should be respecified and other explanatory
variables included. Instead of working with y and x themselves,
difference series can be used.
In this case, the following ARDL model can be employed but with
changes in the variables:
(12.41)
For most time series variables, this specification should not suffer
from multicollinearity problems. Or, second variant of the ARDL
model based on the differenced data can be estimated.
Tıme Serıes 239
But if you are working with the differences of your time series and
then use the variant of the ARDL that involves differencing the data
you end up with second differenced data:
(12.42)
Where:
(12.43)
Multivariate Cointegration
Example 12.16
Table 12.16 Eviews Output for Johansen Cointegration Test
Sample (adjusted): 1/06/1989 12/29/2006
Included observations: 4691 after adjustments
Trend assumption: Linear deterministic trend
Series: GERMANY SPAIN ITALY
Lags interval (in first differences): 1 to 4
Unrestricted Cointegration Rank Test (Trace)
Hypothesized Trace 0.05
No. of CE(s) Eigenvalue Statistic Critical Value Prob.**
None * 0.003696 31.25751 29.79707 0.0337
At most 1 0.002596 13.88696 15.49471 0.0861
At most 2 0.000361 1.692354 3.841466 0.1933
Trace test indicates 1 cointegrating eqn(s) at the 0.05 level
* denotes rejection of the hypothesis at the 0.05 level
**MacKinnon-Haug-Michelis (1999) p-values
Unrestricted Cointegration Rank Test (Maximum Eigenvalue)
Hypothesized Max-Eigen 0.05
No. of CE(s) Eigenvalue Statistic Critical Value Prob.**
None 0.003696 17.37056 21.13162 0.1552
Tıme Serıes 241
Example 12.17
GERMANY(-1) 1.000000
SPAIN(-1) 3.364462
(0.75675)
[ 4.44593]
ITALY(-1) -1.974751
(0.22148)
[-8.91605]
C -139.3244
Diagnostic Checking
Test Equation:
Dependent Variable: RESID
Method: Least Squares
Sample: 1/04/1989 12/29/2006
Presample missing value lagged residuals set to zero.
1,600
Series: Residuals
1,400 Sample 1/04/1989 12/29/2006
Observations 4693
1,200
Mean 1.22e-15
1,000 Median 0.532233
Maximum 240.5610
800 Minimum -301.2473
Std. Dev. 42.80986
600
Skewness -0.298449
Kurtosis 8.059792
400
Jarque-Bera 5075.829
200
Probability 0.000000
0
-300 -200 -100 0 100 200
DIAGNOSTIC CHECKING
The model adequacy should be investigated in the last step of the
model building cycle. Following residual analysis may be performed;
∑ (12.44)
248 Applied Econometrics with Eviews Applications
CHAPTER 13
VOLATILITY
ARCH MODEL
This model is closely related to the Autoregressive Conditional
Heteroscedasticity (ARCH), (Engle, 1982).
, (13.1)
, (13.2)
, (13.3)
250 Applied Econometrics with Eviews Applications
If this is the case, the behavior of the series is not possible to predict.
Instead, the volatility of financial time series can be explored. In fact,
volatility changes over time are examined.
Let’s assume that either asset price follows a pure random walk
model or a random walk with drift:
̅, (13.4)
, (13.5)
All the series in the model should be stationary. If not, there should
be converted into stationary before proceed to estimate ARCH and
GARCH (Bollerslev, 1986) model. model.
Example 13.1
Let’s start with the model that includes Greece stock index.
GREECE C
Estimate the equation and check the residual through graph to
identify whether the periods of high volatility is followed by period of
high volatility and vice versa.
37
Koop, G. (2005)
252 Applied Econometrics with Eviews Applications
Go to estimate and choose ARCH model from the list. Choose 1 for
ARCH, 0 for GARCH, and normal distribution, then run the model
which is GARCH(1,0).
Table 13.3 Eviews Output for ARCH Test
Dependent Variable: D(GREECE)
Method: ML - ARCH (Marquardt) - Normal distribution
Sample (adjusted): 1/02/1989 12/29/2006
Included observations: 4695 after adjustments
Convergence achieved after 20 iterations
Presample variance: backcast (parameter = 0.7)
GARCH = C(2) + C(3)*RESID(-1)^2
Variable Coefficient Std. Error z-Statistic Prob.
C 1.113585 0.278842 3.993613 0.0001
Variance Equation
C 774.5477 6.553909 118.1810 0.0000
RESID(-1)^2 0.890166 0.021965 40.52626 0.0000
R-squared -0.000033 Mean dependent var 0.874198
Adjusted R-squared -0.000459 S.D. dependent var 41.93003
S.E. of regression 41.93965 Akaike info criterion 9.985032
Sum squared resid 8252919. Schwarz criterion 9.989156
Log likelihood -23436.86 Hannan-Quinn criter. 9.986482
Durbin-Watson stat 1.662417
TARCH model
Two intervals mean that the division is normally at zero and the
impact of positive and negative shocks on the volatility is
differentiated.
Variance Equation
38
See Glosten et al. (1993) for more information.
Volatılıty 255
EGARCH model
| |
(13.6)
39
See Verbeek (A guide to modern econometrics, 2004) for more information.
256 Applied Econometrics with Eviews Applications
DIAGNOSTIC CHECKING
1- serial correlation
Test Equation:
Dependent Variable: WGT_RESID^2
Method: Least Squares
Sample (adjusted): 1/03/1989 12/29/2006
Included observations: 4694 after adjustments
This model does not have serial correlation, ARCH effect but it is not
normally distributed. This model still can be accepted although
residuals are not normally distributed.
(13.7)
Variance Equation
(13.8)
8,000
6,000
4,000
4,000
3,000 2,000
2,000
0
1,000
-1,000
-2,000
1992 1994 1996 1998 2000 2002 2004 2006
Variance Equation
Turkey stock market and England stock market are also significant
meaning that Turkey and England stock market volatility influence
the volatility of Greece stock market. In In summary, volatility
spillover from Turkey and England stock markets to Greece stock
market is detected. Greece stock market volatility is mainly defined
by its own shocks such as ARCH and GARCH. It is also influenced by
the Turkey and England stock market volatility. Volatility of the
Germany stock market does not contribute in the volatility of Greece
stock market.
262 Applied Econometrics with Eviews Applications
Granger Causalıty 263
CHAPTER 14
GRANGER CAUSALITY
, (14.1)
264 Applied Econometrics with Eviews Applications
This model implies that last period’s value of x has explanatory power
for the current value of y. The coefficient measures the level of
the influence of on . If , then past values of x have no
effect on y and x does not Granger cause y. An alternative way of
expressing this concept is to say that if then past values of x
have no explanatory power for y over sample period”. Since we know
how to estimate the ARDL and carry out hypothesis tests, it is simple
to test for Granger causality.
,
(14.2)
, (14.3)
Example 14.1
(14.4)
(14.5)
CHAPTER 15
VECTOR AUTOREGRESSION
MODEL (VAR)
DEFINITION
In estimating Granger causality between x and y, restricted version of
an ARDL (p, q) model is used with y as the dependent variable to
investigate whether x Granger cause y. Causality in the other
direction can also be examined switching the roles of x and y in the
ARDL. The two equations can be written as follows:
(15.1)
(15.2)
The first one is used to test whether x Granger causes y and the
second one, whether y Granger causes x.
The Eq. (15.1) and Eq. (15.2) form a VAR model with two variables.
It is common to set p = q and use the same lag length for every
variable in each equation. The resulting model is known as a VAR( p)
model.
For example, the following VAR (p) has three variables, y, x and z;
(15.3)
(15.4)
(15.5)
It is assumed that all the variables in the VAR (p) are stationary.
Estimation and testing can be carried out by using OLS. P-values or t-
statistics help to examine whether individual coefficients are
significant.
Vector Autoregressıon Model (VAR) 269
- It is easy to use.
One of the drawback in VARs is they are not theoretical which are not
based on economic theory. There is theory in selecting the variables
for the VAR.
For example, the interaction between interest rates, the price level,
money supply and real GDP. There have been created many
sophisticated models for this relationship (The IS–LM model).
However, a VAR model says: Interest rates, price level, money supply
and real GDP are related. This relationship can be modeled as each
variable depends on lags of itself and all other variables. There is no
need to establish any link between the empirical VAR and a
theoretical macroeconomic model.
270 Applied Econometrics with Eviews Applications
(15.6)
(15.7)
Vector Autoregressıon Model (VAR) 271
(15.8)
̂ ̂ ̂ ̂ ̂ (15.9)
The strategy for how to forecast one period into the future can be
used for two periods on the condition that one is an extension. In the
one period, and are used to create ̂ and ̂ . In the two
period, ̂ and ̂ subject to and . Since the data runs
until period T, what is and are not known. Consequently
and are replaced by ̂ and ̂ . That is, use the
relevant equation from the VAR, is used ignoring the error, replacing
the coefficients by their OLS estimates and replacing past values of
the variables that are not observed by forecast.
In a formula:
̂ ̂ ̂ ̂ ̂ ̂ ̂ (15.10)
̂ ̂ ̂ ̂ ̂ ̂ ̂ (15.11)
(15.12)
(15.13)
. (15.14)
The VECM is the same as a VAR with differenced variables, except for
the term . This error correction variable can be estimated by
running an OLS regression of y on x and saving the residuals. Then
OLS can be used to estimate ECMs, and p-values and correspondent
confidence intervals can be obtained. Lag length selection and
forecasting also can be done in a similar way to the VAR, with the
forecasts of the error correction term, must be calculated.
Vector Autoregressıon Model (VAR) 273
Example 15.1
Table 15.1 Eviews Output for VAR Analysis
Vector Autoregression Estimates
Sample (adjusted): 1/01/1991 12/29/2006
Included observations: 4174 after adjustments
Standard errors in ( ) & t-statistics in [ ]
REFERENCES
Baltagi, B.H. (2008).Econometric Analysis of Panel Data. Fourth
Edition. Great Briatin: John Wiley & Sons
Brown, R.L.; Durbin, J. & Evans, J.M. (1975). Techniques for Testing
the Constancy of Regression Relationships over Time. Journal
of the Royal Statistical Society. Ser.B. 37.p.149 – 192.
Engle, R.F. & B.S. Yoo (1991). Cointegrated Economic Time Series: An
Overview with New Results” in R.F. Engle and C.W.J. Granger
(eds.), Long-run Economic Relationships: Readings in
Cointegration, Oxford: Oxford University Press.
Fox, J.H.(1958). Criteria of Good Research. Phi Delta Kappan. Vol. 39.
P. 285–86.
Savin, N.E. & White, K.J. (1977). The Durbin-Watson Test for Serial
Correlation with Extreme Sample Sizes or Many Regressors.
Econometrica 45.p. 1989-1996.
Stock, J.H. & Watson, M.W. (1988). Testing for Common Trends.
Journal of the American Statistical Association 83.p. 1097-
1107.
Z 0,00 0,01 0,02 0,03 0,04 0,05 0,06 0,07 0,08 0,09
0,0 0,0000 0,0040 0,0080 0,0120 0,0160 0,0199 0,0239 0,0279 0,0319 0,0359
0,1 0,0398 0,0438 0,0478 0,0517 0,0557 0,0596 0,0636 0,0675 0,0714 0,0753
0,2 0,0793 0,0832 0,0871 0,0910 0,0948 0,0987 0,1026 0,1064 0,1103 0,1141
0,3 0,1179 0,1217 0,1255 0,1293 0,1331 0,1368 0,1406 0,1443 0,1480 0,1517
0,4 0,1554 0,1591 0,1628 0,1664 0,1700 0,1736 0,1772 0,1808 0,1844 0,1879
0,5 0,1915 0,1950 0,1985 0,2019 0,2054 0,2088 0,2123 0,2157 0,2190 0,2224
0,6 0,2257 0,2291 0,2324 0,2357 0,2389 0,2422 0,2454 0,2486 0,2517 0,2549
0,7 0,2580 0,2611 0,2642 0,2673 0,2704 0,2734 0,2764 0,2794 0,2823 0,2852
0,8 0,2881 0,2910 0,2939 0,2967 0,2995 0,3023 0,3051 0,3078 0,3106 0,3133
0,9 0,3159 0,3186 0,3212 0,3238 0,3264 0,3289 0,3315 0,3340 0,3365 0,3389
1,0 0,3413 0,3438 0,3461 0,3485 0,3508 0,3531 0,3554 0,3577 0,3599 0,3621
1,1 0,3643 0,3665 0,3686 0,3708 0,3729 0,3749 0,3770 0,3790 0,3810 0,3830
1,2 0,3849 0,3869 0,3888 0,3907 0,3925 0,3944 0,3962 0,3980 0,3997 0,4015
1,3 0,4032 0,4049 0,4066 0,4082 0,4099 0,4115 0,4131 0,4147 0,4162 0,4177
1,4 0,4192 0,4207 0,4222 0,4236 0,4251 0,4265 0,4279 0,4292 0,4306 0,4319
1,5 0,4332 0,4345 0,4357 0,4370 0,4382 0,4394 0,4406 0,4418 0,4429 0,4441
1,6 0,4452 0,4463 0,4474 0,4484 0,4495 0,4505 0,4515 0,4525 0,4535 0,4545
1,7 0,4554 0,4564 0,4573 0,4582 0,4591 0,4599 0,4608 0,4616 0,4625 0,4633
1,8 0,4641 0,4649 0,4656 0,4664 0,4671 0,4678 0,4686 0,4693 0,4699 0,4706
1,9 0,4713 0,4719 0,4726 0,4732 0,4738 0,4744 0,4750 0,4756 0,4761 0,4767
2,0 0,4772 0,4778 0,4783 0,4788 0,4793 0,4798 0,4803 0,4808 0,4812 0,4817
2,1 0,4821 0,4826 0,4830 0,4834 0,4838 0,4842 0,4846 0,4850 0,4854 0,4857
2,2 0,4861 0,4864 0,4868 0,4871 0,4875 0,4878 0,4881 0,4884 0,4887 0,4890
2,3 0,4893 0,4896 0,4898 0,4901 0,4904 0,4906 0,4909 0,4911 0,4913 0,4916
2,4 0,4918 0,4920 0,4922 0,4925 0,4927 0,4929 0,4931 0,4932 0,4934 0,4936
2,5 0,4938 0,4940 0,4941 0,4943 0,4945 0,4946 0,4948 0,4949 0,4951 0,4952
2,6 0,4953 0,4955 0,4956 0,4957 0,4959 0,4960 0,4961 0,4962 0,4963 0,4964
2,7 0,4965 0,4966 0,4967 0,4968 0,4969 0,4970 0,4971 0,4972 0,4973 0,4974
2,8 0,4974 0,4975 0,4976 0,4977 0,4977 0,4978 0,4979 0,4979 0,4980 0,4981
2,9 0,4981 0,4982 0,4982 0,4983 0,4984 0,4984 0,4985 0,4985 0,4986 0,4986
3,0 0,4987 0,4987 0,4987 0,4988 0,4988 0,4989 0,4989 0,4989 0,4990 0,4990
283
The table shows the critical t-values for a given alpha level (one-tailed) and degrees of
freedom
The degrees of freedom are the rows (denoted by df)
Note: The probability levels represent the whole of alpha (you must divide alpha by 2
if you want the t-value for a two-tailed test)
5 6,61 5,79 5,41 5,19 5,05 4,95 4,88 4,82 4,77 4,74 4,68 4,62 4,56 4,53 4,50 4,46 4,43 4,40 4,36
6 5,99 5,14 4,76 4,53 4,39 4,28 4,21 4,15 4,10 4,06 4,00 3,94 3,87 3,84 3,81 3,77 3,74 3,70 3,67
7 5,59 4,74 4,35 4,12 3,97 3,87 3,79 3,73 3,68 3,64 3,57 3,51 3,44 3,41 3,38 3,34 3,30 3,27 3,23
8 5,32 4,46 4,07 3,84 3,69 3,58 3,50 3,44 3,39 3,35 3,28 3,22 3,15 3,12 3,08 3,04 3,01 2,97 2,93
9 5,12 4,26 3,86 3,63 3,48 3,37 3,29 3,23 3,18 3,14 3,07 3,01 2,94 2,90 2,86 2,83 2,79 2,75 2,71
10 4,96 4,10 3,71 3,48 3,33 3,22 3,14 3,07 3,02 2,98 2,91 2,85 2,77 2,74 2,70 2,66 2,62 2,58 2,54
11 4,84 3,98 3,59 3,36 3,20 3,09 3,01 2,95 2,90 2,85 2,79 2,72 2,65 2,61 2,57 2,53 2,49 2,45 2,40
12 4,75 3,89 3,49 3,26 3,11 3,00 2,91 2,85 2,80 2,75 2,69 2,62 2,54 2,51 2,47 2,43 2,38 2,34 2,30
13 4,67 3,81 3,41 3,18 3,03 2,92 2,83 2,77 2,71 2,67 2,60 2,53 2,46 2,42 2,38 2,34 2,30 2,25 2,21
14 4,60 3,74 3,34 3,11 2,96 2,85 2,76 2,70 2,65 2,60 2,53 2,46 2,39 2,35 2,31 2,27 2,22 2,18 2,13
15 4,54 3,68 3,29 3,06 2,90 2,79 2,71 2,64 2,59 2,54 2,48 2,40 2,33 2,29 2,25 2,20 2,16 2,11 2,07
16 4,49 3,63 3,24 3,01 2,85 2,74 2,66 2,59 2,54 2,49 2,42 2,35 2,28 2,24 2,19 2,15 2,11 2,06 2,01
17 4,45 3,59 3,20 2,96 2,81 2,70 2,61 2,55 2,49 2,45 2,38 2,31 2,23 2,19 2,15 2,10 2,06 2,01 1,96
18 4,41 3,55 3,16 2,93 2,77 2,66 2,58 2,51 2,46 2,41 2,34 2,27 2,19 2,15 2,11 2,06 2,02 1,97 1,92
19 4,38 3,52 3,13 2,90 2,74 2,63 2,54 2,48 2,42 2,38 2,31 2,23 2,16 2,11 2,07 2,03 1,98 1,93 1,88
20 4,35 3,49 3,10 2,87 2,71 2,60 2,51 2,45 2,39 2,35 2,28 2,20 2,12 2,08 2,04 1,99 1,95 1,90 1,84
21 4,32 3,47 3,07 2,84 2,68 2,57 2,49 2,42 2,37 2,32 2,25 2,18 2,10 2,05 2,01 1,96 1,92 1,87 1,81
22 4,30 3,44 3,05 2,82 2,66 2,55 2,46 2,40 2,34 2,30 2,23 2,15 2,07 2,03 1,98 1,94 1,89 1,84 1,78
23 4,28 3,42 3,03 2,80 2,64 2,53 2,44 2,37 2,32 2,27 2,20 2,13 2,05 2,01 1,96 1,91 1,86 1,81 1,76
24 4,26 3,40 3,01 2,78 2,62 2,51 2,42 2,36 2,30 2,25 2,18 2,11 2,03 1,98 1,94 1,89 1,84 1,79 1,73
25 4,24 3,39 2,99 2,76 2,60 2,49 2,40 2,34 2,28 2,24 2,16 2,09 2,01 1,96 1,92 1,87 1,82 1,77 1,71
26 4,23 3,37 2,98 2,74 2,59 2,47 2,39 2,32 2,27 2,22 2,15 2,07 1,99 1,95 1,90 1,85 1,80 1,75 1,69
27 4,21 3,35 2,96 2,73 2,57 2,46 2,37 2,31 2,25 2,20 2,13 2,06 1,97 1,93 1,88 1,84 1,79 1,73 1,67
28 4,20 3,34 2,95 2,71 2,56 2,45 2,36 2,29 2,24 2,19 2,12 2,04 1,96 1,91 1,87 1,82 1,77 1,71 1,65
29 4,18 3,33 2,93 2,70 2,55 2,43 2,35 2,28 2,22 2,18 2,10 2,03 1,94 1,90 1,85 1,81 1,75 1,70 1,64
30 4,17 3,32 2,92 2,69 2,53 2,42 2,33 2,27 2,21 2,16 2,09 2,01 1,93 1,89 1,84 1,79 1,74 1,68 1,62
40 4,08 3,23 2,84 2,61 2,45 2,34 2,25 2,18 2,12 2,08 2,00 1,92 1,84 1,79 1,74 1,69 1,64 1,58 1,51
60 4,00 3,15 2,76 2,53 2,37 2,25 2,17 2,10 2,04 1,99 1,92 1,84 1,75 1,70 1,65 1,59 1,53 1,47 1,39
120 3,92 3,07 2,68 2,45 2,29 2,17 2,09 2,02 1,96 1,91 1,83 1,75 1,66 1,61 1,55 1,50 1,43 1,35 1,25
inf 3,84 3,00 2,60 2,37 2,21 2,10 2,01 1,94 1,88 1,83 1,75 1,67 1,57 1,52 1,46 1,39 1,32 1,22 1,00
285
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………
……………………………………………………………………………………………………………