Research What Is Research?
Research What Is Research?
What is Research?
Research is a process to discover new knowledge.
1)Research is a careful and detailed study into a specific problem,
concern, or issue using the scientific method.
2)Research is a systematic inquiry that investigates hypotheses, suggests
new interpretations of data or texts, and posses new questions for future
research to explore.
Types of Research
Research may be very broadly defined as systematic gathering of data
and information and its analysis for advancement of knowledge in any
subject. Research attempts to answer intellectual and practical questions
through application of systematic methods. The types of research
methodologies vary and are often classified into few categories. Specific
academic fields tend to apply certain methodologies more than others:
Qualitative: Involves describing in details specific situations using
research tools like interviews, surveys, and observation.
Quantitative: Requires quantifiable data involving numerical and statistical
explanations. Quantitative researchers seek to explain the causes of change
primarily through objective measurement and quantitative analysis
(statistics).
Correlation/Regression Analysis: Involves determining the strength of the
relationship between two or more variables. Correlation / regression
researchers determine whether correlations exist between two quantitative
variables.
Experimental: Relies on controlled experiments that compare the outcome
for an experimental and a control group that differ in a defined way.
What is Econometrics?
1) Econometrics may be defined as the social science in which the
tools of economic theory, mathematics, and statistical inference are
applied to the analysis of economic phenomenon.
2) Econometrics is concerned with the empirical determination of
economic laws.
Methodology of Econometrics
The traditional or classical methodology can be explained in the following
steps.
1)Statement of the theory
2)Specification of the mathematical model of the theory
3)Specification of the statistical, or econometric, model.
4) Obtaining the data
5)Estimation of the parameters of econometric model
6)Hypothesis testing
7)Forecasting or prediction
8)Using model for control or policy purposes.
To explain the preceding steps , let us consider the well-known Keynesian
1:Statement of Theory or Hypothesis
Keynesian postulated that the marginal propensity to consume (MPC),
the rate of change of consumption for a unit (say, a dollar) change in
income, is greater than zero but less than 1.
2. Specification of the Mathematical Model of Consumption
Although Keynes postulated a positive relationship between consumption
and income, he did not specify the precise form of the functional
relationship between the two. For simplicity, a mathematical economist
might suggest the following form of the Keynesian consumption
function:
Y=β1+ β2X 0< β2<1………………………………..1)
Where Y= consumption expenditure and X =income, where β1 and β2 known as the
parameters of the model, are, respectively, the intercept and slope coefficients.
The slope coefficient β2 measures the MPC. This equation states that consumption is
linearly related to income, is an example of mathematical model of the relationship between
consumption and income that is called the consumption function in economics. A model is
simply a set of mathematical equations. If the model has only one equation, as indicated, it
is called a single equation model, whereas if it has more than one equation, it is known as a
multiple-equation model.
The variable appearing on the left side of the equality sign is called the dependent variable
and the variable(s) on the right side is called the independent variables. Thus, in Keynesian
function, consumption expenditure is dependent variable and income is the explanatory
variable.
1) Specification of Econometric Model
The above mathematical model of the consumption function is of limited
interest to econometrician as it shows the exact relationship between
consumption and income. But relationships between economic variables
are generally inexact. There are other variables that also affect
consumption expenditures rather than income. For example, size of family,
ages of the members in the family, family religion, etc are likely to exert
some influence on consumption.
To allow for the inexact relationship between economic variables, the
econometrician would modify the deterministic consumption function in
the above equation.
Y= β1+ β2X +µ………………………….2)
Where u, known as the disturbance, or error term is a random (stochastic)
variable that has well defined probabilistic properties. The disturbance term u
may well represent all those factors that affect consumption but are not taken
into account explicitly.
The above equation is an example of econometric model. More technically, it is
an example of a linear regression model. This function hypothesizes that the
dependent variable Y(consumption) is linearly related to the explanatory
variable X(income) but that the relationship between the two is not exact; it is
subject to individual variation.
4) Obtaining the Data
To estimate the econometric model mention above, there is a need to obtain
numerical values of β1 and β2 we need data.
5) Estimation of the Econometric Model
After obtaining data, next step is to estimate the parameters of the consumption
function. The numerical estimates of the parameters give empirical content to
the consumption function. The statistical technique of regression analysis is the
main tool to obtain the estimates. By using this technique and data, the estimates
of β1 and β2 can be obtained.
6) Hypothesis Testing
The obtained estimates are in accord with the expectation of the theory that is
being tested.
7) Forecasting or Prediction
If the chosen model does not refute the hypothesis or theory under
consideration, it can be used to predict the future value(s) of the dependent or
forecast, variable Yon the basis of the known or expected future (s) of the
explanatory, or, predictor, variable X.
8) Using of Model for Control or Policy Purpose
The estimated model can be used to control the issue or for further policy
suggestion or proposal.
Data
Economic data sets comes in various forms. We describe the most important
data structures encountered in applied econometrics.
1)Time Series Data
In time series data we observe the values of one or more variables over a period of
time (e.g., GDP for several quarters or years.
2) Cross-sectional Data
In cross section data, values of one or more variables are collected for several
sample units, or entities, at the same point in time (e.g., crime rates for 50 states in
the United States for a given year)
3) Panel Data
In panel data the same cross-sectional unit (say a family or a firm or a state) is
surveyd over time. In short panel data has space as well as time dimensions. There
are other names for panel data, such as pooled data (pooling of time-series and
cross-sectional observations) etc.
Research problem and Use of Statistical Soft ware
There are many statistical soft wares. We describe some important
software here.
1) SPPS 2) Eviews
What is SPSS?
It is a Statistical Package for the Social Sciences. SPSS is chosen because
of its popularity within both academic and business circles, making it the
most widely used package of its type. SPSS is also a versatile package
that allows many different types of analyses, data transformations, and
forms of output in short, it will more than adequately serve our purposes.
SPSS and Ordinary Least Square (OLS) Regression Technique
The dependence of variable on other independent variables. Suppose we
have few variables. The regression function is as follows.
Y= 𝛽1 𝑥1 + 𝛽2 𝑥2 + ⋯ 𝐵𝑘 𝑥𝑘
Y = dependent variable
𝑥1 , 𝑥2 , 𝑥𝑘 are independent variables.𝛽1 , 𝛽2 , … . 𝛽𝑘 are the coefficients.
We have to check the effect of these explanatory variables on the
dependent variables.
1 ) Linear Probability Model
We begin by an examination of the simplest possible model, which has a
dichotomous dummy variable as the dependent variable. We assume that
the dummy dependent variable is explained by only one regressor. For
example, we are interested in examining the labour force participation
decision of adult females. The question is : why do some women enter
the labour force while others do not? We assume that the decision to go
out to work or not is affected by only one explanatory variable (X2i) the
level of family income.
The model is:
Y= 𝛽1 + 𝛽2 + 𝑢𝑖
But since Yi is a dummy variable, we can rewrite the model as:
Di= 𝛽1 + 𝛽2 𝑥2 + 𝑢𝑖
Where 𝑥2𝑖 + is the level of family income (a continuous variable); Di is dichotomous
dummy defined as:
Di= {1 if the ith individual is working
0 if the ith individual is not working
And 𝑢𝑖 is the usual disturbance.
One of the assumptions of the CLRM is that E (𝑢𝑖)=0. This, for given 𝑥2𝑖
𝐸 𝐷𝑖 = 𝛽1 + 𝛽2 𝑥2𝑖
However, since Di is of qualitative nature, here the interpretation is different. Let us
define pi as the probability of Di =1 𝑃𝑖= 𝑃𝑟 𝐷𝑖 = 1 ; therefore 1-Pi is the probability of
… … … . . Di =0 1 − 𝑃𝑖= 𝑃𝑟 𝐷𝑖 = 0
To put this mathematically:
that the time series behavior of Yt is largely determined by its own value in
the preceeding period. So what will happen in this largely dependent on
what happened in t-1.
Alternatively, what will happen in t+1 will be determined by the behavior
of the series in the current time t.
Condition for stationarity
The above equation introduces the constant 𝜙 <1 in order to guarantee
stationarity as defined in the previous section. If we have 𝜙 >1, then Yt
will tend to get larger in each period, so we would have an explosive
series.
𝑌𝑡 = 𝜙𝑌𝑡−1 + 𝜙2𝑌𝑡−2 + 𝜇𝑡
Similarly, the AR(P) model will be an autoregressive model of order p, and will have p
lagged terms, as in the following:
𝑌𝑡 = 𝜇𝑡 + 𝜃𝑡−1
Thus the implication behind the MA (1) model is that Yt depends on the value of the
immediate past error, which is known at time t.
The MA(q) model
The general form of the MA model is an MA(q) model of the form:
Where ai can now differ for each country in the sample. At this point there
may be a question of whether the B coefficient should also vary across
different countries, but this would require a separate analysis for each one of
the N cross-sectional units and the pooling assumption is the basis of panel
data estimation.
Different Methods of Estimation
In general, simple linear panel data can be estimated using three different
methods: a) with a common constant as in equation
b) Allowing for fixed effects
c) allowing for random effects
a) the model estimates a common constant (pooled OLS method) a for all
cross-sections (common constant for countries). It implies that there are no
differences between the estimated cross-section and it is useful under the
null hypothesis that the data set is a priori homogenous.
Fixed Effects Method
In the fixed effects methods the constant is treated as group (section)-
specific. This means that the model allows for different constants for each
group (section). So the model is similar to that of first equation of the model.
The fixed effect estimator is also known as the least squares dummy variable
(LSDV) estimator because, to allow for different constant for each group, it
includes a dummy variable for each group.
𝐴𝑖 = 𝑎 + 𝑣𝑖
𝑌𝑖𝑡 = 𝛼𝑖 + 𝛽1 𝑥1 𝑖𝑡 + 𝛽2 𝑥2 𝑖𝑡 + ⋯ 𝛽𝑅 𝑥𝑅 𝑖𝑡 + 𝑣𝑖 + 𝑣𝑖
Eviews and Panal Data Analysis
In panel data we will use both time series and cross section data. We have to stake the
data.
1) Short Panel (in short panel n >T) where n is no of cross section
2) Long Panel(T>n) where T is years or time
3) To work on short panel data is good
4) Here again in panel data we check the stationarity.
5) Is there a 1(0) means stationary at level or is there stationary at first difference 1(1)
6) If all the variables are stationary then we are able to apply regression fixed effects
and random effects.
7) Results from fixed and random can be similar but we have to check which model will
be more appropriate.
1)For this purpose we will have to apply Hausman test (Hausman test)
2)H0= random effect is appropriate
3)H1 = fixed effect is appropriate
4)If hausman test is significant then we will use random effect
5)If hausman test is insignificant then we can use fixed effect
6) We have a panel data.
7) Open data sheet
8)Select the data from original sheet and past it in eviews.
9)We have the variables with data.
1)Panel Data and Eviews
2)1)for data, double click on eviews
3)Click on file
4)Click on import
5)Click on data name, next, next and finish.
6)You have the data variable
1)Unit Roots and Panel Data
2)Select the variables
3) Go to quick
4) Go to series statistics
5) Click on Unit root
6)Enter the name and press ok
7)You have unit root tests
8) Click on level, intercept, and press ok
9) We have results of unit roots
10)If the statistics of all(i.e., LLC, IMP, ADF and PP) or few of them are
significant then the series are stationary.
OLS and Panel data