Solutions for Tutorial 2

This document provides a tutorial on simple and multiple regression analysis using STATA, covering key concepts such as coefficient estimation, omitted variable bias, and variance analysis. It discusses the derivation of the intercept and slope coefficients, the implications of including or excluding variables in a regression model, and the trade-offs between bias and variance. Additionally, it explains how to calculate percentiles in a dataset.


Tutorial 2

Simple & Multiple Regression & STATA

 We know that our dependent variable Y is normally distributed with a mean of “a” and a
variance of σ². Our explanatory variable X is also normally distributed with a mean of
“b” and a variance of σ².
 We know that β₀ is the intercept of our regression.
 Therefore, we need to find the estimate for our coefficient: β̂₀
 We know the formulas for the two estimators from the lecture:

 Additionally, we can remember our formulas for ȳ and x̄ (and remember that in this
case, the bar above y and x represents the mean)
 We know the formulas for the means (and even though we do not need them to solve
this exercise, it is good to be aware of them again!):

ȳ = (1/n) ∑ᵢ₌₁ⁿ yᵢ   and   x̄ = (1/n) ∑ᵢ₌₁ⁿ xᵢ

 So for this example, we could also see our equation as:

E[β̂₀] = E[ȳ] − β̂₁·E[x̄] = E[(1/n) ∑ᵢ₌₁ⁿ yᵢ] − β̂₁·E[(1/n) ∑ᵢ₌₁ⁿ xᵢ]

 And given that we know that β̂₁ = 10, the solution is relatively easy:

E[β̂₀] = E[ȳ] − 10·E[x̄] = a − 10b

since ȳ has expectation “a” and x̄ has expectation “b”.
 To really understand what we are doing behind the scenes of this solution, it is
important to remember how we arrive at the formula for β̂₀.

 β̂₀ comes from the minimisation of the sum of squared residuals:

minimise: ∑ᵢ₌₁ⁿ ûᵢ²
 How do we arrive at the sum of squared residuals?

min ∑ᵢ₌₁ⁿ ûᵢ² = min ∑ᵢ₌₁ⁿ (yᵢ − β̂₀ − β̂₁xᵢ)²

 In other words: we have our values of y (our dependent variable/explained variable)
minus the way we predict these values with β̂₀ and β̂₁·xᵢ, all squared.
 We know that in order to solve the minimisation, we have to take the derivative of
the sum of squared residuals with respect to the parameter of interest, β̂₀:
o ! Calculus also provides us with rules, such as the chain rule, in order to solve
the derivative.
o The 2 in front of our equation comes from the previous exponent, and its
negative sign comes from the derivative of the argument of the objective
function with respect to β̂₀, which is −1.

∂(∑ᵢ₌₁ⁿ ûᵢ²) / ∂β̂₀ = −2 ∑ᵢ₌₁ⁿ (yᵢ − β̂₀ − β̂₁xᵢ)

 And this derivative needs to be set to 0.

∂(∑ᵢ₌₁ⁿ ûᵢ²) / ∂β̂₀ = −2 ∑ᵢ₌₁ⁿ (yᵢ − β̂₀ − β̂₁xᵢ) = 0

 As we are setting it to 0, we can “ignore” the −2 by dividing both sides of the
equation by −2. Furthermore, we also need to remember that when we have a
summation outside of our brackets, this means that we have to sum up all the
individual terms, which gives us the following:

→ ∑ᵢ₌₁ⁿ yᵢ − ∑ᵢ₌₁ⁿ β̂₀ − ∑ᵢ₌₁ⁿ β̂₁xᵢ = 0
 And again, we can factor our estimators out in front of the summations, which in the
case of β̂₀ leaves us with:

β̂₀ ∑ᵢ₌₁ⁿ 1 = β̂₀·n

 By solving this equation, we can isolate β̂₀:


→ ∑ᵢ₌₁ⁿ yᵢ − n·β̂₀ − β̂₁ ∑ᵢ₌₁ⁿ xᵢ = 0

→ β̂₀ = (1/n) ∑ᵢ₌₁ⁿ yᵢ − β̂₁ · (1/n) ∑ᵢ₌₁ⁿ xᵢ = ȳ − β̂₁x̄

 Now, after we have taken the derivative, we are back at the start, and we can see
where the equation that helped us estimate β̂₀ originally comes from.
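The closed-form result β̂₀ = ȳ − β̂₁x̄ is easy to verify numerically. Below is a small sketch in Python (not part of the original tutorial; the data are simulated and all names are our own choices):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 500
x = rng.normal(2.0, 1.0, n)      # explanatory variable
u = rng.normal(0.0, 1.0, n)      # error term
y = 3.0 + 10.0 * x + u           # true beta0 = 3, beta1 = 10

# OLS slope from the usual formula, then the intercept as ybar - beta1_hat * xbar
beta1_hat = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
beta0_hat = y.mean() - beta1_hat * x.mean()

# Cross-check against numpy's own least-squares fit
slope, intercept = np.polyfit(x, y, 1)
```

The manually computed intercept matches the one from np.polyfit, confirming that the formula derived above is exactly what OLS uses.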

 We know that β̂₁ is an unbiased estimator in Model 1, which has two explanatory
variables, X1 and X2. In comparison, Model 2 has only one explanatory variable.
 To compare the two estimators of β₁, we can call the one from Model 2 β̃₁ to
differentiate the two coefficients.
 The answer is:
No. Model 1 should not be preferred over Model 2.
 And here is why:
 Whenever we include another variable, we have a trade-off:
o A trade-off between variance and bias
 In Model 2, we exclude X2 and its coefficient  therefore, everything that X2 and its
coefficient capture goes into the error term of Model 2.
 In order to solve this exercise & find an answer, we need to consider a few possible
scenarios concerning the values of our coefficients & investigate the trade-offs.
Step 1. Examining the Bias-Aspect:

Helpful Notes: Remember the definition for Omitted-variable bias

 The OVB occurs when a statistical model leaves out one or more relevant variables. The
bias results in the model attributing the effect of the missing variables to those that were
included. It arises when the regressor X is correlated with an omitted variable. For omitted
variable bias to occur, two conditions must be fulfilled:
1. The omitted variable is correlated with the included regressor: δ̃₁ ≠ 0
2. The omitted variable is a determinant of the dependent variable Y: β₂ ≠ 0

Together, this results in a violation of the OLS assumption E(uᵢ | Xᵢ) = 0.

Case 1: β₂ is different from 0.

β₂ ≠ 0

 In this Case 1, our Model 2 would exclude an important variable. This would
introduce a bias!
 Therefore, our β̃₁ from Model 2 would be biased.
o Because in this case, including X2 in Model 1 actually explains our outcome
better; otherwise, the effect of β₂ would have been “absorbed” by the
error term.
 This bias comes from the relationship between X1 and X2  it can be written as
δ̃₁ · β₂ (this describes the bias in this example).
 (We also know that δ̃₁ is the coefficient from the auxiliary regression where the
excluded variable is the dependent variable.)

Case 2: β₂ is equal to 0.

β₂ = 0

o Here, we know that β̃₁ from Model 2 would be unbiased.
o Why? Because the bias (δ̃₁ · β₂) is a product with one of its elements equal to 0.
It would therefore disappear in this case.
o Note: in Model 1, β̂₁ remains unbiased.

 By simply looking at the bias, we would always choose Model 1, as this prevents us from
having an omitted variable bias.
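The omitted-variable-bias formula can be illustrated with a quick simulation. This is a sketch in Python (simulated data; the values δ₁ = 0.5, β₁ = 2, β₂ = 3 are our own choices, not from the exercise):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100_000
x1 = rng.normal(0.0, 1.0, n)
x2 = 0.5 * x1 + rng.normal(0.0, 1.0, n)   # X2 is correlated with X1 (delta1 = 0.5)
u = rng.normal(0.0, 1.0, n)
y = 2.0 * x1 + 3.0 * x2 + u               # beta1 = 2, beta2 = 3 (beta2 != 0)

# Model 2: regress y on x1 only, omitting x2
beta1_tilde = np.polyfit(x1, y, 1)[0]

# delta1_hat: slope from regressing the omitted x2 on the included x1
delta1_hat = np.polyfit(x1, x2, 1)[0]

# The OVB formula predicts beta1_tilde close to beta1 + delta1 * beta2 = 2 + 0.5 * 3
```

The slope from the short regression absorbs the effect of the omitted X2, exactly by the amount δ̃₁ · β₂ described above.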

Examining the Variance-Aspect:

 We are now looking at the variance


 We know that the variance of β̂₁ in Model 1 is:

var(β̂₁) = σ² / [SST_X1 · (1 − R²_X1)]

 It is the variance of the error divided by the Total Sum of Squares of the relevant
variable for β̂₁, which is X1, times (1 − R²_X1), where R²_X1 is the R-squared from
regressing X1 on X2.

Note & remember: R² is a measure of how closely the data fit the regression line

 Now that we looked at Model 1, we need to look at the variance of β̃₁ in Model 2:

var(β̃₁) = σ² / SST_X1

Comparing these two variances:

Case 1:

 If X1 and X2 are uncorrelated, then R²_X1 is equal to 0:

corr(X1, X2) = 0, then R²_X1 = 0, and then → var(β̂₁) = var(β̃₁)

o When this is the case, the two variances are the same!

Case 2:

 If X1 and X2 are correlated, then our R squared of X1 won’t be equal to 0

~
corr ( X 1 , X 2) ≠ 0 , then R2X 1 ≠ 0∧then→ var ( β^ 1 ) > var ( β 1 )

~
 Here, the variance of β 1 will be smaller than the variance of ^β 1 as we will always have
a smaller number in the denominator
 This is because with ^β 1, we always multiply the Total Sum of Squares with (1−R 2X 1) ,
which is smaller than 1 if they are correlated
~
o Therefore, with ^β 1 from Model 1, the SST is a smaller number than β 1 from
Model 2.
~
o And our variance of ^β 1 will be greater than our variance of β 1
o  this is always the case in this circumstance
Bringing these results together:

Case 1  We choose Model 1

 When we have a Model 1 where β₂ ≠ 0 (in this case, β₂ has explanatory power in
relation to the outcome), then β̃₁ from Model 2 is biased!
o Still, our β̂₁ is unbiased.
 At the same time, the variance of β̃₁ < the variance of β̂₁ (or, as we said above,
var(β̂₁) > var(β̃₁)).
 Therefore, we choose Model 1.
 It is the only model that leads us to a coefficient which is unbiased.

Case 2  Choosing Model 2


~
 When we have a model 1 where β 2=0 , then we know that β 1 from Model 2 is
unbiased (here, we do not have an omitted variable bias).
~
 Then, ^β 1 is also unbiased with variance of β 1 < variance of ^β 1
 Therefore, two coefficients are unbiased
 And the variance of ^β 1 is smaller.  we gain more precision in our estimator of β 1
 More precision. Less Bias. Both in Model 2.  therefore, we choose this one.

 We know that uᵢ is distributed normally with a mean of 0 and a variance of σ².


 Just as last week, it can be incredibly useful to write out the squared term of our
summation:
o Tip: remember the binomial formulas we discussed!

E[∑ᵢ₌₁ⁿ (uᵢ − ū)²] = E[∑ᵢ₌₁ⁿ (uᵢ² − 2·uᵢ·ū + ū²)]

 Once more, to work with the individual terms and isolate the various parts of the
equation, it can be easier to re-write it with the summations inside of our bracket:

E[∑ᵢ₌₁ⁿ uᵢ² − 2·ū·∑ᵢ₌₁ⁿ uᵢ + ∑ᵢ₌₁ⁿ ū²]
 Let’s have a look at the middle term of the equation. We can see that part of it
is very similar to the formula for an average. Remember:

ū = (1/n) ∑ᵢ₌₁ⁿ uᵢ

 What we do not have in this middle term, however, is the factor 1/n. Therefore, if we
want to replace the summation of uᵢ by ū, we also need to multiply the term by n to
compensate for this step.
o We do this so that we can simplify the equation with a squared ū:

E[∑ᵢ₌₁ⁿ uᵢ² − 2·n·ū² + ∑ᵢ₌₁ⁿ ū²]

 We can now look at the last term of the equation.
o Tip: remember that when a term carries a small subscript i, it means that we
are iterating over observations. However, when it carries a bar, it is a
constant factor.
 Given that ū² does not depend on i, we can take it outside of the summation, leaving
us with ∑ᵢ₌₁ⁿ 1. And again, when we have a case like this, the summation simply
equals n. Therefore, we can simplify the equation:

E[∑ᵢ₌₁ⁿ uᵢ² − 2·n·ū² + n·ū²], which is:

E[∑ᵢ₌₁ⁿ uᵢ² − 2nū² + nū²]
 Now, we have a simple calculation of −2 + 1:

E[∑ᵢ₌₁ⁿ uᵢ² − n·ū²]

 In order to continue, we can make use of the properties of the expected value, where
the expected value of a sum of terms is the sum of the expected values of each of the
elements:

∑ᵢ₌₁ⁿ E(uᵢ²) − n·E(ū²)
 In a final step, we need to make a small leap in order to solve the equation further.
 Looking back at the lectures, we know the following:

var(uᵢ) = E(uᵢ²) − [E(uᵢ)]²

 We can re-arrange this formula to simplify our equation above.
 If we simply take [E(uᵢ)]² to the other side of the equation, we get a term that can
replace E(uᵢ²), namely E(uᵢ²) = var(uᵢ) + [E(uᵢ)]² (and likewise for E(ū²)):

∑ᵢ₌₁ⁿ [var(uᵢ) + (E(uᵢ))²] − n·[var(ū) + (E(ū))²]

 All we have to do now is think back to what we know from the exercise:
o We know that uᵢ is distributed normally with a mean of 0 and a variance of σ².
o And we know that var(ū) is σ²/n.
 So in the next step, we simply fill in the terms in our equation:

∑ᵢ₌₁ⁿ [σ² + 0²] − n·[σ²/n + 0²]

= ∑ᵢ₌₁ⁿ σ² + 0 − n·(σ²/n) + 0

 We now have a sum of numbers that adds σ² up n times:

= n·σ² − n·(σ²/n) = σ²·(n − 1)

 Interpretation of the expected value:
o Given that we get σ² multiplied by (n − 1) rather than by n, we know that
(1/n)·∑ᵢ₌₁ⁿ (uᵢ − ū)² is not an unbiased estimator of σ²; dividing by (n − 1)
instead makes it unbiased.
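We can verify E[∑(uᵢ − ū)²] = σ²·(n − 1) with a small Monte Carlo experiment; a sketch in Python (σ² = 4 and n = 10 are our own example values):

```python
import numpy as np

rng = np.random.default_rng(3)
n, sigma2, reps = 10, 4.0, 200_000

# Draw many samples of size n and compute the sum of squared deviations each time
u = rng.normal(0.0, np.sqrt(sigma2), size=(reps, n))
ssd = ((u - u.mean(axis=1, keepdims=True)) ** 2).sum(axis=1)

# The average should be close to sigma^2 * (n - 1) = 4 * 9 = 36, not sigma^2 * n = 40
estimate = ssd.mean()
```

The simulated mean lands near 36 rather than 40, which is exactly the (n − 1) factor derived above.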
STATA Part: Exercise Number 4

Exercise 4 (i)

 We want to know what the value of the 25th Percentile is and what it means.
 The 25th percentile is often also called the First Quartile Q1
o Note: The Second Quartile Q2 is what we know as the Median and it is the 50th
percentile
 We can also picture it as the area under our distribution in which a certain
percentage lies  In this case, it marks the value of our distribution of wages under
which we can find 25% of the observations

Step 1: Load in our Data

Step 2:
 We can use the command “sum wage, detail”
 Alternatively, we can go into “Statistics  Other Tables  Compact table of
summary statistics” and fill in what kind of statistics we want
 Or: we can also use: centile wage, centile(25 75)
 We can now see that the 25th percentile is 3.33
 Below an hourly wage of 3.33, we find 25% of the observations
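The same quartile logic can be reproduced outside STATA. A sketch in Python with a small made-up wage vector (the numbers are illustrative only, not the tutorial’s data set):

```python
import numpy as np

# Hypothetical hourly wages, for illustration only
wages = np.array([2.5, 3.0, 3.1, 3.3, 4.0, 4.5, 5.0, 6.0, 7.5, 12.0])

q1 = np.percentile(wages, 25)       # first quartile (25th percentile)
median = np.percentile(wages, 50)   # second quartile = the median
```

Roughly a quarter of the observations lie at or below q1, mirroring what "sum wage, detail" reports in STATA.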

Exercise 4 (ii)

 We want to compute the 95% confidence interval for the mean of our wage variable
 Firstly: What is the 95% confidence interval?
o A 95% confidence interval for μY is a random variable that contains the true
value of μY in 95% of all possible random samples. Or, another definition:
o “For a given statistic calculated for a sample of observations (e.g. the mean), the
confidence interval is a range of values around that statistic that are believed
to contain, with a certain probability (e.g. 95%), the true value of the statistic
(i.e., the population value).”
 We can of course use STATA to calculate our CIs, but it is important to understand
the maths behind it as well.
 The 95% CI is calculated as follows:

lower boundary of CI = X̄ − (1.96 · Standard Error)

upper boundary of CI = X̄ + (1.96 · Standard Error)

Note. Definition of Standard Error: the standard deviation of the sampling distribution* of a
statistic. For a given statistic (e.g., the mean) it tells us how much variability there is in the
statistic across samples from the same population. Large values, therefore, indicate that a
statistic from a given sample may not be an accurate reflection of the population from which
the sample came.

Note & Reminder. Definition of Sampling Distribution: the probability distribution of a


statistic. We can think of this as follows: if we take a sample from a population and calculate
some statistic (e.g. the mean), the value of this statistic will depend on the sample we took.
As such, the statistic will vary slightly from sample to sample. If we took lots and lots of
samples from the population and calculated the statistic of interest, we could create a
frequency distribution of the values we got. The resulting distribution is what the sampling
distribution represents: the distribution of possible values of a given statistic that we could
expect to get from a given population

 We know that the critical value (z-Score) for 95% CI is 1.96


 Our mean is, as we saw in the table, 5.896
 The standard error is calculated by dividing the standard deviation by the square
root of the sample size (here, n = 526):

SE = 3.693 / √526 ≈ 0.161
 We can then do the calculations in STATA:

Alternatively, we can also go into Statistics  Summary Statistics  Confidence Intervals
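The same calculation can be done by hand, here sketched in Python with the summary statistics from the table above (mean 5.896, standard deviation 3.693, n = 526):

```python
import math

mean, sd, n = 5.896, 3.693, 526

se = sd / math.sqrt(n)      # standard error of the mean
lower = mean - 1.96 * se    # lower boundary of the 95% CI
upper = mean + 1.96 * se    # upper boundary of the 95% CI
# The interval is roughly [5.58, 6.21]
```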


Exercise 4 (iii)

 We are running a simple regression of wage on education to examine the predicted
effect of 4 more years of education on wages.
 The command is: regress wage educ
 It can be found under: Statistics  Linear Models & Related  Linear Regression
 We set our dependent variable as wage and our independent variable as education

Legend:
 SS – These are the Sums of Squares associated with the sources of variance: Total,
Model and Residual.
 df – Degrees of freedom. The total variation has N − 1 degrees of freedom. In this case,
there were 526 participants, and with two estimated parameters (intercept and slope)
the df for the residual is 526 − 2 = 524. The df for our model corresponds to the
number of predictors, here 1.
 MS – These are the Mean Squares, the Sum of Squares divided by their respective df.

 The coefficient indicates the change of our dependent variable wage with one extra
year of education: 0.54135.
 To answer the question, we simply have to multiply this by four: 4 · 0.54135 ≈ 2.165

 The answer to the question is: with 4 more years of education, a person from the
sample earns, on average, 2.165 pounds more per hour.
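The arithmetic behind the answer, as a one-line check (the coefficient is the one reported in the regression output above):

```python
coef_educ = 0.54135             # slope on educ from the STATA output
effect_4_years = 4 * coef_educ  # predicted wage difference for 4 extra years
# effect_4_years is about 2.165
```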
Exercise 4 (iv)

 We are doing & analysing a scatterplot!


 One important thing you can learn when trying to write code for any statistical
program (regardless of whether you use STATA, SPSS or R) is that it can be useful to
research your commands and play around with what you find.
 Here is an example for scatterplots:

https://stats.oarc.ucla.edu/stata/faq/how-can-i-do-a-scatterplot-with-regression-line-in-
stata/

 Alternatively, you can also go into Graphics  Twoway graphs (scatter, line etc.)

 Using the command console gives us more freedom to edit our graphs and to add the
regression line:
 graph twoway (scatter wage educ) (lfit wage educ), graphregion(color(white))

 What can we see from this plot? What would you say about the spread?
 We can see that for lower years of education, the observations are scattered closely
around the regression line, while at higher levels of education the spread around the
fitted values is much wider  this can also be confirmed when we look at:

Exercise 4 (v)

 Where we are going to plot the residuals against the variable of education
 To do this, we first have to use the following command:
o predict residual, residual
 predict gives us either the fitted values or the residuals; here, the
first “residual” tells STATA what we want to call the new variable,
and the residual option tells STATA that we want û
 And to draw the graph we use:
o twoway (scatter residual educ) (lfit residual educ)
 The command says that we want to do a scatterplot where we plot the residuals
against the years of education and add the linear fit.

 We can see that at low levels of education, the residuals are closer to 0
 However, when years of education increase, the spread around 0 gets wider and
wider
 The variation of the residuals increases with education
 This is a sign that the variance of the error term is not constant across values of
years of education. Therefore  we have heteroskedasticity.
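The pattern we see in the plot can be mimicked with simulated data: when the error variance grows with education, the residual spread widens too. A sketch in Python (all numbers are illustrative assumptions, not the tutorial’s data):

```python
import numpy as np

rng = np.random.default_rng(4)
n = 2000
educ = rng.uniform(0, 18, n)
u = rng.normal(0.0, 0.2 + 0.2 * educ)   # error sd grows with education
wage = 0.5 * educ + u

# Fit the simple regression and compute residuals, as with predict ..., residual
slope, intercept = np.polyfit(educ, wage, 1)
residual = wage - (intercept + slope * educ)

# Residual spread is visibly larger at high education than at low education
low_spread = residual[educ < 6].std()
high_spread = residual[educ > 12].std()
```

A formal check would use a heteroskedasticity test, but the widening residual spread is already visible in these two numbers.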
