HW-1

The document discusses various statistical concepts related to Ordinary Least Squares (OLS) estimation, including its assumptions, the differences between Sample and Population Regression Functions, and the implications of violating these assumptions. It also evaluates specific regression models and their applicability for estimation, along with hypothesis testing regarding coefficients. Additionally, it highlights the limitations of linear regression and the importance of considering confounding factors in causal interpretations.

TABLE OF EVALUATION

No.  Full name             Level of completion
1.   Le Ngoc Phương Khanh  100%
2.   Mai Linh              100%
3.   Doan My Huyen         100%
4.   Nguyen Tri Khang      100%

CHRIS BROOKS
1.
(a) The main goal of OLS estimation is to fit a line to the data by minimizing the sum of the squared vertical distances between each observation and the line; these vertical distances are the residuals. If we minimized horizontal distances instead, we would effectively be adjusting x to explain y, which contradicts the framework of the model: the regressors x are assumed to be fixed (non-stochastic), while y is the random variable being explained.

(b) Since some points lie above the line and others below, positive and negative residuals can cancel out when simply summed. Whenever the total distance of the points above the line equals the total distance of those below, the residuals sum to zero, so many different lines would appear to fit the data equally well. Squaring the residuals prevents this cancellation and guarantees a unique best-fitting line, with a simple closed-form solution for the coefficients.

(c) Squaring the residuals keeps the objective function smooth and differentiable, which makes the calculations far simpler than working with absolute values. Like the absolute value, squaring also prevents positive and negative residuals from cancelling each other out.
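To make the contrast concrete, here is a minimal sketch (assuming numpy and scipy are available; the data are invented for illustration) that fits a line by minimizing squared residuals, which has a closed-form solution, and by minimizing absolute residuals, which requires a numerical search:

```python
import numpy as np
from scipy.optimize import minimize

# Invented data for illustration: y depends linearly on x plus noise
rng = np.random.default_rng(0)
x = np.linspace(0, 10, 50)
y = 2.0 + 0.5 * x + rng.normal(0, 1, size=x.size)

# OLS: minimizing squared residuals has a closed-form solution
beta_hat = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
alpha_hat = y.mean() - beta_hat * x.mean()

# Least absolute deviations: no closed form, needs an iterative optimizer
def sum_abs_resid(params):
    a, b = params
    return np.abs(y - a - b * x).sum()

lad = minimize(sum_abs_resid, x0=[0.0, 0.0], method="Nelder-Mead")

print("OLS:", alpha_hat, beta_hat)
print("LAD:", lad.x)
```

The squared-error objective is solved in two lines of algebra, while the absolute-value objective needs an iterative search, which is the computational convenience referred to in (c).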

2.
Sample Regression Function (SRF): The SRF represents the relationship estimated from the sample observations. It is essentially the estimated counterpart of the Population Regression Function (PRF) and is expressed by the equation:

$\hat{y}_t = \hat{\alpha} + \hat{\beta} x_t$
In this equation:

● $\hat{\alpha}$ and $\hat{\beta}$ are the estimates of $\alpha$ and $\beta$, obtained from the sample data.

● $\hat{y}_t$ is the predicted value of $y$ based on the sample value $x_t$.

Population Regression Function (PRF): The PRF describes the actual relationship between the independent variable $x$ and the dependent variable $y$ for the entire population. It is typically represented by the equation:

$y_t = \alpha + \beta x_t + u_t$

Where:

● $\alpha$ is the intercept, representing the expected value of $y$ when $x = 0$.

● $\beta$ is the slope coefficient, indicating the change in $y$ for a one-unit increase in $x$.

● $u_t$ is the random error term, accounting for factors not explained by $x$.

Key Differences:

● The PRF describes the true relationship in the entire population, but since the population data are typically unobservable, the PRF is generally unknown.

● The SRF is derived from sample data and serves as an approximation of the PRF. It decomposes each observed value $y_t$ into two parts:

$y_t = \hat{y}_t + \hat{u}_t$

Where:
o $\hat{y}_t$: the predicted value of $y_t$.
o $\hat{u}_t$: the residual, representing the difference between the observed value and the predicted value.
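A short simulation (a sketch, assuming numpy; the true values α = 1 and β = 2 are chosen arbitrarily) makes the distinction concrete: the PRF is fixed but unknown in practice, while every sample yields a slightly different SRF:

```python
import numpy as np

rng = np.random.default_rng(42)
alpha, beta = 1.0, 2.0              # PRF parameters (unknown in practice)

for sample in range(3):
    x = rng.uniform(0, 10, size=30)
    u = rng.normal(0, 2, size=30)   # disturbances u_t
    y = alpha + beta * x + u        # PRF: y_t = alpha + beta * x_t + u_t

    # SRF: estimate alpha-hat and beta-hat from this sample only
    beta_hat = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
    alpha_hat = y.mean() - beta_hat * x.mean()
    print(f"sample {sample}: alpha_hat = {alpha_hat:.3f}, beta_hat = {beta_hat:.3f}")
```

Each draw produces different estimates scattered around the true parameters, which is precisely the sense in which the SRF approximates the PRF.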

3.
An estimator is a formula used to compute parameter estimates, which describe the relationship between the dependent variable and one or more explanatory variables. There are many different types of estimators, but the OLS estimator is one of the most commonly used. OLS is considered "optimal" because it has the lowest variance among linear, unbiased estimators, meaning no other linear and unbiased estimator has a smaller sampling variance. An estimator with a lower sampling variance than OLS can be constructed, but it would have to be non-linear, biased, or both. Choosing an estimator therefore always involves a trade-off between bias and variance.

The OLS estimator is optimal only when the assumptions of the classical linear regression model are satisfied. These assumptions include linearity, no endogeneity, and homoscedasticity (the error terms have constant variance). Under these conditions, the Gauss-Markov theorem states that OLS is the Best Linear Unbiased Estimator (BLUE).

However, if these assumptions are violated, the consequences depend on which assumption fails. Under heteroscedasticity, OLS still produces unbiased estimates but loses efficiency, meaning it no longer has the smallest variance; under endogeneity, OLS becomes biased and inconsistent. In such situations, alternative estimators, such as Generalized Least Squares (GLS) for heteroscedasticity or Instrumental Variables (IV) for endogeneity, may perform better by addressing these violations. Thus, while OLS is optimal when the classical assumptions hold, it may not be the best choice if those assumptions are breached.
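To illustrate, the Monte Carlo sketch below (assuming numpy; the variance pattern Var(u_t) ∝ x_t², chosen because it admits a simple GLS transformation, is invented for the example) shows OLS staying unbiased under heteroscedasticity while a GLS-style weighted estimator achieves a smaller sampling variance:

```python
import numpy as np

rng = np.random.default_rng(1)
alpha, beta = 1.0, 2.0              # invented true parameters
x = np.linspace(1, 10, 100)

ols_betas, gls_betas = [], []
for _ in range(2000):
    u = rng.normal(0, 1, x.size) * x    # heteroscedastic: Var(u_t) = x_t**2
    y = alpha + beta * x + u

    # OLS slope estimate
    b_ols = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)

    # GLS here: divide through by x_t so the transformed errors are homoscedastic;
    # then y/x = alpha * (1/x) + beta, estimated by least squares on transformed data
    X = np.column_stack([1 / x, np.ones_like(x)])
    b_gls = np.linalg.lstsq(X, y / x, rcond=None)[0][1]

    ols_betas.append(b_ols)
    gls_betas.append(b_gls)

print("OLS: mean %.3f, sd %.4f" % (np.mean(ols_betas), np.std(ols_betas)))
print("GLS: mean %.3f, sd %.4f" % (np.mean(gls_betas), np.std(gls_betas)))
```

Both sampling means sit near the true β = 2 (unbiasedness), but the GLS standard deviation is noticeably smaller (efficiency).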

4.
Five assumptions are usually made about the unobservable error terms in the CLRM:

● $E(u_t) = 0$ → The error terms have an expected value of zero.

● $\mathrm{Var}(u_t) = \sigma^2$ → The variance of the error term is constant and finite across all values of $x_t$.

● $\mathrm{Cov}(u_i, u_j) = 0$ for $i \neq j$ → The errors are linearly independent of one another.

● $\mathrm{Cov}(u_t, x_t) = 0$ → The error term is uncorrelated with the corresponding $x_t$.

● $u_t \sim N(0, \sigma^2)$ → The error term $u_t$ follows a normal distribution.


Box 3.3 of the book lists the assumptions made about the disturbance terms of the classical linear regression model. The first four assumptions are required to show that the OLS estimators of α and β are the "best", that is, that they have the minimum variance among the class of linear unbiased estimators. The Gauss-Markov theorem establishes that, provided these assumptions are fulfilled, the OLS estimators are BLUE. If the assumptions are violated, a topic covered in Chapter 4, the OLS estimators may lose their unbiasedness and efficiency; in other words, they may be inaccurate or vary considerably from sample to sample. The fifth assumption, that the disturbances follow a normal distribution, is necessary for drawing statistical inferences about the population parameters from the sample data. It allows hypothesis testing of the coefficients, ensuring that the test statistics follow a t-distribution when all other conditions are met.

Conclusion: the five assumptions on the CLRM's error terms ensure that the OLS estimators are BLUE, as established by the Gauss-Markov theorem. The first four assumptions guarantee unbiasedness and efficiency, while the fifth, normally distributed errors, enables valid hypothesis testing using the t-distribution. Violations of these assumptions can compromise the reliability of the OLS estimators.
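As a check on the role of the normality assumption (a sketch, assuming numpy and scipy; the sample size and parameter values are arbitrary), the simulation below draws many samples with normal disturbances and verifies that the standardized slope estimate behaves like a t statistic with T − 2 degrees of freedom:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(7)
alpha, beta, sigma, T = 1.0, 2.0, 1.5, 30   # arbitrary illustration values
x = rng.uniform(0, 10, T)
sxx = ((x - x.mean()) ** 2).sum()

t_stats = []
for _ in range(5000):
    y = alpha + beta * x + rng.normal(0, sigma, T)   # normal disturbances
    b = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
    a = y.mean() - b * x.mean()
    resid = y - a - b * x
    s2 = (resid ** 2).sum() / (T - 2)    # unbiased estimate of sigma**2
    se_b = np.sqrt(s2 / sxx)             # standard error of beta-hat
    t_stats.append((b - beta) / se_b)    # should follow t(T - 2)

# The empirical tail probability should match the nominal 5% level
crit = stats.t.ppf(0.95, df=T - 2)
print("empirical P(t > crit):", np.mean(np.array(t_stats) > crit))
```

The empirical rejection rate comes out close to the nominal 5%, which is exactly what makes t-based inference on the coefficients valid under normality.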

5.

Evaluation:

1. Model (3.39):

$y_t = \alpha + \beta x_t + u_t$

This is linear in the parameters ($\alpha$, $\beta$), so it can be estimated using OLS.

2. Model (3.40):

$y_t = e^{\alpha} x_t^{\beta} e^{u_t}$

This is nonlinear in its original form. However, taking the natural log of both sides gives

$\ln(y_t) = \alpha + \beta \ln(x_t) + u_t$

which is linear in the parameters and can be estimated using OLS.
3. Model (3.41):

$y_t = \alpha + \beta \gamma x_t + u_t$

This is nonlinear in the parameters ($\beta$ and $\gamma$ enter only as the product $\beta\gamma$, so they cannot be separately identified), and therefore it cannot be estimated using OLS.

4. Model (3.42):

$\ln(y_t) = \alpha + \beta \ln(x_t) + u_t$

This is already linear in the parameters ($\alpha$, $\beta$), so it can be estimated using OLS.

5. Model (3.43):

$y_t = \alpha + \beta_1 x_t + \beta_2 z_t + u_t$

This is linear in the parameters ($\alpha$, $\beta_1$, $\beta_2$), so it can be estimated using OLS.

Conclusion:

Models (3.39), (3.40) (after transformation), (3.42), and (3.43) can be estimated using
OLS. Model (3.41) cannot.
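As a quick check on the transformation used for model (3.40) (a sketch, assuming numpy; the parameter values are invented), data generated from the multiplicative model can be estimated by OLS after taking logs:

```python
import numpy as np

rng = np.random.default_rng(3)
alpha, beta = 0.5, 1.2                      # invented true parameters

x = rng.uniform(1, 20, 200)
u = rng.normal(0, 0.1, 200)
y = np.exp(alpha) * x ** beta * np.exp(u)   # model (3.40): y = e^a * x^b * e^u

# After taking logs the model is linear: ln(y) = alpha + beta * ln(x) + u
ly, lx = np.log(y), np.log(x)
beta_hat = np.cov(lx, ly, ddof=1)[0, 1] / np.var(lx, ddof=1)
alpha_hat = ly.mean() - beta_hat * lx.mean()
print(f"alpha_hat = {alpha_hat:.3f}, beta_hat = {beta_hat:.3f}")  # ~0.5, ~1.2
```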

6.
The null hypothesis states that β = 1, so:

H0: β = 1

H1: β > 1

Test statistic $= \dfrac{\hat{\beta} - \beta^{*}}{SE(\hat{\beta})} = \dfrac{1.147 - 1}{0.0548} = 2.682$

The sample size is T = 62 and the model has two parameters (α, β). For a one-tailed test at the 5% significance level with T − 2 = 60 degrees of freedom, the t-distribution table gives a critical value of 1.671.

Since the calculated test statistic 2.682 > 1.671, it falls in the 5% rejection region and the null hypothesis must be rejected. We can therefore say that the stock is riskier than the market, and the analyst's claim is not empirically supported.
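The same test is easy to reproduce in code (a minimal sketch, assuming scipy; the estimate 1.147 and standard error 0.0548 come from the question):

```python
from scipy import stats

beta_hat, beta_null, se = 1.147, 1.0, 0.0548   # values given in the question
T, k = 62, 2                                   # sample size, number of parameters

t_stat = (beta_hat - beta_null) / se           # about 2.68
crit = stats.t.ppf(0.95, df=T - k)             # one-tailed 5% critical value, ~1.671
p_value = 1 - stats.t.cdf(t_stat, df=T - k)

print(f"t = {t_stat:.3f}, critical = {crit:.3f}, p = {p_value:.4f}")
print("reject H0" if t_stat > crit else "fail to reject H0")
```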

9.

We test hypotheses about the actual coefficients rather than the estimated values. This is because the objective is to draw inferences about the underlying population parameters and their likely values. There is no need to test hypotheses about the estimated values: we know exactly what our estimates are, because we calculated them.

Conclusion: in order to draw inferences about the population parameters, we test hypotheses about the actual coefficients (β). Since the estimated values are already known from the calculations, there is no need to test them.
WOOLDRIDGE
4.
(i) Using the regression equation $\widehat{bwght} = 119.77 - 0.514 \times cigs$:

- When cigs = 0, the predicted birth weight is 119.77 ounces.

- When cigs = 20, the predicted birth weight is $119.77 - 0.514 \times 20 = 109.49$ ounces.

The difference is 10.28 ounces, indicating that smoking 20 cigarettes per day during pregnancy is associated with a reduction of approximately 8.6% in predicted birth weight.

(ii) Not necessarily. The regression shows a statistical association between smoking
and birth weight, but it does not confirm causation. Other factors, such as the mother's
overall health, nutrition, access to prenatal care, or exposure to other harmful
substances, might influence birth weight and could also correlate with smoking.
Without controlling for these potential confounders, we cannot conclude that smoking
directly causes a decrease in birth weight.

(iii) Solving the equation $125 = 119.77 - 0.514 \times cigs$:

$cigs = \dfrac{119.77 - 125}{0.514} \approx -10.18$

This result suggests that for a predicted birth weight of 125 ounces, the mother would
need to smoke a negative number of cigarettes, which is nonsensical. This highlights a
limitation of simple linear regression: predictions outside the observed data range
(extrapolation) may lead to unrealistic results. Additionally, the regression model
suggests the maximum predicted birth weight is 119.77 ounces, as this corresponds to
zero cigarette consumption.
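A tiny sketch (using only the estimated equation from the question) reproduces the predictions from part (i) and the nonsensical inversion from part (iii):

```python
# Estimated equation from the question: bwght-hat = 119.77 - 0.514 * cigs
b0, b1 = 119.77, -0.514

def bwght_hat(cigs):
    return b0 + b1 * cigs

print(bwght_hat(0))     # 119.77 ounces
print(bwght_hat(20))    # 109.49 ounces

# Inverting the equation for a target birth weight of 125 ounces
cigs_needed = (125 - b0) / b1
print(cigs_needed)      # about -10.18: a negative cigarette count, i.e. nonsense
```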

(iv) The fact that about 85% of women in the sample do not smoke while pregnant implies that most observations sit at cigs = 0, where the predicted birth weight reaches its maximum of 119.77 ounces. The model therefore cannot predict birth weights above this value, which limits its ability to explain heavier births. The sample composition thus reinforces why predicting a birth weight of 125 ounces is not realistic within this framework.
5.
(i) According to the intercept, cons is predicted to be −$124.84 when inc = 0. This is obviously impossible and illustrates that the consumption function may not be a reliable predictor of consumption at very low income levels. On an annual basis, however, −$124.84 is not far from zero.

(ii) Simply substitute inc = 30,000: $\widehat{cons} = -124.84 + 0.853 \times 30{,}000 = 25{,}465.16$ dollars.
(iii) The graph below displays the MPC and the APC. The MPC, the slope of the regression, is constant at 0.853, while the APC = cons/inc rises with income toward the MPC. Despite the negative intercept, the smallest APC observed in the sample remains positive. The graph begins at an annual income level of $1,000 (in 1970 dollars).
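Since the original graph is not reproduced here, a short sketch (assuming numpy and matplotlib) regenerates it from the estimated equation:

```python
import numpy as np
import matplotlib.pyplot as plt

inc = np.linspace(1000, 30000, 200)     # start at $1,000 as in the text
cons = -124.84 + 0.853 * inc            # estimated consumption function

mpc = np.full_like(inc, 0.853)          # slope: marginal propensity to consume
apc = cons / inc                        # average propensity to consume

plt.plot(inc, mpc, label="MPC = 0.853")
plt.plot(inc, apc, label="APC = cons/inc")
plt.xlabel("annual income (1970 dollars)")
plt.legend()
plt.show()
```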

6.

(i) The coefficient on log(dist) is 0.312, which implies that a 1% increase in the distance from the incinerator is associated with a 0.312% increase in the price of the house, holding all else constant. This positive relationship indicates that the further a house is from the incinerator, the higher its price tends to be.

(ii) The simple regression may not provide an unbiased estimator of the ceteris paribus elasticity of price with respect to distance from the incinerator. The placement of the incinerator could be correlated with other unobserved factors that influence housing prices, such as socioeconomic characteristics of the neighborhood, zoning laws, or city planning decisions. For instance, if the incinerator was deliberately sited away from more expensive areas, the estimated elasticity would partly reflect neighborhood quality rather than the effect of the incinerator itself.

(iii) Many other factors could affect a house's price, including:

● Size of the house (square footage): Larger houses generally sell for more.
● Age of the house: Older houses might be less expensive due to wear and tear,
unless they are historic or have unique features.
● Neighborhood characteristics: Proximity to schools, parks, shopping areas,
and public transportation.
● House condition: Well-maintained homes are more expensive.
● Local amenities: Features like swimming pools, garages, and modern
appliances can add value.
