Assignment 3-121020307
Assignment 3-121020307
INSTRUCTIONS:
• Please label clearly each answer with the appropriate question number and letter. Securely
staple all answer sheets together, and make certain that your name(s) and student number(s)
are printed clearly at the top of each answer sheet.
• Please use STATA to do Question 1, and report your STATA commands and results
together with your answers to the questions.
• Hand-written answers must be legible. Illegible assignments will be returned unmarked.
• Please combine your answers with supporting documents into one Adobe PDF file and
submit.
MARKING: Marks for each question are indicated in parentheses. Total marks for the assignment
equal 140. Marks are given for both content and presentation.
Question 1 (50 marks)
Data Description: A random sample of 935 employees drawn from the 1980 U.S. population of
all employed paid workers.
Variable Definitions:
𝑤𝑎𝑔𝑒𝑖 : average monthly earnings of worker 𝑖 in 1980, in dollars.
𝑒𝑑𝑢𝑐𝑖 : years of formal education completed by worker 𝑖, in years.
𝑡𝑒𝑛𝑢𝑟𝑒𝑖 : years of worker 𝑖 with the current employer, in years.
𝑚𝑎𝑟𝑟𝑖𝑒𝑑𝑖 : an indicator variable equal to 1 if worker 𝑖 is married, and 0 if worker 𝑖 is single.
𝑏𝑙𝑎𝑐𝑘𝑖 : an indicator variable equal to 1 if worker 𝑖 is black, and 0 if worker 𝑖 is nonblack.
𝑠𝑜𝑢𝑡ℎ𝑖 : an indicator variable equal to 1 if worker 𝑖 lives in south, and 0 otherwise.
𝑢𝑟𝑏𝑎𝑛𝑖 : an indicator variable equal to 1 if worker 𝑖 lives in metropolitan area, and 0 otherwise.
Compute and present OLS estimates of the following population regression equation for the full
sample of 935 paid workers:
where 𝑢𝑖 is a random error term that is assumed to satisfy all the assumptions of the classical linear
regression model.
(10 marks)
(a) What is the approximate difference in monthly salary between blacks and nonblacks? Is this
difference statistically significant?
(10 marks)
(b) Add the variables 𝑒𝑥𝑝𝑒𝑟 2 and 𝑡𝑒𝑛𝑢𝑟𝑒 2 to the equation and show that they are jointly
insignificant at even the 20% level.
(10 marks)
(c) Extend the original model to allow the return to education to depend on race (black or nonblack)
and test whether the return to education does depend on race.
(10 marks)
(d) Again, start with the original model (1), but now allow wages to differ across four groups of
people: married and black, married and nonblack, single and black, and single and nonblack. What
is the estimated wage differential between married blacks and married nonblacks? Is this difference
statistically significant?
(10 marks)
(e) Start with the original model (1), use the Chow Test to test whether there is any difference in
regression models between blacks and nonblacks
Question 2 (54 marks)
You are conducting an econometric investigation into the hourly wage rates of female and male
employees. The sample data consist of observations for 526 employees on the following variables:
𝑤𝑎𝑔𝑒𝑖 : average hourly earnings of employee 𝑖, in dollars per hour.
𝑒𝑑𝑖 : years of formal education completed by employee 𝑖, in years.
𝑒𝑥𝑝𝑖 : the number of years of work experience accumulated by employee 𝑖.
𝑓𝑒𝑖 : an indicator variable equal to 1 if employee 𝑖 is female, and 0 if employee 𝑖 is male.
The regression model you propose to use is the log-level regression equation
log(𝑤𝑎𝑔𝑒𝑖 ) = 𝛽0 + 𝛽1 𝑒𝑑𝑖 + 𝛽2 𝑒𝑥𝑝𝑖 + 𝛽3 𝑒𝑑𝑖2 + 𝛽4 𝑒𝑥𝑝𝑖2 + 𝛽5 𝑒𝑑𝑖 𝑒𝑥𝑝𝑖 + 𝛽6 𝑓𝑒𝑖 + 𝛽7 𝑓𝑒𝑖 𝑒𝑑𝑖 +
+𝛽8 𝑓𝑒𝑖 𝑒𝑥𝑝𝑖 + 𝛽9 𝑓𝑒𝑖 𝑒𝑑𝑖2 + 𝛽10 𝑓𝑒𝑖 𝑒𝑥𝑝𝑖2 + 𝑢𝑖 (1)
Using the sample data described above, your research assistant computes OLS estimates of
regression equation (1) and of three restricted versions of equation (1). For each of the sample
regression equations estimated on the sample of n = 526 observations, the following table contains
the OLS coefficient estimates (with estimated standard errors in parentheses below the coefficient
estimates) and the summary statistics SSR (residual sum-of-squares), SST (total sum-of-squares),
and number of sample observations.
(10 marks)
(a) Compare the goodness-of-fit to the sample data of the four sample regression equations (1),
(2), (3) and (4) in the table. Calculate the value of an appropriate goodness-of-fit measure for each
of the sample regression equations (1), (2), (3) and (4) in the table. Which of the four sample
regression equations provides the best fit to the sample data?
(10 marks)
(b) Use the estimation results for regression equation (3) in the above table to perform a test of the
proposition that female employees of any given education and experience have lower average
wage than male employees of the same education and experience. Perform the test at the 5 percent
significance level (i.e., for significance level 𝛼 = 0.05). State the null hypothesis and the
alternative hypothesis. Show how you calculate the required test statistic. State the decision rule
you use, and the inference you would draw from the test.
(10 marks)
(c) State the coefficient restrictions that regression equation (2) in the table imposes on regression
equation (1). Explain in words what the restrictions mean. Use the estimation results given in the
table to perform a test of these coefficient restrictions at the 5 percent significance level (i.e., for
significance level 𝛼 = 0.05). State the null and alternative hypotheses, show how you calculate the
required test statistic. State the decision rule you use, and the inference you would draw from the
test. Would your inference be the same at the 1 percent significance level (i.e., for significance
level 𝛼 = 0.01)? Based on the outcome of the test, which of the two regression equations would
you choose, equation (1) or equation (2)?
(12 marks)
(d) Write the expression (or formula) for the marginal effect of 𝑒𝑥𝑝𝑖 on log (𝑤𝑎𝑔𝑒𝑖 ) for male
employees implied by regression equation (1). Use regression equation (1) to compute a test of the
proposition that the marginal effect of 𝒆𝒙𝒑𝒊 on 𝐥𝐨𝐠 (𝒘𝒂𝒈𝒆𝒊 ) for male employees is equal to
zero for male employees with any given values of 𝑒𝑑𝑖 and 𝑒𝑥𝑝𝑖 . State the coefficient restrictions
on regression equation (1) implied by this proposition; that is, state the null hypothesis and the
alternative hypothesis. Write the restricted regression equation implied by the null hypothesis.
OLS estimation of this restricted regression equation yields a Residual Sum-of-Squares value of
SSR = 5086.07. Use this information, together with the results from OLS estimation of equation
(1), to calculate the required test statistic. State the decision rule you use, and the inference you
would draw from the test. Choose an appropriate significance level for the test.
(12 marks)
(e) Write the expression (or formula) for the marginal effect of 𝑒𝑑𝑖 on log (𝑤𝑎𝑔𝑒𝑖 ) for female
employees implied by regression equation (1). Use regression equation (1) to compute a test of the
proposition that the marginal effect of 𝒆𝒅𝒊 on 𝐥𝐨𝐠 (𝒘𝒂𝒈𝒆𝒊 ) for female employees is constant
for female employees with any given values of 𝑒𝑑𝑖 and 𝑒𝑥𝑝𝑖 . State the coefficient restrictions on
regression equation (1) implied by this proposition; that is, state the null hypothesis and the
alternative hypothesis. Write the restricted regression equation implied by the null hypothesis.
OLS estimation of this restricted regression equation yields a Residual Sum-of-Squares value of
SSR =4348.85. Use this information, together with the results from OLS estimation of equation
(1), to calculate the required test statistic. State the decision rule you use, and the inference you
would draw from the test. Choose an appropriate significance level for the test.
0,263 the
, Interaction between education & race is States cake
⑤
On average the month salary of married blacks
,
is
approximately
1795% lower then that for married non -
blacks The
. difference Is statically
significant
[ 127,158 -
(13,055-1 108/875) / (61-1)
f- =
=
5.6414
(1%055-11081875) / [935-216+11]
Critical value at 5% Significance Kool Is Coos =
201
wage models
②⑨ EACH
1-418%31/515 0,4041
§
=
pzz =
7160411525 bell
Ega) Provide the
6914 fit while Gala) Pronto
,
=
' -
¥É¥% =
*•
*•
E-9137
=
1-
¥%?,¥÷ = 03739
8914)
RT = 1- 49%1%4,1%-5=03029
⑤ Equation(3) → log lwagei )
ftp.yexp?tBsediekPitRofeitUiGl oglwagei)1edi,expi,fei
=
Both edit Bzeapi + Red
-41 -
E- ( loglwagei ) / edi ,
expi , fei=o / =
Bo
Nall hypothesis Ho :P . =o for Po 20 )
HRH
-_Ñ% =?;;→= -
1- it
Degree of freedom =
D- 7--526-7=519
Decision rate : let tail f- test . Compare sample t statistic with 2 tend critical
I ,
1ft ≥ -
Ca , do not refer to of significance lend 2
2. If + < -
Cd
, reject Ho at the significance here/ &
Inference level 5%
since -1=-717382
-
1645 = -
\
⑥ Equation b) Imposes on Gambon Uthe exclusion restriction PEO Rio
Test ! Ho ! D8 -0 Bio -0
,
Hi ! Ho is not true
Interpretation of Ho : The marginal log lwagd effect of expi is equal lidenticall formale &female
workers
F statistics
14428,28-418%31112
F =
%{%%!%→ =
4185,31 1$26 -
to y
-
= 1495
Decision Rate
if F KG , do not roofed Ho at significance
level 12--005) is loos = 3.00
1 ,
Inference
Since 7--14,957300 =
Coos , retell to at 5% Significance kod
Since F =
149574161 =
Cool , reject Ho at 190 Significance lad
/
2-tlloskggei.gl?diiexPiifei-#--pz-zPaexPitPsedo
Ted
Pz Rs
'
Ho ! =
O ,
Ba = 0 20
,
Hi ! Ho ISnot the
F-Statistics
(508607-4185.31)/3
F
1%11%1%1,1 =
4185131 / 1526-10-1 ,
=
36195
Decision Rule
If F ≤ On donot total Hoot significance kid
✗
It
,
Inference
since F =
36,957260 =
Ciaos petal Ho, we the 5% Significance herd
,
2 edi
Test :
Ho ? the Pg -0 ,
Rs -0
true
Hi 1 Ho is not
F-Statistics
= (ssRr-SSRar =
(4348185-4185131)/2--10,06
SSRur / Ch -
k -7
4185,31 / (526-10-1)
Decision Mk
Inference
since 7--10106 23100 -
Coos , Med Ho at the 5% Significance had
& T.21-ii-PT-P.si/8i--0
¥ ( Yi
,
-
poi B) ⑦-
which implies
pi
Ñ=¥¥;i
-
Furthermore ,
pi ¥ ? ti k too Isi
=
-
=
# {✗ i -
p? ¥
Substituting for PT ,
we get
¥ §÷ Yi ¥ / ¥ Yi )
+ Iti
N-¥ ±§
-
A ↓ Eti Ii
- =
=
, , Si-0
-
,
,
Si-4
=
'
-2 Yi -
¥ ¥ .
Yi
Wi N Si
? ↓ -2g ¥ Esri Yi
binding be
NjM- get P
Yi -
=
,
we , , ,
⑥
pi = ±? ti
- -
¥ I #¥ ,
Yi -
¥¥ ,
◦
Yi )
1T¥ % E"
'
=
¥ / E. ,
'
" +
§
"
) -
,
Yi +
=
¥ E. Yi , ◦