Stats Mid-Term Exam
Stats Mid-Term Exam
Advanccd Statistics - L3
Note:
" For hypothesis testing, you should (i) write the hypotheses using proper mathematical notation, (ii) show all
steps in your t-statistic calculation, (iii) clearly state your conclusion
" Critical valuesare 1.645 for 10% significance level, 1.96 for 5% significance level and 2.576 for 1% significance
level
1. (1 point) One of the city politician suggests you should study whether, among the 5,000 residents, those
who have a gym membership are exercising more. Write down (i) a regression specification to estimate the
politician's suggestion, using appropriate variables, o. 31. and the error term u, (ii) the hypothesis the
politician wishes to test.
2. (3 points) The city administration implements the politician's suggestion and releases the following results:
erercise_hours = 2.34 + 1.2 gym_membership
(0.32) (0.6) 2,34
n=5,000, R = 0.23 O,32
(Standard errors in parent heses)
1
(n) Test 1he mall lypothesis that gym nenership has no effect on exercise hours against the alternative
hypothesis that it lhas a statistically significant cfect at the 10%, 5% and 1% significance levels.
-(b) Provide the fornmla to find the 95% confidence interval for the gym membership cocfficient.
(e) Do youthink that 1.2 correspond to the causal effect of gym nembership on hours of exercise? Justify
your answer using two different examples of your choice.
8. (3 points) The city decides to evaluate the program using a randomized controlled trial;
(a) What is a randomizcd controlled trial? What are balance checks useda for? to chedk whelhen a test
(ealy tamdsm
-(b) The validity of a randomized controlled trial is often cvaluated using the concepts of internal and external
validity. Recall the definition of these two concepts.
- (c) List two clements that could threaten the internal validity of this study.
Problem 2: Education and WVage Analysis (7 points)
We cstimate two models in order to explain the annual wage. We have the following variables: wage designates
the annual wage, education the number of years of education, ezperience the number of years of work experience,
parentincome the income of the parents of the individual, female a binary variable equal to 1 if the individual is
a woman.
Model 1:
log(wage) = 9.45 +0.16 education
(0.15) (0.02)
n=1,000, R = 0.21
Model 2:
log(wage) = 9.12 +0.08 education + 0.03 ezperience + 0.05 log(parentincome) - 0.2 female
(0.14) (0.02) (0.01) (0.01) (0.04)
n= 1,000, R²= 0.34