0% found this document useful (0 votes)

17 views13 pages

2021 Quiz2 Problems

Uploaded by

clkramer

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views13 pages

2021 Quiz2 Problems

Uploaded by

clkramer

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

MS&E 125 Quiz 2, 2021

Name:

SUNet ID: @stanford.edu

C-17-80752

1
Instructions

• You have 24 hours to complete this exam. Responses are due by Thursday, March 18 at
11:59pm Pacific Time.
• Upon completion of the quiz, you should submit your answers using this Google form:
https://forms.gle/zYUcXYHXeG4nKiW87
• There are a total of 20 questions, 5 true/false questions and 15 multiple choice questions.
• Apart from the teaching stuff, you may not talk to or consult with anyone about this
exam. If any questions arise, please create a private post on Piazza and the teaching
staff will respond as soon as possible.
• You may use all resources available (books, notes, homework solutions, and general
Internet) for this exam. However, we recommend against using materials outside of this
course, such as searching the Internet, since it will more likely result in over-complication,
confusion, and a waste of time.
• Unless otherwise noted, each problem is self-contained.
• You will get zero points for any incorrectly answered question (no negative points).
• Each problem is identically worth 5 points, for a maximum total of 100, so don’t spend
too much time working on any single question.
• Good luck!

2
True/false questions

Problem 1.
When comparing two models M1 and M2 , if M1 has lower bias than M2 , it must have higher variance than
M2 .

Problem 2.
For a simple linear regression model, the regression line and the standard deviation line always intersect at
(X̄, Ȳ ), where X̄ is the sample mean of X and Ȳ is the sample mean of Y .

Problem 3.
Prediction error on the test set will generally be higher than that of the training set.

Problem 4.
The coefficient β1 in a logistic regression model Pr(Y = 1) = logit−1 (β0 + β1 X) can be interpreted as the
average increase in odds of the event Y = 1 taking place as X increases by one unit.

Problem 5.
Your friend Alice generates a dataset composed of covariates Xi where i = 1, ..., m, and one target variable Y .
Alice generated the data using the following formula: Y = β0∗ + β1∗ X1 + ... + βm
∗
Xm + , where β ∗ is a vector
of predefined coefficients, and ∼ N (0, σ ). Alice gives you β and the entire dataset, but she leaves out the
2 ∗

target variable Y . Using the data and β ∗ , you can perfectly reconstruct Y .

3
Multiple choice questions

Problem 6.
Consider a randomized controlled experiment. Let pa be the proportion of always-treats, pc the proportion of
compliers, and pn the proportion of never-treats in the population. In terms of pa , pn and pc , approximately
what proportion of the control group does not receive treatment?
(a) pa + pn
(b) pa + pc
(c) pn + pc
(d) pa + pc + pn

Problem 7.
Suppose we obtain the following results by running a (unregularized) linear regression:

Ŷ = 0.305 + 0.710X1 + 2.725X2 + 5.523X3

Which of the following models is most likely to be the result of fitting a regression using the same data and
model formula, but with an L2 (ridge) penalty?
(a) Ŷ = 9.176 + 0.021X1 + 0.623X2 + 0.138X3
(b) Ŷ = 3 + 7X1 + 27X2 + 55X3
(c) Ŷ = 8.456 + 0.000X1 + 0.000X2 + 0.918X3
(d) Ŷ = 0.305 + 0.710X1 + 2.725X2 + 5.523X3

4
Problem 8.
We fit a regression using the formula Y = β0 + β1 X1 + β2 X2 + β3 X3 + β4 X4 + ε and estimated the following
coefficients:

Coefficient Estimate
β0 1.8
β1 0
β2 1.5
β3 0
β4 0

Which of the following regression models is most likely to have been used?
(a) Linear regression
(b) L1 regularized linear regression (lasso)
(c) L2 regularized linear regression (ridge)
(d) Logistic regression

Problem 9.
People who participate in job training programs are more likely to be unemployed after completing the
program compared to those who do not participate. Based on this information, which of the following is
most reasonable?
(a) Job training programs decrease likelihood of employment
(b) People who are more likely to be employed are more likely to attend job training programs
(c) People who are less likely to be employed are more likely to attend job training programs
(d) People who are less likely to be employed are less likely to attend job training programs

5
Problem 10.
We are given a data frame named df with four columns:
• salary is a continuous variable.
• age is a factor variable with 5 levels.
• edu is a factor variable with 4 levels.
• experience is a continuous variable.
We then fit a linear regression in R using following code:
my_model <- lm(salary ~ 1 + age + edu + experience, data = df)

How many coefficients, including the intercept, will we have in my_model? (Assume there is no missing data
in the data frame.)
(a) 4
(b) 9
(c) 10
(d) 11

Problem 11.
Suppose you are given observations Y1 , . . . , Yn and covariates X1 , . . . , Xn . Suppose you run linear regression
with only an intercept term. What would be the resulting value of R2 ?
(a) 1
(b) 0.5
(c) 0
(d) 0.8

Problem 12.
A car manufacturer is trying to understand the correlation between the fuel efficiency and price of one of
their vehicles. Using a dataset consisting of prior year sales volume (sales) and fuel efficiency (mpg) as
well as horsepower (hp) for a range of different vehicles, they run the following regression in R and find the
subsequent output:
lm(formula = price ~ 1 + mpg + I(mpg^2) + mpg:hp, data = df)
...
Coefficients:
Estimate ...
(Intercept) 26690.0 ...
mpg 271.0 ...
I(mpg^2) 5.0 ...
mpg:hp 0.2 ...
...
Suppose the current fuel efficiency of the vehicle under consideration is 20 mpg, and its horsepower is 200.
Which of the following is correct?
(a) A one unit increase in mpg is associated with a 271 increase in price.
(b) A one unit increase in mpg is associated with a 276 increase in price.
(c) A one unit increase in mpg is associated with a 316 increase in price.

6
(d) A one unit increase in mpg is associated with a 516 increase in price.

7
Suppose we are interested in predicting loan defaults, which happens when a borrower fails to repay a loan.
In collaboration with a local bank, we collect the following information regarding loans in the past 10 years:
• default: a binary indicator of whether or not a loan defaulted, where 1 indicates default and 0 indicates
repayment
• age: age of the borrower, a categorical variable with the following levels:
– 20-29
– 30-44
– 45-64
– 65+
• sex: categorical variable, either female or male
• income: a continuous variable indicating the borrower’s income
After fitting a logistic regression model using all available covariates to predict default, we obtain the following
output:

Covariate Estimate Std. error

(Intercept) 0.304 0.0180
sexmale 0.143 0.0204
age30-44 -0.741 0.4204
age45-64 -0.023 0.0018
age65+ -0.133 0.0685
income -1.246 0.2672

Use this table to answer problems 13 and 14.

Problem 13.
All else equal, which age group is most likely to default on a loan?
(a) 20-29
(b) 30-44
(c) 45-64
(d) 65+

Problem 14.
Suppose we have a male and female borrower who are both 50 years old and make $60k per year. The odds
of the male borrower defaulting is s × odds of the female borrower defaulting. What is the correct value
of s?
(a) −0.143
(b) 0.143
(c) exp(0.143)
(d) logit−1 (0.304 + 0.143)

8
Problem 15.
Which of the following statements about prediction intervals is false?
(a) Prediction intervals quantify the uncertainty around a specific response.
(b) For a given X = x, the prediction interval is the same size or larger than the mean confidence interval
of the same significance level.
(c) For a given X = x, the prediction interval is generally smaller when x is closer to the mean of the Xi ’s.
(d) For a given X = x, a 95% prediction interval is theoretically guranteed to cover 95% or more Y s
observed in the data with the same X.

9
Problem 16.
After fitting the linear regression Y = β0 + β1 X1 + β2 X2 + ε on a dataset, we find β̂2 = 2 (i.e., the coefficient
for X2 is 2).
What is the correct interpretation of this result?
(a) Having X1 fixed, for every unit increase in X2 , Y increases by 2 on average.
(b) Having X1 fixed, for every unit increase in Y , X2 increases by 2 on average.
(c) Having X1 fixed, for every unit increase in X2 , Y increases by 1/2 on average.
(d) Having X1 fixed, for every unit increase in X2 , Y increases by 1 on average.

10
Suppose we are conducting a randomized experiment to test the effect of a new drug on an individual’s
survival rate. Only people in the treatment group are given the chance to take the new drug. We record our
observations in following table. The “–” in the table represents unobserved values.

Group Size Survived patients Survival Rate

Assigned to treatment
Accepted new drug 20 15 75%
Refused new drug 20 10 50%
Total 40 25 62.5%
Assigned to control
Would have accepted new drug – – –
Would have refused new drug – – –
Total 60 32 53.3%

Using the above information, answer problems 17 and 18.

Problem 17.
What is the estimated number of never-treats in the control group? (Note that the total number of people in
the treatment and control groups are different.)
(a) 10
(b) 20
(c) 30
(d) 60

Problem 18.
What is the estimated survival rate among never-treats in the control group?
(a) 50%
(b) 53.3%
(c) 75%
(d) Not enough information to compute

11
Problem 19.
Which of the following relationships is NOT a linear regression model?
(a) yi = β0 + β1 xi + εi

(b) yi = β0 + β1 1+xxi
i
+ εi

Problem 20.
Suppose you build a logistic regression model to infer whether a message is spam. The outcome is a binary
variable Yi ∈ {0, 1} where 0 denotes that the ith message is not spam and 1 denotes the ith message is spam.
You collect a dataset of 1600 messages and use your model to make predictions, which yields the following
results:

Model Output=0 Model Output =1

Actual =0 350 650
Actual = 1 400 200

What is the precision of this model?

(a) 650
200+650

(b) 200
200+650

(d) 350+200
1600

12
Answer sheet

Name:

SUNet ID: @stanford.edu

C-17-80752

{True/false questions}
Fill-in the circle of the correct answer. (T = true, F = false)

1 T F

2 T F

3 T F

4 T F

5 T F

{Multiple choice questions}

6 a b c d 16 a b c d

7 a b c d 17 a b c d

8 a b c d 18 a b c d

9 a b c d 19 a b c d

10 a b c d 20 a b c d

11 a b c d

12 a b c d

13 a b c d

14 a b c d

15 a b c d

Full Download Supply Chain Management Text and Cases 2nd Edition Janat Shah PDF DOCX
100% (1)
Full Download Supply Chain Management Text and Cases 2nd Edition Janat Shah PDF DOCX
54 pages
Section 3
No ratings yet
Section 3
29 pages
ML QUES MOD-1
No ratings yet
ML QUES MOD-1
25 pages
2021 Quiz2 Sample
No ratings yet
2021 Quiz2 Sample
7 pages
Statistics Quiz
No ratings yet
Statistics Quiz
20 pages
RGRSSN Assgnmnt
No ratings yet
RGRSSN Assgnmnt
11 pages
Machine 2021 Jul-Dec
No ratings yet
Machine 2021 Jul-Dec
46 pages
all-old-final-exams
No ratings yet
all-old-final-exams
50 pages
02 ML Sol MidSem - Makeup - Sol - Upated (From Bits)
No ratings yet
02 ML Sol MidSem - Makeup - Sol - Upated (From Bits)
6 pages
Eco220y Au18
No ratings yet
Eco220y Au18
25 pages
RSM1282-2025-Session 9-Binary Dependent Variables & Logistic Regression - POST
No ratings yet
RSM1282-2025-Session 9-Binary Dependent Variables & Logistic Regression - POST
35 pages
ML Question bank
No ratings yet
ML Question bank
13 pages
AST Day 2 Slides
No ratings yet
AST Day 2 Slides
58 pages
Machine 2020 Jul-Dec
No ratings yet
Machine 2020 Jul-Dec
45 pages
52217defe4460b1755eb581c2fe41133_MIT18_650F16_PSet8
No ratings yet
52217defe4460b1755eb581c2fe41133_MIT18_650F16_PSet8
4 pages
Problem Set 2
No ratings yet
Problem Set 2
3 pages
1992 GTR Supp
83% (6)
1992 GTR Supp
279 pages
My Proposal - IDENTIFICATION AND ISOLATION OF PROBIOTIC MICROORGANISM IN EEL
100% (7)
My Proposal - IDENTIFICATION AND ISOLATION OF PROBIOTIC MICROORGANISM IN EEL
14 pages
ECS 4220
No ratings yet
ECS 4220
7 pages
Classification _ DPP 01
No ratings yet
Classification _ DPP 01
5 pages
Final Exam 102 w10 Solutions
No ratings yet
Final Exam 102 w10 Solutions
14 pages
Test 1 With Key 10-3
No ratings yet
Test 1 With Key 10-3
16 pages
ML Unit 03 MCQ
No ratings yet
ML Unit 03 MCQ
20 pages
ESB2021 Resit With Solution
No ratings yet
ESB2021 Resit With Solution
9 pages
Concordia University Machine Learning Assaignment with solutions
No ratings yet
Concordia University Machine Learning Assaignment with solutions
8 pages
Activity 7
No ratings yet
Activity 7
5 pages
Sample Exam With Solutions. Econometrics II 2015.
No ratings yet
Sample Exam With Solutions. Econometrics II 2015.
15 pages
12
No ratings yet
12
16 pages
2223_1_SEHH2313
No ratings yet
2223_1_SEHH2313
16 pages
ps_lregression
No ratings yet
ps_lregression
6 pages
627475908-5-MCQ-LR-no-answer
No ratings yet
627475908-5-MCQ-LR-no-answer
12 pages
5 MCQ LR No Answer
100% (2)
5 MCQ LR No Answer
12 pages
Machine 2021 Jan-Apr
No ratings yet
Machine 2021 Jan-Apr
45 pages
Stat_Model _exam_2017_DBU
No ratings yet
Stat_Model _exam_2017_DBU
20 pages
Ecntr Assmm
No ratings yet
Ecntr Assmm
23 pages
Dummy Dependent Variable
100% (1)
Dummy Dependent Variable
58 pages
STAT3301 - Term Exam 2 - CH11 Study Package
No ratings yet
STAT3301 - Term Exam 2 - CH11 Study Package
6 pages
2020 - 21 - Extra Exercise 1
No ratings yet
2020 - 21 - Extra Exercise 1
2 pages
Same Guidelines I Gave in ISQS 5347
No ratings yet
Same Guidelines I Gave in ISQS 5347
2 pages
Econ205 Final Ans
No ratings yet
Econ205 Final Ans
7 pages
Notes 13
No ratings yet
Notes 13
18 pages
Follow Up - Matrix 1
No ratings yet
Follow Up - Matrix 1
50 pages
DP-Lite: User Guide
No ratings yet
DP-Lite: User Guide
118 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
metrikaq
No ratings yet
metrikaq
11 pages
Product Catalog 2023
No ratings yet
Product Catalog 2023
54 pages
Advanced_Stats_Final_Exam_Sample
No ratings yet
Advanced_Stats_Final_Exam_Sample
9 pages
2011MeteringPumps Pulsafeeder
No ratings yet
2011MeteringPumps Pulsafeeder
72 pages
Ecf630-Final Examination - May 2021
No ratings yet
Ecf630-Final Examination - May 2021
12 pages
A John Wyndham Checklist Philip Stephenson-Payne (Compiler) 2024 Scribd Download
100% (8)
A John Wyndham Checklist Philip Stephenson-Payne (Compiler) 2024 Scribd Download
41 pages
NSE BA Sample Paper With Solution
100% (1)
NSE BA Sample Paper With Solution
18 pages
Demo0 Sol1
No ratings yet
Demo0 Sol1
5 pages
ECON3334 Midterm Fall2023 Question
No ratings yet
ECON3334 Midterm Fall2023 Question
7 pages
EF3450 2122B MID
No ratings yet
EF3450 2122B MID
11 pages
Important Instructions To The Candidates:: Part B
No ratings yet
Important Instructions To The Candidates:: Part B
7 pages
Global Strategy and Sustainability PPT (1)
No ratings yet
Global Strategy and Sustainability PPT (1)
10 pages
Grade 3 Data Mining: Question Text
No ratings yet
Grade 3 Data Mining: Question Text
28 pages
Metrics Jan 2021
No ratings yet
Metrics Jan 2021
10 pages
Metrics Aug 2023
No ratings yet
Metrics Aug 2023
10 pages
Sinumerik 840D PL_SIMODRIVE 611_Rotor position synchronization_pole position identification
No ratings yet
Sinumerik 840D PL_SIMODRIVE 611_Rotor position synchronization_pole position identification
9 pages
PHD Thesis Template KTH
100% (3)
PHD Thesis Template KTH
7 pages
Limited Dependent Variables - Binary Dependent Variables
No ratings yet
Limited Dependent Variables - Binary Dependent Variables
24 pages
ECON 6001 Assignment1 2023
No ratings yet
ECON 6001 Assignment1 2023
9 pages
Aff700 1000 220401
No ratings yet
Aff700 1000 220401
8 pages
ML U3 MCQ
No ratings yet
ML U3 MCQ
20 pages
30 Questions To Test Your Understanding of Logistic Regression
No ratings yet
30 Questions To Test Your Understanding of Logistic Regression
13 pages
JUKOOL Installation Manual of FT-TAC-PI02
No ratings yet
JUKOOL Installation Manual of FT-TAC-PI02
7 pages
JC Exp - 1145
No ratings yet
JC Exp - 1145
9 pages
Carrier Cooling Load Hand Book-1 PDF
No ratings yet
Carrier Cooling Load Hand Book-1 PDF
16 pages
V07 (Architectural FOH Lighting)
No ratings yet
V07 (Architectural FOH Lighting)
231 pages
Solutions Problem Set 1
No ratings yet
Solutions Problem Set 1
7 pages
Survey (For Electric Crisis in Pakistan)
No ratings yet
Survey (For Electric Crisis in Pakistan)
2 pages
Graded Quiz Unit 3 PDF
No ratings yet
Graded Quiz Unit 3 PDF
10 pages
Acid Cooling
No ratings yet
Acid Cooling
15 pages
Softening Final
100% (1)
Softening Final
23 pages
Binary Dependent Var
100% (1)
Binary Dependent Var
5 pages
Exam Questions
No ratings yet
Exam Questions
3 pages
Boiler
No ratings yet
Boiler
10 pages
Research Paper On Future of 5G Wireless System: June 2021
No ratings yet
Research Paper On Future of 5G Wireless System: June 2021
6 pages
ATS
No ratings yet
ATS
8 pages
Promeco Extruder System PES Briquetting Machine
No ratings yet
Promeco Extruder System PES Briquetting Machine
2 pages
MYTHOLOGY 2024
No ratings yet
MYTHOLOGY 2024
6 pages
Angling Dharma Folklore
No ratings yet
Angling Dharma Folklore
3 pages
Class C and D Cargo Compartment Regulations
No ratings yet
Class C and D Cargo Compartment Regulations
2 pages
Bibliography
No ratings yet
Bibliography
2 pages
Ebrahim Was Not A Jew Nor A Christian But An Upright Muslim.
No ratings yet
Ebrahim Was Not A Jew Nor A Christian But An Upright Muslim.
3 pages
Fiberlign Cushion Clamp For Opgw
No ratings yet
Fiberlign Cushion Clamp For Opgw
2 pages
26 End Time Signs
No ratings yet
26 End Time Signs
2 pages
Portable Products: Power Factor: CAPO 2.5: Battery-Operated, 2.5 KV Capacitance Power Factor Test Set
No ratings yet
Portable Products: Power Factor: CAPO 2.5: Battery-Operated, 2.5 KV Capacitance Power Factor Test Set
2 pages
Student's Solutions Manual and Supplementary Materials for Econometric Analysis of Cross Section and Panel Data, second edition
From Everand
Student's Solutions Manual and Supplementary Materials for Econometric Analysis of Cross Section and Panel Data, second edition
Jeffrey M. Wooldridge
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

2021 Quiz2 Problems

Uploaded by

2021 Quiz2 Problems

Uploaded by

MS&E 125 Quiz 2, 2021

SUNet ID: @stanford.edu

Ŷ = 0.305 + 0.710X1 + 2.725X2 + 5.523X3

Covariate Estimate Std. error

Use this table to answer problems 13 and 14.

Group Size Survived patients Survival Rate

Using the above information, answer problems 17 and 18.

Model Output=0 Model Output =1

What is the precision of this model?

SUNet ID: @stanford.edu

{Multiple choice questions}

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.