0% found this document useful (0 votes)

9 views3 pages

Case 3

The document outlines a case study for applied econometrics focusing on predicting high earners using a dataset. It includes tasks such as data importation, summary statistics, variable checks, and model building using linear probability and logit models. The analysis aims to explore factors influencing annual income and assess the relative status of men and women in the labor market.

Uploaded by

rannvijaymazumdar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views3 pages

Case 3

Uploaded by

rannvijaymazumdar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Applied Econometrics for Managers: Case #3

High Earner Prediction

1. We will use the “data_case_3.csv” dataset for this session. The variable description is pro-
vided in the “data_doc_case_3.txt” file. Your first task is to import the dataset into R. How many
observations are in the imported dataset? (You should have 30162 observations)

2. Get the summary statistics of the data. Fill in the following table.

Variable Number of Observations Mean Standard Deviation Minimum Maximum Range

Age

3. We want to ensure that there is no coding error in variables education_num and education.
To check this, create a table with education as the row variable and education_num as the column
variable, and fill in the following.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
th
10
11th
12th
1st - 4th
5th -6th
7th -8th
9th
Assoc-acdm
Assoc-voc
Bachelors
Doctorate
HS-grad
Masters
Preschool
Prof-school
Some-college

4. How many individuals in the dataset are earning more than $50,000 annually?

1
5. If we are interested to know the relative status of men and women in the labor market
from this dataset, what can we do? Complete the following tables. Can we say anything about
the relative status from these tables?

Share of high earners and low earners (<=$50K annual income) in male and female (>$50K annual income)
Less than equal to $50K annual income Greater than $50K annual income
Female 100%
Male 100%

Share of male and female in high earners (>$50K annual income) and low earners (<=$50K annual income)
Less than equal to $50K annual income Greater than $50K annual income
Female
Male
100% 100%

6. In which occupation the share of high earners (>$50K annual income) is the highest? In
high earners, share of people with which education level is the highest? Also mention the highest
shares for both these questions.

7. Create a new variable inc50k which takes the value of ‘0’ if an individual earns less than
or equal to $50,000 per year and becomes ‘1’ if an individual earns greater than $50,000 per year.
What is its mean?

8. We are interested to know if the likelihood of a person being a high earner (>$50,000 annual
income) can be predicted. In order to answer this, build a linear probability model by regressing
inc50k on age, square of age, sex, race, and education_num. What is the value of the estimated
coefficient for sex? Is it statistically significant? Explain the other estimated coefficients as well.

9. Now drop education_num from the model in question 8 and add relationship, education,
workclass, occupation, hours_per_week, and capital_gain to that model as explanatory variables.
Explain the results. How has the estimated coefficient for sex changed, and what does that mean?

10. What is the prediction accuracy of the model in question 9? To check this, consider that an
individual will earn more than $50,000 annually with 100% certainty if predicted inc50k is greater
than or equal to 0.5, and will earn less than or equal to $50,000 annually if predicted inc50k is less
than 0.5.

11. Now, let’s build a logit model using inc50k as the dependent variable and age, square of
age, sex, race, and education_num as the explanatory variables. What is the value of the estimated
coefficient for sex? What message do the estimated coefficients convey in this case?

12. What is McFadden’s pseudo R-squared for the logit model in question 11? Also, test for
the overall significance of this model.

2
13. Now drop education_num from the model in question 11 and add relationship, education,
workclass, occupation, hours_per_week, and capital_gain to that model as explanatory variables.
What is the estimated coefficient for sex now? What message do estimated coefficients convey
in this case?

14. Run a statistical test to determine whether the new explanatory variables included in the
model in question 13 belong to the model or not.

15. Check for prediction accuracy of the model in question 13. To check this, consider that an
individual will earn more than $50,000 annually with 100% certainty if predicted inc50k is greater
than or equal to 0.5, and will earn less than or equal to $50,000 annually if predicted inc50k is less
than 0.5.

16. What is average partial effect (APE)? Calculate the average partial effects for the model
in question 13. What is the marginal effect for sex? Explain the other marginal effects as well.

River 9788770220262
No ratings yet
River 9788770220262
1 page
أساسيات الاقتصاد القياسي باستخدام ايفيوز د خالد السواعي موقع المكتبة
No ratings yet
أساسيات الاقتصاد القياسي باستخدام ايفيوز د خالد السواعي موقع المكتبة
290 pages
Scan 12 Jun 25 16 17 27
No ratings yet
Scan 12 Jun 25 16 17 27
10 pages
Mid Exam Econometric
No ratings yet
Mid Exam Econometric
6 pages
Chapter 2
No ratings yet
Chapter 2
97 pages
NLP Student
No ratings yet
NLP Student
11 pages
y β β x β x u SSE, - SER σ - . SSR R R
No ratings yet
y β β x β x u SSE, - SER σ - . SSR R R
3 pages
Dummy Variable Ques
No ratings yet
Dummy Variable Ques
7 pages
Data Science Session 8 Clustering V0
No ratings yet
Data Science Session 8 Clustering V0
30 pages
HW1 24
No ratings yet
HW1 24
4 pages
Assign Docs
No ratings yet
Assign Docs
20 pages
Mock Exam2
No ratings yet
Mock Exam2
17 pages
12th B BSS 7th Sem ECON 405 2021
No ratings yet
12th B BSS 7th Sem ECON 405 2021
3 pages
Test Metrics
No ratings yet
Test Metrics
10 pages
Unit V
No ratings yet
Unit V
22 pages
Problem Set 7
No ratings yet
Problem Set 7
5 pages
EViews Practical 2 Answers
No ratings yet
EViews Practical 2 Answers
5 pages
Sat Class 0811
0% (1)
Sat Class 0811
2 pages
True Regression Model: C - Logincome = Β + Β · B - Years Of Schooling + Β · D - Age + Β · E - Female + Β ·H - Smoker + Β · D - Age ·E - Female + Β · D - Age · H - Smoker + Ε
No ratings yet
True Regression Model: C - Logincome = Β + Β · B - Years Of Schooling + Β · D - Age + Β · E - Female + Β ·H - Smoker + Β · D - Age ·E - Female + Β · D - Age · H - Smoker + Ε
7 pages
Lec 7
No ratings yet
Lec 7
7 pages
MECO6312 2021F Test1 - AZ
No ratings yet
MECO6312 2021F Test1 - AZ
6 pages
Homework 2
No ratings yet
Homework 2
3 pages
Mit Class 9
No ratings yet
Mit Class 9
19 pages
Report
No ratings yet
Report
5 pages
Econometric Methods
No ratings yet
Econometric Methods
8 pages
Exercises Chapter2 Part1 2
No ratings yet
Exercises Chapter2 Part1 2
3 pages
Mock Exam 2 - Solutions
No ratings yet
Mock Exam 2 - Solutions
6 pages
Topic Wise Test Polynomials Cbse Class 9 Maths: Verify Division Algorithm For The P (X) X X
No ratings yet
Topic Wise Test Polynomials Cbse Class 9 Maths: Verify Division Algorithm For The P (X) X X
1 page
Functions of Several Variables, Partial Derivatives
No ratings yet
Functions of Several Variables, Partial Derivatives
26 pages
Eco Exercise 3answer Ans 1
No ratings yet
Eco Exercise 3answer Ans 1
8 pages
Tutorial 3 4
No ratings yet
Tutorial 3 4
4 pages
Defining Work Tasks
No ratings yet
Defining Work Tasks
26 pages
Blokchain Technology Assignment: 1. Public Distribution System (PDS)
No ratings yet
Blokchain Technology Assignment: 1. Public Distribution System (PDS)
5 pages
Community Detection
No ratings yet
Community Detection
72 pages
Chapter 1 Qualitative Variables Final
No ratings yet
Chapter 1 Qualitative Variables Final
74 pages
Adv Econometrics
No ratings yet
Adv Econometrics
8 pages
Reg. No.: 39110009 Colab Notebook Link: Name: Abivirshan Suresh
No ratings yet
Reg. No.: 39110009 Colab Notebook Link: Name: Abivirshan Suresh
27 pages
Bayes Regression
No ratings yet
Bayes Regression
16 pages
Econometrics Eviews 2
No ratings yet
Econometrics Eviews 2
13 pages
AE Week 3
No ratings yet
AE Week 3
3 pages
Chapter 7
No ratings yet
Chapter 7
50 pages
Worksheet - 4 Name-Sarah Nuzhat Khan ID-20175008
No ratings yet
Worksheet - 4 Name-Sarah Nuzhat Khan ID-20175008
6 pages
Homework 03 Answers PDF
No ratings yet
Homework 03 Answers PDF
12 pages
Quality Vs Quantatiy
No ratings yet
Quality Vs Quantatiy
10 pages
S Doc1
100% (1)
S Doc1
7 pages
CH 1 - Economic Data & Nature of Econometrics
No ratings yet
CH 1 - Economic Data & Nature of Econometrics
3 pages
Capital Asset Pricing Model
No ratings yet
Capital Asset Pricing Model
2 pages
Michael Joseph-Introductory Econometrics
No ratings yet
Michael Joseph-Introductory Econometrics
8 pages
1-6 Dummy Variable
No ratings yet
1-6 Dummy Variable
16 pages
Logistic Regression
No ratings yet
Logistic Regression
4 pages
Assignment3 05.01.24
No ratings yet
Assignment3 05.01.24
4 pages
ps1 Build
No ratings yet
ps1 Build
4 pages
LT15 Graphing Linear Functions Foldable
No ratings yet
LT15 Graphing Linear Functions Foldable
2 pages
MTH403 Assignment 231219
No ratings yet
MTH403 Assignment 231219
2 pages
Assignment 2 S 10
No ratings yet
Assignment 2 S 10
4 pages
Project3 1
No ratings yet
Project3 1
2 pages
Solutions To Sample Final Exam ECO2151
No ratings yet
Solutions To Sample Final Exam ECO2151
7 pages
Bing Qian Montecarlo
No ratings yet
Bing Qian Montecarlo
20 pages
Lecture 7 B
No ratings yet
Lecture 7 B
72 pages
Econometrics Lecture Note Chapter 4 and 5
No ratings yet
Econometrics Lecture Note Chapter 4 and 5
39 pages
Lecture 8 - Limited Dependent Var PDF
No ratings yet
Lecture 8 - Limited Dependent Var PDF
78 pages
Chapter 5 Sol
100% (1)
Chapter 5 Sol
48 pages
Chapter 4 (Compatibility Mode)
No ratings yet
Chapter 4 (Compatibility Mode)
66 pages
Homework 2 With Suggested Answers
No ratings yet
Homework 2 With Suggested Answers
14 pages
Applied Econometrics For Managers (MBAA-II, AY: 2023-24) IIM Kashipur
No ratings yet
Applied Econometrics For Managers (MBAA-II, AY: 2023-24) IIM Kashipur
3 pages
Lecture 3 Types of Machine Learning
No ratings yet
Lecture 3 Types of Machine Learning
40 pages
Past Paper 2019
No ratings yet
Past Paper 2019
7 pages
Solution Manual For Introductory Econometrics 6th Edition by Woolridge
0% (3)
Solution Manual For Introductory Econometrics 6th Edition by Woolridge
7 pages
Data Structures and Algorithm: Avl Tree
No ratings yet
Data Structures and Algorithm: Avl Tree
42 pages
E-Commerce With Digital Signature
No ratings yet
E-Commerce With Digital Signature
18 pages
Econometrics Assignment HW4
No ratings yet
Econometrics Assignment HW4
8 pages
SS202B 2015midterm Sol
No ratings yet
SS202B 2015midterm Sol
7 pages
Math 5
No ratings yet
Math 5
37 pages
eNAT Grade Level Report (Grade 5)
No ratings yet
eNAT Grade Level Report (Grade 5)
22 pages
Solutions Week 10
No ratings yet
Solutions Week 10
7 pages
Capitulo1 Exercicios
No ratings yet
Capitulo1 Exercicios
3 pages
Exercise Sheet 1 The Multiple Regression Model
No ratings yet
Exercise Sheet 1 The Multiple Regression Model
5 pages
Chapter 6 - Optimization Models With Integer Variables: Page 1
No ratings yet
Chapter 6 - Optimization Models With Integer Variables: Page 1
14 pages
Data Mining Introduction
No ratings yet
Data Mining Introduction
52 pages
K Means R and Rapid Miner Patient and Mall Case Study
No ratings yet
K Means R and Rapid Miner Patient and Mall Case Study
80 pages
Regression With Qualitative Information
No ratings yet
Regression With Qualitative Information
25 pages
Cross Section Answers
No ratings yet
Cross Section Answers
22 pages
Econo Mid-Term Exam
No ratings yet
Econo Mid-Term Exam
4 pages
Econometrics 2 Exam Answers
67% (3)
Econometrics 2 Exam Answers
6 pages
ETC1010 S12015 Solution Part 1
No ratings yet
ETC1010 S12015 Solution Part 1
7 pages
Econometrics II-1
No ratings yet
Econometrics II-1
56 pages
Big Data
No ratings yet
Big Data
9 pages
Introductory Econometrics - Exam: 1 Theoretical Questions
No ratings yet
Introductory Econometrics - Exam: 1 Theoretical Questions
5 pages
Long Life Learning: Preparing for Jobs that Don't Even Exist Yet
From Everand
Long Life Learning: Preparing for Jobs that Don't Even Exist Yet
Michelle R. Weise
3/5 (1)
How Much Income Do I Really Need in Retirement?
From Everand
How Much Income Do I Really Need in Retirement?
Dale Maley
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Case 3

Uploaded by

Case 3

Uploaded by

Applied Econometrics for Managers: Case #3

High Earner Prediction

Variable Number of Observations Mean Standard Deviation Minimum Maximum Range

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.