0% found this document useful (0 votes)

104 views4 pages

Analytics Report: Gray Hunter and John Mcclintock Yamelle Gonzalez Dummy Variables APRIL 14, 2020

The researcher analyzed credit card data from Taiwanese customers to predict credit limits and likelihood of default. Two regression models were created: one used variables like age, education, marital status to predict limits; the other used age, average bill, payment amounts to predict defaults. Both models had low standard errors and high adjusted R-squares, showing the variables were significant predictors. The models allow estimating a customer's limit and default chance based on their characteristics.

Uploaded by

api-529885888

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

104 views4 pages

Analytics Report: Gray Hunter and John Mcclintock Yamelle Gonzalez Dummy Variables APRIL 14, 2020

Uploaded by

api-529885888

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

ANALYTICS REPORT

TO: GRAY HUNTER AND JOHN MCCLINTOCK

FROM: YAMELLE GONZALEZ

SUBJECT: DUMMY VARIABLES

DATE: APRIL 14, 2020

Introduction
As requested, the researcher studied and analyzed the credit card data from Taiwanese
customers further. His/her goal was to accurately predict the credit card limit, and the
likelihood of someone defaulting on their next month’s payment. For the model
predicting these customer’s credit limit, the researcher used the following variables: age,
education level, marital status, and age, and for the other model predicting their default
chance, the researcher used their age, average bill amount, and average previous payment
amounts. With this information, and by conducting regression models, the researcher
determined the best model for calculating the previously mentioned predictions. Below
you will find the two models conducted, which are regression statistic tests.

Data Analysis
Regression Output: Best Model to Predict Credit Limit
SUMMARY OUTPUT
Made by: Yamelle Gonzalez Dabdoub
Regression Statistics
Multiple R 0.344472735
R Square 0.118661465
Adjusted R Square 0.118444074
Standard Error 4027.958073
Observations 24332

ANOVA
df SS MS F Significance F
Regression 6 53136100338 8856016723 545.844006 0
Residual 24325 3.9466E+11 16224446.2
Total 24331 4.47796E+11

Coefficients Standard Error t Stat P-value Lower 95% Upper 95%

Intercept 376.5023861 159.5896786 2.35919008 0.01832273 63.6967991 689.307973
Female 366.0376855 53.33062783 6.86355478 6.8781E-12 261.506374 470.568997
High School -1135.585699 75.081855 -15.124636 1.9102E-51 -1282.7508 -988.42065
Graduate School 2321.850475 57.82486723 40.1531484 0 2208.51018 2435.19077
Married 2978.65553 234.7488337 12.6886915 8.9362E-37 2518.53338 3438.77768
Married*Age -67.02842584 6.391249245 -10.487531 1.1183E-25 -79.555668 -54.501184
AGE 117.9569722 4.713377908 25.0259951 1.699E-136 108.718462 127.195483
Which variables are included in the best model? How do you know?
The variables that are included in the best model are the following: “Female”, “High
School”, “Graduate School”, “Married”, “Married*Age”, and “Age”. They are all
included because they are significant- low p value and high t statistic. In addition, a
researcher would never remove a significant variable, because it would make their model
less precise and reliable.

Regression Equation
^
Limit :376.50+ 366.04 ( Female )−1135.59 ( High School ) +2321.85 ( Graduate School )+ 2978.66 ( Married ) −67.

Interpretation of the cand the Standard Error

R2 : The researcher is 11.87% of the way toward perfectly predicting the credit limit using
this model and the variables given: female, high school, graduate school, married,
married*age, and age.

Standard Error: The researcher’s predictions of the credit limit are off by an average of
$4,027.96.

Interpretation of the Coefficients

Female: Females have a credit limit $366.04 higher than males, on average and all else
constant.

High School: Someone with a high school degree has a credit limit of $1135.59 lower
than someone with a graduate school degree, on average and else constant.

Graduate School: Someone with a graduate school degree has a credit limit of $2321.85
higher than someone with a high school degree, on average and else constant.

Married: At zero years old, someone who is married would have a credit limit of
$2978.66 higher than someone who is single, on average and all else constant.

Married*Age: For married people, as age increases by 1 year, credit limit increases by
$51, on average and all else constant.
OR
For married people, as age increases by 1 year, credit limit increases by $67.03 less than
for a single person, on average and all else constant.

Age: For single people, as age increases by 1 year, credit limit increases by $117.96, on
average and all else constant.

Note: *The slopes written above mention the slope for married and single people as well
as the changes for married people*

Regression Output: Best Model to Predict the Chance of Defaulting on Next Month’s
Payment

2
SUMMARY OUTPUT
Made by: Yamelle Gonzalez Dabdoub
Regression Statistics
Multiple R 0.11812777
R Square 0.01395417
Adjusted R Square 0.01383258
Standard Error 0.41209391
Observations 24332

ANOVA
df SS MS F Significance F
Regression 3 58.46631187 19.4887706 114.760399 8.5694E-74
Residual 24328 4131.414791 0.16982139
Total 24331 4189.881103

Coefficients Standard Error t St at P-value Lower 95% Upper 95%

Intercept 0.20453129 0.010650007 19.204803 1.3566E-81 0.18365662 0.22540596
Average Bill 6.8707E-06 1.34359E-06 5.11366262 3.1837E-07 4.2372E-06 9.5042E-06
Average Payment -0.0001685 9.23401E-06 -18.24578 6.957E-74 -0.0001866 -0.0001504
AGE 0.00100365 0.000289205 3.47036809 0.00052065 0.00043679 0.00157051

Which variables are included in the best model? How do you know?
The variables that are included in the best model are the following: “Average Bill”,
“Average Payment”, and “Age”. They are all included because they are significant- low p
value and high t statistic. In addition, and as mentioned before, a researcher would never
remove a significant variable, because it would make their model less precise and
reliable.

Regression Equation
^
P( Default =1) :0.20+0.0000069 ( Average Bill )−0.00017 ( Average Payment ) +0.0010( Age)

Interpretation of the R2and the Standard Error

R2 :The researcher is 1.40% of the way toward perfectly predicting the chance of
defaulting on next month’s payment using this model and the variables given: age,
average bill amount and average previous payment amounts.

Standard Error: The researcher’s predictions of the default chance are off by an average
of 41.21 percentage points.

Interpretation of the Coefficients

Average Bill: As average bill amount increases by $1000, chance of defaulting increases
by 0.69 percentage points, on average and all else constant.

Average Payment: As average payment amounts increases by $10, chance of defaulting

decreases by 0.17 percentage points, on average and all else constant.

Age: As age increases by 1 year, chance of defaulting increases by 0.0010 percentage

points, on average and all else constant.

3
For the “Limit” Model-Prediction of the Limit of a 35-year-old, Single, Female with a
High School Degree
^
Limit :376.50+ 366.04 ( Female )−1135.59 ( High School ) +2321.85 ( Graduate School )+ 2978.66 ( Married ) −67.

376.50+366.04(1)-1135.59(1)+2321.85(0)+2978.66(0)-67.03(0*35)+117.96(35)
$3735.55

For the “Default” Model-Prediction of the Chance of Someone Defaulting who is 25

years old, has an Average Bill Amount of $1150, and Average Payments of $900

^
P( Default =1) :0.20+0.0000069 ( Average Bill )−0.00017 ( Average Payment ) +0.0010( Age)
0.20+0.0000069 ( 1150 ) −0.00017 ( 900 ) +0.0010(25)
0.079935 percentage points

Conclusion
The researcher would recommend individuals to use either the credit limit regression
statistics test or the default chance regression statistics test due to having a low standard
error, high adjusted R2, and showing significance in all of its variables. In more depth, the
credit limit regression statistics test had a standard error of 4027.96 and an adjusted R2of
0.1184 whereas the default chance regression statistics test had a standard error of 0.4121
and an adjusted R2 of 0.0138. Although the credit limit regression statistics test appears to
have a much higher standard error and a much lower adjusted R2, it is not the case
entirely, as it is only calculated in a different measurement. Besides, all of the p-values
were low, and all of the t statistics were relatively high. To conclude, these models with
the given variables are quite effective and reliable for predicting the credit limit and
chance of default of Taiwanese customers.

Analysis of German Credit Data
100% (1)
Analysis of German Credit Data
24 pages
Assessment - 2
No ratings yet
Assessment - 2
14 pages
Data Analysis Final Assignment
No ratings yet
Data Analysis Final Assignment
14 pages
DADM Assessment 2
No ratings yet
DADM Assessment 2
11 pages
XLSTAT - Statistical Analysis Software
No ratings yet
XLSTAT - Statistical Analysis Software
43 pages
2 Assignment For Data Analysis For Decision Making: Dipanwita Ghosh
No ratings yet
2 Assignment For Data Analysis For Decision Making: Dipanwita Ghosh
13 pages
Ba Cia3
No ratings yet
Ba Cia3
33 pages
Farlin Discussion 7 - Dummy Variables Credit Limit and Default Probability Report Bnad277
No ratings yet
Farlin Discussion 7 - Dummy Variables Credit Limit and Default Probability Report Bnad277
9 pages
Assesment
40% (5)
Assesment
15 pages
CH 5 Limited Dependent Variable Models Jan 2023
No ratings yet
CH 5 Limited Dependent Variable Models Jan 2023
43 pages
Credit Card Analysis
No ratings yet
Credit Card Analysis
19 pages
DADM Assignment
No ratings yet
DADM Assignment
10 pages
Analytics Report Outline Bnad 277
No ratings yet
Analytics Report Outline Bnad 277
6 pages
Probability of A Term Deposit
No ratings yet
Probability of A Term Deposit
31 pages
Eda Case Study Final PDF
100% (1)
Eda Case Study Final PDF
15 pages
Chapter 5
No ratings yet
Chapter 5
22 pages
Bnad Lab 7
No ratings yet
Bnad Lab 7
5 pages
Data Analytics Full Time Bootcamp PDF
100% (1)
Data Analytics Full Time Bootcamp PDF
11 pages
Limited Dependent Variables Models-1
No ratings yet
Limited Dependent Variables Models-1
23 pages
EDA Assignment S
No ratings yet
EDA Assignment S
33 pages
Logistic Regression - 2011
No ratings yet
Logistic Regression - 2011
76 pages
Qualitative Predictor
No ratings yet
Qualitative Predictor
15 pages
CH 5 2023 Eonometrics For Acct and Finance
No ratings yet
CH 5 2023 Eonometrics For Acct and Finance
6 pages
Lab4 277
No ratings yet
Lab4 277
3 pages
Lending Club Data Analysis and Default
No ratings yet
Lending Club Data Analysis and Default
10 pages
SFA - Group 10 - Assignment
No ratings yet
SFA - Group 10 - Assignment
4 pages
Consumer Credit Card Usage Analysis Krithik Jain Business Statistics MGSC 2301-07 Professor Dimitrios Fotiadis
No ratings yet
Consumer Credit Card Usage Analysis Krithik Jain Business Statistics MGSC 2301-07 Professor Dimitrios Fotiadis
10 pages
MGT5426 1
No ratings yet
MGT5426 1
10 pages
Assignment-2: Submitted By: Name: Vipul Kumar Singh Roll No: 133118 Submitted To: Prof. Kuldeep Baishya
No ratings yet
Assignment-2: Submitted By: Name: Vipul Kumar Singh Roll No: 133118 Submitted To: Prof. Kuldeep Baishya
4 pages
Team 14 - Project Documentation - Taiwan Credit Defaults v1.0
No ratings yet
Team 14 - Project Documentation - Taiwan Credit Defaults v1.0
3 pages
Assignment 2 - Consumer Research, Inc.
No ratings yet
Assignment 2 - Consumer Research, Inc.
9 pages
Problem Description
No ratings yet
Problem Description
7 pages
Business Report Assesment 2-1
No ratings yet
Business Report Assesment 2-1
13 pages
Case: German Credit: Var. # Variable Name Description Variable Type Code Description
No ratings yet
Case: German Credit: Var. # Variable Name Description Variable Type Code Description
4 pages
AE Project
No ratings yet
AE Project
9 pages
Logistic Regression:: PGP Dse Bangalore July 2018
No ratings yet
Logistic Regression:: PGP Dse Bangalore July 2018
62 pages
QTA 18-04-2013 Logistic Regression
No ratings yet
QTA 18-04-2013 Logistic Regression
4 pages
RMSC3001 2023-24 PS2
No ratings yet
RMSC3001 2023-24 PS2
2 pages
Bnad 277 Lab 7 Web
No ratings yet
Bnad 277 Lab 7 Web
3 pages
Frequency Distribution, Cross-Tabulation, and Hypothesis Testing (PPT) 1
No ratings yet
Frequency Distribution, Cross-Tabulation, and Hypothesis Testing (PPT) 1
22 pages
Pima Indians Diabetes Database Analysis - Kaggle
No ratings yet
Pima Indians Diabetes Database Analysis - Kaggle
37 pages
Amr Assignment 2: Logistic Regression On Credit Risk
No ratings yet
Amr Assignment 2: Logistic Regression On Credit Risk
6 pages
Credit Scoring Modelling For Retail Banking Sector
No ratings yet
Credit Scoring Modelling For Retail Banking Sector
9 pages
Classification
No ratings yet
Classification
56 pages
No. of Pages:i: Re-Examination
No ratings yet
No. of Pages:i: Re-Examination
7 pages
Multi Col Linearity
No ratings yet
Multi Col Linearity
37 pages
75.an Approach For Prediction of Loan Approval Using
No ratings yet
75.an Approach For Prediction of Loan Approval Using
5 pages
Research On The Influencing Factors of Personal CR
No ratings yet
Research On The Influencing Factors of Personal CR
12 pages
Topics in Time Series Econometrics PDF
No ratings yet
Topics in Time Series Econometrics PDF
157 pages
CAP5768 Homework3
No ratings yet
CAP5768 Homework3
10 pages
Binary Logistic
No ratings yet
Binary Logistic
29 pages
Group 9
No ratings yet
Group 9
9 pages
Reading Material - Module-5 - Introduction To Special Topics
No ratings yet
Reading Material - Module-5 - Introduction To Special Topics
27 pages
Analytics Report Outline Bnad 277 5
No ratings yet
Analytics Report Outline Bnad 277 5
4 pages
Week 3 Practice Quiz
100% (1)
Week 3 Practice Quiz
10 pages
A Synopsis of The Thesis Project
100% (1)
A Synopsis of The Thesis Project
3 pages
Non Linear Probability Models
No ratings yet
Non Linear Probability Models
18 pages
Excel and R Analysis
No ratings yet
Excel and R Analysis
8 pages
Introduction To The New Statistics Estimation Open Science and Beyond 1st Edition Geoff Cumming All Chapter Instant Download
100% (1)
Introduction To The New Statistics Estimation Open Science and Beyond 1st Edition Geoff Cumming All Chapter Instant Download
55 pages
Capstone Project
100% (1)
Capstone Project
7 pages
Linear Regression and Logit
No ratings yet
Linear Regression and Logit
15 pages
RWS Handout
No ratings yet
RWS Handout
3 pages
Factor Analysis and Dimension Reduction in R A Social Scientists Toolkit G David Garson Download
No ratings yet
Factor Analysis and Dimension Reduction in R A Social Scientists Toolkit G David Garson Download
82 pages
Association Rule in Data Mining
No ratings yet
Association Rule in Data Mining
4 pages
Everitt Cluster Analysis PDF
0% (1)
Everitt Cluster Analysis PDF
2 pages
Assignment 3 F1 - F4
No ratings yet
Assignment 3 F1 - F4
19 pages
BBA 6th Semester Course Outline
No ratings yet
BBA 6th Semester Course Outline
19 pages
Assignment3 05.01.24
No ratings yet
Assignment3 05.01.24
4 pages
Finance
No ratings yet
Finance
24 pages
Akash Final Sip PDF
No ratings yet
Akash Final Sip PDF
51 pages
Teacher Personal and Professiona History
No ratings yet
Teacher Personal and Professiona History
14 pages
Ppa Final Project
No ratings yet
Ppa Final Project
17 pages
Optimization of A Battery Manufacturing Line Using Computer Simulation
No ratings yet
Optimization of A Battery Manufacturing Line Using Computer Simulation
107 pages
Schutt - Qualitative Data Analysis Chapter 10
No ratings yet
Schutt - Qualitative Data Analysis Chapter 10
38 pages
Logistic Regression
No ratings yet
Logistic Regression
41 pages
Orgculturepaper IJICBM
No ratings yet
Orgculturepaper IJICBM
21 pages
A Comprehensive Study On Using Data Mining in ERP Systems
No ratings yet
A Comprehensive Study On Using Data Mining in ERP Systems
7 pages
Decision Analysis Sumendran R Answers
No ratings yet
Decision Analysis Sumendran R Answers
12 pages
Topic 2 Business in Practice and The GRISP-DM Framework
No ratings yet
Topic 2 Business in Practice and The GRISP-DM Framework
22 pages
Summer Training Report
No ratings yet
Summer Training Report
62 pages
Research Paper On Autism
No ratings yet
Research Paper On Autism
11 pages
Aaf Executive Summary
No ratings yet
Aaf Executive Summary
3 pages
2024 From - Authenticity - To - Perceived - Value - Role - of - Souvenir Image and Place Identity On Ceramic Souvenir-Repurchasing Intention
No ratings yet
2024 From - Authenticity - To - Perceived - Value - Role - of - Souvenir Image and Place Identity On Ceramic Souvenir-Repurchasing Intention
24 pages
Correlation 2
No ratings yet
Correlation 2
23 pages
Regression Log
No ratings yet
Regression Log
4 pages
Business Report, DADM-Muthukumar V
No ratings yet
Business Report, DADM-Muthukumar V
11 pages
Adagboyi Esther Pre-Project
No ratings yet
Adagboyi Esther Pre-Project
10 pages
MDU B.Tech CSE 8th Sem Syllabus
No ratings yet
MDU B.Tech CSE 8th Sem Syllabus
7 pages
The Vietnamese Version of The Social and Emotional Competence Questionnaire (Secq) : Psychometric Properties Among Adolescents
No ratings yet
The Vietnamese Version of The Social and Emotional Competence Questionnaire (Secq) : Psychometric Properties Among Adolescents
16 pages
Eller College of Management Professional Admission-Cover Letter
No ratings yet
Eller College of Management Professional Admission-Cover Letter
1 page
Resume For A Strategic Sales and Business Operations Internship at Mars Yamelle Gonzalez
No ratings yet
Resume For A Strategic Sales and Business Operations Internship at Mars Yamelle Gonzalez
1 page
Please Answer The Following Questions in About 250 Words Per Question. Please Have Your Responses Single Space, 12 Point Font
No ratings yet
Please Answer The Following Questions in About 250 Words Per Question. Please Have Your Responses Single Space, 12 Point Font
2 pages
Bulba Code ICE - RLHF Synthetic & Organic Loss
No ratings yet
Bulba Code ICE - RLHF Synthetic & Organic Loss
94 pages
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
From Everand
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
Stuart A. Klugman
4/5 (1)
Solutions Manual to accompany Introduction to Linear Regression Analysis
From Everand
Solutions Manual to accompany Introduction to Linear Regression Analysis
Douglas C. Montgomery
1/5 (1)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Analytics Report: Gray Hunter and John Mcclintock Yamelle Gonzalez Dummy Variables APRIL 14, 2020

Uploaded by

Analytics Report: Gray Hunter and John Mcclintock Yamelle Gonzalez Dummy Variables APRIL 14, 2020

Uploaded by

ANALYTICS REPORT

TO: GRAY HUNTER AND JOHN MCCLINTOCK

FROM: YAMELLE GONZALEZ

SUBJECT: DUMMY VARIABLES

DATE: APRIL 14, 2020

Coefficients Standard Error t Stat P-value Lower 95% Upper 95%

Interpretation of the cand the Standard Error

Interpretation of the Coefficients

Coefficients Standard Error t St at P-value Lower 95% Upper 95%

Interpretation of the R2and the Standard Error

Interpretation of the Coefficients

Average Payment: As average payment amounts increases by $10, chance of defaulting

Age: As age increases by 1 year, chance of defaulting increases by 0.0010 percentage

For the “Default” Model-Prediction of the Chance of Someone Defaulting who is 25

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.