0% found this document useful (0 votes)
22 views10 pages

Eda - Assignment 1 (Final)

Uploaded by

rodiistar1024
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views10 pages

Eda - Assignment 1 (Final)

Uploaded by

rodiistar1024
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

EDA ASSIGNMENT

ANALYSIS
EDA Loan DATA
(EDA) Assignment – Yeswanth Chippada
Problem statement
Our goal is to analyze and understand the data and identify the
potential customers and defaulters based on target variable
Target Variable -
The client with payment difficulties (1) : he/she had late payment more
than X days on at least one of the first Y instalments of the loan in our
sample.
All other cases (0) : All other cases when the payment is paid on time.
We need to be able to find the people who might default and people
who can repay so people who default shouldn’t get loan and people
who can repay shouldn’t miss the loan.
This will help the banks to stay safe financially without incurring any
losses in the future.
EDA ASSIGNMENT

Approach
• We read the data and found null values in some of those columns
• There were some columns with house dimensions and related data which was not
needed for us and consisted of more than 50% of null values so dropped
• EXT_sourse had no correlation with Target so we removed those
• FLAG_DOCC – except 3 remaining had no correlation
• Dates were in –ve so converted to positive
• We separated data into Defaulters and Non-Defaulters for analysis
• Created a loop for Bivariate analysis wrt to TARGET
• And used Heatmaps for correlation
• Countplots for Trends in Data
• Scatterplots on Numerical Bivariate
• Used hue – TARGET for better understanding
• Conclusions were based on graphs outcomes
UNIVARIATE ANALYSIS
UNIVARIATE ANALYSIS
BIVARIATE ANALYSIS
BIVARIATE ANALYSIS
EDA ASSIGNMENT

Conclusion
EDA – With the help of data
visualization we draw insights, it is
more about studying and
understanding the data in detail.
EDA ASSIGNMENT

Non-defaulters
• Children count - Zero is safe
• Gender - Females are comparitively better
• Age - Above 55 has less likely to default
• Organization Type - Trade-4, Industry-8, Trade-5, Religion have very less likely to
default
• Region Rating - 1 is Safe
• Family - Less than 3
• Income Type - Businessmen and Students are safe
• Education - Academic Degree
EDA ASSIGNMENT

defaulters
• Children count - More than 4 is risky and above 8 are No
• Gender - Men are comparitively more
• Age - below 45 are likely to default more
• Organization Type - Self employed are likely
• Region Rating - 3 is Risky
• Family - More than 8 are likely
• Income Type - Maternity Leave , Unemployed
• Education - Lower secoundry ,Secoundry

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy