Decision Making Assignment
Decision Making Assignment
FINTECH
ASSIGNMENT 1
NAME – OMKAR RAUT
ROLL NO – 29
LOAN DELIQUENCY PROBLEM
Given:
Total number of customers bank has disbursed loan to between 2013 - 18 – 8,00,000
Loan delinquency rate – 10%
To find:
To identify customers whose account balance falls below a certain criteria
To identify 30,000 customers most likely to miss loan payments in next 3 months to send
timely reminders.
To study data of customers who missed payments (age, income, loan details).
To use customer behavior and characteristics to predict likelihood of missed payments.
To generate rules based on patterns and insights.
To evaluate whether reminders effectively reduced missed payments.
To explore how accurate and reliable are the predictions for real world applicable to real
world insights.
Step 4 : Modeling
1)Analyzing Historical Data
Data Collection: Gather comprehensive historical data on customer characteristics,
transaction history, credit scores, and loan performance.
2) Defining Rules for the Model (Rule-Based Approach)
Objective: Create clear, actionable rules to flag high-risk customers.
Rule 1:
Condition:
Account Balance < 3 times the Monthly Installment
Number of Past Due Payments > 2 in the last 6 months
Credit Score < 600
Action: Flag the customer as High Risk.
Rule 2:
Condition:
Debt-to-Income Ratio > 0.5
Loan Amount > 5 times Monthly Income
Action: Flag the customer as High Risk.
Proactive Measure:
Send timely reminders to flagged customers two weeks before the due installment to
reduce the likelihood of missed payments.
3) Choosing a Statistical Model
Model Selection:
Decision Trees: For classifying customer behavior and identifying high-risk groups through
interpretability.
Logistic Regression: To predict the probability of loan delinquency based on defined
features.
Neural Networks: To capture complex, non-linear patterns and relationships between
customer data and delinquency outcomes.
4) Training the Model
Data Splitting:
Split the dataset into training (80%) and validation (20%) sets to ensure robust model
evaluation.
Feature Matrix:
Define the feature matrix with key variables such as Account Balance, Credit Score, Debt-to-
Income Ratio, Loan Amount, and Past Due Payments.
Model Training:
Train the model on the training dataset.
Use the validation set to evaluate performance and fine-tune hyperparameters.
Model Evaluation:
Evaluate model performance using metrics like Accuracy, Precision, Recall, and F1 Score.
Implement cross-validation to ensure the model generalizes well to unseen data.
Final Testing:
Apply the trained model to the test dataset (remaining 50%) to identify potential loan
delinquents.
Step 6- Experimentation
To understand the quantify actual impact of models in the real world.
i. Defining relevant success metrics (repayment rate)
ii. Dividing population into two random groups (A/B Testing) divide the dataset containing
information on loan holders into two groups: one receives reminders (control group) and
one does not (treatment group)
iii. Execute experiment (in real world i.e. banks, private loan companies)
iv. Measure success metrics : Track delinquency rates, false positive/negative rates, other
KPIs for both groups and compare the delinquency rate in the treatment group with the
control group.
In concluding, I think Big data model will give the accurate result as all possible
consequences and situations have taken into consideration to detect and prevent loan
delinquency.