Capstone Project
Capstone Project
Batch ID: 17
Batch Members
● Ashutosh Das (8249198088)
● Suchismita Sahoo (7853937873)
● Yoshobanta Bisoi (7008209672)
● Sunay Lahiri (8847856525)
● Eski Palai (9348694971)
MOTIVATION & OBJECTIVE
MOTIVATION:
● Cost of Customer Acquisition vs. Retention: Retaining existing customers is more cost-effective
than acquiring new ones, saving businesses significant resources.
● Business Impact of Churn: High churn rates lead to revenue losses, making proactive churn
management essential for business stability.
● Actionable Insights: Predictive models help identify at-risk customers, enabling targeted retention
strategies to reduce churn.
● Scalability Across Industries: Churn prediction can be applied across sectors like telecom,
banking, and e-commerce, benefiting diverse businesses
OBJECTIVE:
● Predict Customer Churn: Develop a machine learning model to accurately classify
whether a customer is likely to churn or not.
● Analyze Key Factors: Identify the primary features contributing to customer churn for
actionable insights.
● Improve Business Retention Strategies: Provide businesses with data-driven
recommendations to reduce churn and enhance customer satisfaction.
● Model Performance Comparison: Evaluate and compare different machine learning
models to identify the most effective one for churn prediction.
Source: Kaggle
Purpose: Customer Churn Prediction
Size and Structure:
● Rows: 7,043
● Columns: 21
Key Features:
Data Preprocessing:
Data Preprocessing:
● Visualized distributions of key variables (e.g., churn vs. non-churn, monthly charges,
customer tenure) to understand patterns and relationships.
● Examined correlations between features to identify which factors contribute most to
customer churn.
Model Selection:
Model Evaluation:
● Evaluated models using performance metrics like Accuracy, Precision, Recall, F1-Score,
and ROC-AUC to select the best-performing model.
● Used Cross-Validation to prevent overfitting and ensure that the model generalizes well to
unseen data.
Prediction:
● The best-performing model was used to predict customer churn on new, unseen data.
Results were presented using a confusion matrix to evaluate classification accuracy.
About 75% of customer with Month-to-Month Contract opted to move out as compared
to 13% of customers with One Year Contract and 3% with Two Year Contract
Major customers who moved out were having Electronic Check as Payment Method.
Customers who opted for Credit-Card automatic transfer or Bank Automatic Transfer and Mailed Check
as Payment Method were less likely to move out.
A lot of customers choose the Fiber optic service and it's also evident that the customers who use Fiber
optic have high churn rate, this might suggest a dissatisfaction with this type of internet service.
Customers having DSL service are majority in number and have less churn rate compared to Fibre optic
service.
SOFTWARE SPECIFICATION:
SOFTWARE TOOLS :Python 3.10
Jupyter Notebook / IDE (e.g., VSCode, PyCharm) for execution
PROGRAMMING LANGUAGE:Python
● The machine learning models were evaluated using key metrics like accuracy and confusion matrix.
● The most accurate model successfully identified a majority of churn and non-churn customers, as
highlighted in the confusion matrix.
Key Observations:
● Non-Churn Prediction: The model correctly predicted 1,400 out of 1,549 actual non-churn values.
● Churn Prediction: For churn customers, 324 out of 561 were accurately identified.
● Challenges: Misclassification of 237 churn customers as non-churn remains a challenge that could be
improved with advanced techniques or additional data.
Business Implications:
● Customer Retention: By identifying at-risk customers, companies can design targeted retention
strategies, such as improving customer service or offering personalized incentives.
● Proactive Measures: Analyzing reasons for customer dissatisfaction can help businesses address
churn factors early and enhance customer satisfaction.
● Feedback and Loyalty: Surveys and loyalty programs can build stronger customer relationships and
reduce churn rates over time.
Future Scope:
● Explore deep learning techniques or ensemble models to further improve prediction accuracy.
● Perform feature engineering to capture additional factors influencing customer behavior.
Foundation For Innovation and Technology Transfer, IIT,
Delhi
17
SNAPSHOT
Data Snapshot: