0% found this document useful (0 votes)

8 views2 pages

VL2024250504566 Ast03

The document outlines an assessment for a B.Tech (CSE) Data Mining Lab course, focusing on implementing and evaluating classification algorithms (ID3 Decision Tree, CART, and Naïve Bayes) and customer segmentation using clustering algorithms (K-Means, K-Medoids, Hierarchical Clustering). Students are required to use real-world datasets, preprocess the data, and evaluate model performance using various metrics and visualizations. Additionally, a comparative analysis of the clustering algorithms' performance is to be conducted.

Uploaded by

jee2022.acc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views2 pages

VL2024250504566 Ast03

Uploaded by

jee2022.acc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

SLOT – L43+L44

SCHOOL OF COMPUTER SCIENCE AND ENGINEERING

ASSESSMENT – III – WINTER S E M E S T E R 2024-2025
Programme Name & Branch: B.Tech (CSE) Course Name: Data Mining Lab
Course Code: BCSE208P

Objective: To Implement and Evaluate Classification Algorithms (Decision Tree

and Naïve Bayes)
1. Implement and Evaluate Classification Algorithms:
Dataset: Use the dataset (available from Kaggle or UCI Machine
Learning Repository). Consider external dataset containing real-world data (e.g.,
customer churn prediction, medical diagnosis, or credit risk classification). The
dataset includes both categorical and numerical attributes.
Load and preprocess the dataset.
Split the dataset into training and testing sets (80% training, 20% testing).
Implement the following classification algorithms:
• ID3 Decision Tree: Implement the Iterative Dichotomiser 3 (ID3) algorithm
using an existing Python library or from scratch if desired.
• CART Decision Tree: Implement the Classification and Regression Tree
(CART) algorithm, which uses the Gini Index for splitting.
• Naïve Bayes Classifier: Implement the Gaussian Naïve Bayes or Multinomial
Naïve Bayes based on the dataset's characteristics.
Evaluate Model Performance:
• Compute accuracy, precision, recall, F1-score, and confusion matrix for each
classifier.
• Visualize the decision trees (ID3 and CART) to analyze how the models
make decisions.
• Use ROC-AUC curves to compare classifier performance.
2. Customer Segmentation using Clustering Algorithms
Dataset:
Use a combination of datasets to create a comprehensive customer profile. You
can source datasets from:
• Kaggle (e.g., customer transaction data, customer demographics)
• UCI Machine Learning Repository (e.g., online retail data)
Combine at least two datasets to enrich the feature set for each customer. Ensure
that the combined dataset has a minimum of 10,000 data points.
Objective:
Perform customer segmentation to identify distinct groups of customers based on
their purchasing behavior, demographics, and other relevant features.
Instructions:
Data Preparation:
• Load and merge the datasets into a single Pandas DataFrame.
• Handle missing values appropriately (e.g., imputation or removal).
• Encode categorical variables using techniques like one-hot encoding or
label encoding.
• Scale the numerical features using StandardScaler or MinMaxScaler.
Implement the following clustering algorithms:
• K-Means
• K-Medoids
• Hierarchical Clustering
For each algorithm:
• Determine the optimal number of clusters using appropriate methods such
as the elbow method, silhouette score3, or dendrograms.
• Train the model on the preprocessed dataset using the determined optimal
number of clusters.
• Assign each data point to a cluster.
• Visualize the clusters using dimensionality reduction techniques (e.g.,
PCA or t-SNE) for higher-dimensional data.
• Evaluate the clustering performance using appropriate metrics such as
silhouette score3.
Comparative Analysis:
Compare the performance of the three algorithms and visualize the results.

^^^^^^^^^^^^^^^^^^^^^^^^^^^

Segmentation Analysis
No ratings yet
Segmentation Analysis
17 pages
Customer Segmentation Report
No ratings yet
Customer Segmentation Report
8 pages
Ads Phase 4
No ratings yet
Ads Phase 4
12 pages
Customer Segmentation
No ratings yet
Customer Segmentation
9 pages
ADS Phase4
No ratings yet
ADS Phase4
21 pages
CSUDS Project
No ratings yet
CSUDS Project
13 pages
Ads Phase 5
No ratings yet
Ads Phase 5
23 pages
Aiml Project Review
No ratings yet
Aiml Project Review
22 pages
Workshop Project Report
No ratings yet
Workshop Project Report
10 pages
Phase 1
No ratings yet
Phase 1
4 pages
Another Project-Creating Customer Segments
No ratings yet
Another Project-Creating Customer Segments
31 pages
Varshini Phase 2
No ratings yet
Varshini Phase 2
19 pages
Daa 01
No ratings yet
Daa 01
11 pages
Customer Segmentation Using K
No ratings yet
Customer Segmentation Using K
16 pages
DWDM PPT
No ratings yet
DWDM PPT
13 pages
Review2 A15
No ratings yet
Review2 A15
14 pages
ML Review PPT 2
No ratings yet
ML Review PPT 2
29 pages
Energy Consumption Prediction System
No ratings yet
Energy Consumption Prediction System
21 pages
Each Stage of A Data Mining Project
No ratings yet
Each Stage of A Data Mining Project
5 pages
Machine Learning Project Report - Customer Segmentation
No ratings yet
Machine Learning Project Report - Customer Segmentation
2 pages
Soln Architecture11.
No ratings yet
Soln Architecture11.
5 pages
Machine Learning Project Report - Customer Segmentation
No ratings yet
Machine Learning Project Report - Customer Segmentation
2 pages
DS MP
No ratings yet
DS MP
18 pages
Clusturing Algorithms For Customer Segmentation
No ratings yet
Clusturing Algorithms For Customer Segmentation
35 pages
EE - 353 - 769 A4 Unsupervised Learning
No ratings yet
EE - 353 - 769 A4 Unsupervised Learning
1 page
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
15 pages
Verapandi
No ratings yet
Verapandi
4 pages
Customer Segmentation New
No ratings yet
Customer Segmentation New
11 pages
Proposed System and Methodology Part 2
No ratings yet
Proposed System and Methodology Part 2
42 pages
Assignment 1
No ratings yet
Assignment 1
4 pages
A Beginner's Guide To Customer Segmentation With Python - by Sigli Mumuni - Medium
No ratings yet
A Beginner's Guide To Customer Segmentation With Python - by Sigli Mumuni - Medium
14 pages
IIM PBA Assignment 2
No ratings yet
IIM PBA Assignment 2
3 pages
Machine Learning - Customer Segment Project. Approved by UDACITY
100% (1)
Machine Learning - Customer Segment Project. Approved by UDACITY
19 pages
Chapter 1,2 Report
No ratings yet
Chapter 1,2 Report
5 pages
DW&DM PROJECT Sawan
No ratings yet
DW&DM PROJECT Sawan
14 pages
Ajithkumar - Inframind Season
No ratings yet
Ajithkumar - Inframind Season
12 pages
Customer Segmentation 2
No ratings yet
Customer Segmentation 2
19 pages
Cours 3 - TP
No ratings yet
Cours 3 - TP
3 pages
22-cp-57 Assignment #02
No ratings yet
22-cp-57 Assignment #02
5 pages
22-CP-63 ML Assignment Report
No ratings yet
22-CP-63 ML Assignment Report
5 pages
Customer Segmentation IEEE Report
No ratings yet
Customer Segmentation IEEE Report
2 pages
Varshini Phase 3
No ratings yet
Varshini Phase 3
12 pages
Lecture - 7 - Practical - DBSCAN Clustering in Python
No ratings yet
Lecture - 7 - Practical - DBSCAN Clustering in Python
3 pages
MiniProject (1) .PPTX LPPT
No ratings yet
MiniProject (1) .PPTX LPPT
11 pages
Final Review Batch 07
No ratings yet
Final Review Batch 07
30 pages
Abstract (1) - 1
No ratings yet
Abstract (1) - 1
3 pages
DM Lab Report
No ratings yet
DM Lab Report
13 pages
Phase 2
No ratings yet
Phase 2
5 pages
Behavioural Customer Segmentation Based
No ratings yet
Behavioural Customer Segmentation Based
7 pages
In Tenshi PPP Tte Jum Am
No ratings yet
In Tenshi PPP Tte Jum Am
23 pages
Phase 2
No ratings yet
Phase 2
17 pages
Advanced Data Science Project Report
No ratings yet
Advanced Data Science Project Report
3 pages
Customer Segmentation Project Plan
No ratings yet
Customer Segmentation Project Plan
2 pages
Laboratory Work 6
No ratings yet
Laboratory Work 6
4 pages
DWDMPROJECTREPORT
No ratings yet
DWDMPROJECTREPORT
9 pages
Customer Segmentation Literature Review 1
No ratings yet
Customer Segmentation Literature Review 1
8 pages
Ankit Survey Paper
No ratings yet
Ankit Survey Paper
6 pages
Digital Logic Design Assignment
No ratings yet
Digital Logic Design Assignment
2 pages
Apps & Webportals 2024 - April To June - Topic-Wise PDF by AffairsCloud 2
No ratings yet
Apps & Webportals 2024 - April To June - Topic-Wise PDF by AffairsCloud 2
13 pages
Ii-Ii Supply Paid List Dec-2024 (R18)
No ratings yet
Ii-Ii Supply Paid List Dec-2024 (R18)
9 pages
AUTOSAR FO RS ProjectObjectives
No ratings yet
AUTOSAR FO RS ProjectObjectives
14 pages
Programming ESP-12E - ESP-12F - NodeMCU With Arduino IDE - Circuit Journal
No ratings yet
Programming ESP-12E - ESP-12F - NodeMCU With Arduino IDE - Circuit Journal
18 pages
Pa600 UpgradeManual v2.1 EFGIC
No ratings yet
Pa600 UpgradeManual v2.1 EFGIC
38 pages
Solution Analyzing A Forecasting Data Source
100% (1)
Solution Analyzing A Forecasting Data Source
5 pages
Cis Kilimanjaro With Server Details
No ratings yet
Cis Kilimanjaro With Server Details
3 pages
Amazon Application Engineer - JD
No ratings yet
Amazon Application Engineer - JD
2 pages
MTH001 Final Term Current
No ratings yet
MTH001 Final Term Current
14 pages
Keshav Sharma Aset It
No ratings yet
Keshav Sharma Aset It
12 pages
Assignment 1 Advanced Programming
No ratings yet
Assignment 1 Advanced Programming
37 pages
Trace - 2023-09-15 16 - 28 - 52 500
No ratings yet
Trace - 2023-09-15 16 - 28 - 52 500
2 pages
Articulo 1
No ratings yet
Articulo 1
12 pages
PPDS OSS Restriction Maintenance For Model Mix Planning SAPAPO RET2
No ratings yet
PPDS OSS Restriction Maintenance For Model Mix Planning SAPAPO RET2
2 pages
PS DBM Cebu CNAS 6-18-2024
No ratings yet
PS DBM Cebu CNAS 6-18-2024
5 pages
ISP 39 - Joining Letter
No ratings yet
ISP 39 - Joining Letter
4 pages
CSS Electronics Products
No ratings yet
CSS Electronics Products
11 pages
Pro Swift - Break Out of Beginner's Swift With This Hands-On Guide - PDF Room
No ratings yet
Pro Swift - Break Out of Beginner's Swift With This Hands-On Guide - PDF Room
265 pages
Sample of Writing Cover Letter For Job Application
100% (1)
Sample of Writing Cover Letter For Job Application
6 pages
Software Project Management Unit-3 - 1 PDF
No ratings yet
Software Project Management Unit-3 - 1 PDF
2 pages
Nabl Test Report Cross Verification Methodology and Steps For Report Authenticity
No ratings yet
Nabl Test Report Cross Verification Methodology and Steps For Report Authenticity
4 pages
COSF 327 INFORMATION SECURITY AND AUDIT CONTROL - Kabarak University
No ratings yet
COSF 327 INFORMATION SECURITY AND AUDIT CONTROL - Kabarak University
3 pages
Telit Le920-Family Datasheet
No ratings yet
Telit Le920-Family Datasheet
2 pages
Acces Problem - Agilent Cytogenomics 5.0: Skipalova, Karolina (Agilent Informatics Support)
No ratings yet
Acces Problem - Agilent Cytogenomics 5.0: Skipalova, Karolina (Agilent Informatics Support)
4 pages
Photo Contest Criteria and Guidelines
No ratings yet
Photo Contest Criteria and Guidelines
2 pages
IOM - Gas Detector - AE - AIYI - EN Manual-AG200 Series Fixed Gas Detector - AnrN - AnrS
No ratings yet
IOM - Gas Detector - AE - AIYI - EN Manual-AG200 Series Fixed Gas Detector - AnrN - AnrS
24 pages
External Optical Drive Case
No ratings yet
External Optical Drive Case
2 pages
Dms-Mba Data Analytics-Syllabus
No ratings yet
Dms-Mba Data Analytics-Syllabus
103 pages
Stm32f411ve Errata
No ratings yet
Stm32f411ve Errata
32 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

VL2024250504566 Ast03

Uploaded by

VL2024250504566 Ast03

Uploaded by

SLOT – L43+L44

SCHOOL OF COMPUTER SCIENCE AND ENGINEERING

Objective: To Implement and Evaluate Classification Algorithms (Decision Tree

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.