
Experiment 8

Develop a program to demonstrate the working of the decision tree algorithm. Use the Breast Cancer dataset to build the decision tree, and apply this knowledge to classify a new sample.

Introduction to Decision Trees


What is a Decision Tree?
A Decision Tree is a supervised machine learning algorithm used for classification and regression tasks. It models decisions using a tree-like structure where:

Nodes represent decision points based on feature values.
Edges represent possible outcomes (branches).
Leaves represent the final decision or classification.

Decision trees work by recursively splitting the data into subsets based on the most informative feature, greedily maximizing information gain (or purity) at each split.

Working of the Decision Tree Algorithm


1. Selecting the Best Feature for Splitting
At each step, the algorithm selects the feature that best separates the data. Common methods for choosing the best feature include the following (a short sketch of the first two follows this list):

Gini Impurity
Gini = 1 − Σ pᵢ²

Measures how often a randomly chosen element would be incorrectly classified; pᵢ is the proportion of class i in the node.

Entropy (Information Gain)
Entropy = −Σ p(X) log₂ p(X)

Measures the uncertainty in a dataset; the algorithm selects the split that maximizes information gain, i.e., the reduction in entropy.

Chi-Square Test
Evaluates the statistical significance of a feature split.
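To make the two impurity measures concrete, here is a minimal NumPy sketch (illustrative only, not part of the assigned experiment code; the label vector is a made-up example):

import numpy as np

def gini(labels):
    # Gini = 1 - sum(p_i^2) over the class proportions p_i
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1 - np.sum(p ** 2)

def entropy(labels):
    # Entropy = -sum(p * log2(p)) over the class proportions p
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

labels = np.array([1, 1, 1, 0, 0])   # e.g., 3 malignant, 2 benign
print(gini(labels))                  # 1 - (0.6^2 + 0.4^2) = 0.48
print(entropy(labels))               # ≈ 0.971 bits

A node containing only one class gives Gini = 0 and Entropy = 0; a 50/50 split gives the maximum values (0.5 and 1 bit, respectively).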

2. Splitting the Data

The dataset is divided into subsets based on the selected feature. The process continues recursively until a stopping condition is met, for example:

A node is pure (all samples belong to one class).
The tree reaches a predefined maximum depth.

3. Making Predictions
For a new sample, traverse the tree from the root down to a leaf node; the leaf node contains the predicted class label. A toy sketch of this traversal follows.
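The traversal can be pictured with a toy tree represented as nested dictionaries. The feature names and thresholds here are hypothetical, chosen only to illustrate the root-to-leaf walk:

# A toy fitted tree (hypothetical features and thresholds)
tree = {
    "feature": "radius_mean", "threshold": 15.0,
    "left":  {"feature": "texture_mean", "threshold": 20.0,
              "left":  {"label": "Benign"},
              "right": {"label": "Malignant"}},
    "right": {"label": "Malignant"},
}

def classify(node, sample):
    # Walk from the root, branching on each node's test, until a leaf is reached
    while "label" not in node:
        branch = "left" if sample[node["feature"]] <= node["threshold"] else "right"
        node = node[branch]
    return node["label"]

print(classify(tree, {"radius_mean": 12.0, "texture_mean": 18.0}))  # Benign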

Advantages of Decision Trees


✔ Easy to interpret – Mimics human decision-making.
✔ Handles both numerical & categorical data.
✔ Requires little data preprocessing – No need for feature scaling.
✔ Handles missing values in some implementations.

Challenges of Decision Trees


❌ Overfitting – Deep trees may memorize noise instead of patterns.
❌ Bias towards dominant features – Features with more categories can lead to
biased splits.
❌ Instability – Small data variations can lead to different trees.

Optimizing Decision Trees

1. Pruning

Pre-Pruning: Stop the tree early using conditions (e.g., minimum samples per split).
Post-Pruning: Remove unnecessary branches after the tree is built.

2. Setting Tree Depth

Limiting the maximum depth prevents overfitting.

3. Using Ensemble Methods

Random Forest: Combines multiple trees for better generalization.
Gradient Boosting: Sequentially improves predictions.

A scikit-learn sketch of these options appears below.
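The following sketch shows all three optimizations in scikit-learn, using the library's built-in copy of the Breast Cancer dataset. The parameter values are illustrative, not tuned:

from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier, GradientBoostingClassifier

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=42)

models = {
    # Pre-pruning: stop growth early with depth/sample limits
    "pre-pruned tree": DecisionTreeClassifier(max_depth=4, min_samples_split=10),
    # Post-pruning: cost-complexity pruning controlled by ccp_alpha
    "post-pruned tree": DecisionTreeClassifier(ccp_alpha=0.01),
    # Ensembles: many trees generalize better than a single tree
    "random forest": RandomForestClassifier(n_estimators=100, random_state=42),
    "gradient boosting": GradientBoostingClassifier(n_estimators=100, learning_rate=0.1),
}

for name, clf in models.items():
    clf.fit(X_tr, y_tr)
    print(name, round(clf.score(X_te, y_te), 3))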
Applications of Decision Trees

Medical Diagnosis – Classifying diseases based on symptoms.
Fraud Detection – Identifying fraudulent transactions.
Customer Segmentation – Categorizing users based on behavior.
# Importing necessary libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, plot_tree, export_graphviz
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

from IPython.display import Image
import pydotplus

import warnings
warnings.filterwarnings('ignore')

# Load the dataset (filename is a placeholder; the original path was elided)
data = pd.read_csv('breast_cancer.csv')

data.head()              # preview the first five rows
data.shape               # (rows, columns)
data.info()              # column types and non-null counts
data.diagnosis.unique()  # target classes: 'M' (malignant), 'B' (benign)
data.isnull().sum()      # count missing values per column
df = data.drop(['id'], axis=1)
df['diagnosis'] = df['diagnosis'].map({'M':1, 'B':0}) # Malignant:1, Benign:0

#Model Building
X = df.drop('diagnosis', axis=1) # Drop the 'diagnosis' column (target)
y = df['diagnosis']
# Split the dataset into training and testing sets (80% train, 20% test)
X_train, X_test, y_train, y_test = train_test_split(X,y,test_size=0.2, random_state=42)

# Fit the decision tree model
model = DecisionTreeClassifier(criterion='entropy')  # criterion: 'gini' or 'entropy'
model.fit(X_train, y_train)

# Predict on the test set
y_pred = model.predict(X_test)
# Evaluate the model
accuracy = accuracy_score(y_test, y_pred) * 100
classification_rep = classification_report(y_test, y_pred)

# Print the results
print("Accuracy:", accuracy)
print("Classification Report:\n", classification_rep)
print("Confusion Matrix:\n", confusion_matrix(y_test, y_pred))

# A new sample: 30 feature values in the same column order as X
new = [[12.5, 19.2, 80.0, 500.0, 0.085, 0.1, 0.05, 0.02, 0.17, 0.06,
        0.4, 1.0, 2.5, 40.0, 0.006, 0.02, 0.03, 0.01, 0.02, 0.003,
        16.0, 25.0, 105.0, 900.0, 0.13, 0.25, 0.28, 0.12, 0.29, 0.08]]
y_pred = model.predict(new)

# Output the prediction (0 = Benign, 1 = Malignant)
if y_pred[0] == 0:
    print("Prediction: Benign")
else:
    print("Prediction: Malignant")

# Visualize the Decision Tree (optional)
plt.figure(figsize=(12, 8))
plot_tree(model, filled=True, feature_names=X.columns, class_names=['Benign', 'Malignant'])
plt.show()

# Export the tree to DOT format
dot_data = export_graphviz(model, out_file=None,
                           feature_names=X_train.columns,
                           rounded=True, proportion=False,
                           precision=2, filled=True)

# Convert DOT data to a graph
graph = pydotplus.graph_from_dot_data(dot_data)

# Display the graph
Image(graph.create_png())
