
22bec097 - Prasham Doshi - Experiment - 6 - ML

Decision tree classifiers with the entropy and Gini criteria are used here directly from the scikit-learn library. Parameters such as min_samples_split, min_samples_leaf, and max_depth are varied on these classifiers, and the resulting trees are plotted to inspect the differences manually. Note that scikit-learn implements an optimized version of CART, so criterion='entropy' and criterion='gini' only approximate the splitting behaviour of ID3 and C4.5; those labels are used loosely throughout this experiment.

The dataset used here is the Iris dataset: https://www.kaggle.com/datasets/uciml/iris
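If the Kaggle CSV is not at hand, the same data ships with scikit-learn. A minimal sketch of the alternative (an assumption-laden substitute, not a drop-in replacement: the bundled frame has different column names and no Id column):

from sklearn.datasets import load_iris

# as_frame=True packs the four measurements and the integer-coded species
# into a single DataFrame, roughly mirroring the Kaggle CSV.
iris = load_iris(as_frame=True)
df_alt = iris.frame
print(df_alt.head())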

Importing the dependencies

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
from sklearn.tree import DecisionTreeClassifier,plot_tree
from sklearn.preprocessing import LabelEncoder

df = pd.read_csv('/content/drive/MyDrive/Colab Notebooks/Data/ML/Iris.csv')

label_encoder = LabelEncoder()
df['Species'] = label_encoder.fit_transform(df['Species'])
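A quick sanity check of the encoding helps when reading the plotted trees later: classes_ lists the original labels in the order of their integer codes.

# Map each integer code back to its species string.
print(dict(enumerate(label_encoder.classes_)))
# Expected for the Kaggle file: {0: 'Iris-setosa', 1: 'Iris-versicolor', 2: 'Iris-virginica'}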

X = df.iloc[:, 1:-1]   # all feature columns, skipping the Id column and the Species target
y = df['Species']

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)
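One optional refinement, not used in the runs below: passing stratify=y preserves the 50/50/50 class balance in both splits, which matters on a dataset this small. A sketch of the stratified variant:

X_train_s, X_test_s, y_train_s, y_test_s = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=42
)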

def evaluate_model_id3(parameter_name, parameter_value):
    # Fit an entropy-criterion tree with one hyperparameter overridden.
    model = DecisionTreeClassifier(criterion='entropy', **{parameter_name: parameter_value}, random_state=42)
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)

    # Plot the fitted tree so the structural effect of the parameter is visible.
    plt.figure(figsize=(16, 10))
    plot_tree(model, feature_names=df.columns[1:-1], class_names=label_encoder.classes_, filled=True)
    plt.title(f"ID3 Decision Tree ({parameter_name} = {parameter_value})")
    plt.show()  # render and release the figure so open figures do not pile up

    return accuracy_score(y_test, y_pred)

def evaluate_model_c45(parameter_name, parameter_value):
    # Fit a Gini-criterion tree with one hyperparameter overridden
    # (splitter='best' is the sklearn default, kept here for explicitness).
    model = DecisionTreeClassifier(criterion='gini', **{parameter_name: parameter_value}, splitter='best', random_state=42)
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)

    plt.figure(figsize=(16, 10))
    plot_tree(model, feature_names=df.columns[1:-1], class_names=label_encoder.classes_, filled=True)
    plt.title(f"C4.5 Decision Tree ({parameter_name} = {parameter_value})")
    plt.show()

    return accuracy_score(y_test, y_pred)
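Since the two helper functions differ only in the impurity criterion, it may help to see what entropy and Gini impurity actually compute for a set of class labels. A minimal sketch, not part of the original experiment:

def entropy(labels):
    # H = -sum(p * log2(p)) over the class proportions p.
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def gini(labels):
    # G = 1 - sum(p^2) over the class proportions p.
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

# For three roughly balanced classes, entropy is near log2(3) ~ 1.585
# and Gini impurity is near 2/3.
print(entropy(y_train), gini(y_train))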

################## VALUES TO TEST ##################

min_samples_splits = [2, 5, 10, 20]
min_samples_leaves = [1, 2, 5, 10]
max_depths = [5, 10, 15, 20]

values_to_try = {
    'min_samples_split': min_samples_splits,
    'min_samples_leaf': min_samples_leaves,
    'max_depth': max_depths
}

# RESULTS
accuracy_id3 = []
accuracy_c45 = []

# LOOP FOR TRYING ALL THE VALUES ON THE SKLEARN MODEL
for parameter_name, parameter_values in values_to_try.items():
    print(f"\nEvaluating parameter: {parameter_name}")
    results_id3 = []
    results_c45 = []

    for value in parameter_values:
        acc_id3 = evaluate_model_id3(parameter_name, value)
        acc_c45 = evaluate_model_c45(parameter_name, value)

        results_id3.append(acc_id3)
        results_c45.append(acc_c45)

        print(f"{parameter_name} = {value} -> ID3 Accuracy: {acc_id3}, C4.5 Accuracy: {acc_c45}")

    accuracy_id3.append(results_id3)
    accuracy_c45.append(results_c45)

# PLOTTING THE VALUES
def plot_parameter_effects(parameter_name, title, index):
    plt.figure(figsize=(10, 6))

    # Plot for ID3
    plt.plot(values_to_try[parameter_name], accuracy_id3[index], label='ID3 (Entropy)', marker='o')

    # Plot for C4.5
    plt.plot(values_to_try[parameter_name], accuracy_c45[index], label='C4.5 (Gini)', marker='o')

    plt.title(f'Effect of {title} on Accuracy')
    plt.xlabel(title)
    plt.ylabel('Accuracy')
    plt.legend()
    plt.grid(True)
    plt.show()

plot_parameter_effects('min_samples_split', 'Min Samples Split', 0)
plot_parameter_effects('min_samples_leaf', 'Min Samples Leaf', 1)
plot_parameter_effects('max_depth', 'Max Depth', 2)

# FINAL TREE (arbitrarily chosen max_depth=5) FITTED AND DISPLAYED WITH plot_tree

final_model_id3 = DecisionTreeClassifier(criterion='entropy', max_depth=5, random_state=42)
final_model_id3.fit(X_train, y_train)

plt.figure(figsize=(16, 10))
plot_tree(final_model_id3, feature_names=df.columns[1:-1], class_names=label_encoder.classes_, filled=True)
plt.title('ID3 Decision Tree (Max Depth = 5)')
plt.show()
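The depth-5 plot can be hard to read; scikit-learn's export_text gives the same splits as indented text rules. A small addition, not part of the original run:

from sklearn.tree import export_text

print(export_text(final_model_id3, feature_names=list(df.columns[1:-1])))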
Output of the evaluation loop:

Evaluating parameter: min_samples_split
min_samples_split = 2 -> ID3 Accuracy: 0.9777777777777777, C4.5 Accuracy: 1.0
min_samples_split = 5 -> ID3 Accuracy: 0.9777777777777777, C4.5 Accuracy: 1.0
min_samples_split = 10 -> ID3 Accuracy: 1.0, C4.5 Accuracy: 1.0
min_samples_split = 20 -> ID3 Accuracy: 0.9777777777777777, C4.5 Accuracy: 1.0

Evaluating parameter: min_samples_leaf
min_samples_leaf = 1 -> ID3 Accuracy: 0.9777777777777777, C4.5 Accuracy: 1.0
min_samples_leaf = 2 -> ID3 Accuracy: 1.0, C4.5 Accuracy: 1.0
min_samples_leaf = 5 -> ID3 Accuracy: 1.0, C4.5 Accuracy: 1.0
min_samples_leaf = 10 -> ID3 Accuracy: 0.9777777777777777, C4.5 Accuracy: 0.9777777777777777

Evaluating parameter: max_depth
max_depth = 5 -> ID3 Accuracy: 0.9777777777777777, C4.5 Accuracy: 1.0
max_depth = 10 -> ID3 Accuracy: 0.9777777777777777, C4.5 Accuracy: 1.0
max_depth = 15 -> ID3 Accuracy: 0.9777777777777777, C4.5 Accuracy: 1.0
max_depth = 20 -> ID3 Accuracy: 0.9777777777777777, C4.5 Accuracy: 1.0
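Across every parameter setting tried, both models stay between 0.9778 and 1.0 accuracy. A caveat when reading these numbers: the test set holds only 45 rows, so accuracy can only move in steps of 1/45 ≈ 0.0222, which is exactly why only two values appear (0.9777... is one misclassified sample, 1.0 is none). Cross-validation gives a steadier estimate; a minimal sketch, reusing the X and y defined above:

from sklearn.model_selection import cross_val_score

# 5-fold CV on the entropy tree; the mean and spread are more informative
# than a single 45-sample test split.
cv_model = DecisionTreeClassifier(criterion='entropy', max_depth=5, random_state=42)
scores = cross_val_score(cv_model, X, y, cv=5)
print(scores.mean(), scores.std())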
