Comparison of Classifiers

The document analyzes the performance of three classification algorithms (Decision Tree, K-Nearest Neighbors, and Logistic Regression) on the Iris dataset. Each classifier underwent 5-fold cross-validation, with KNN and Logistic Regression achieving the highest accuracy of 0.97, while the Decision Tree achieved 0.96. A summary of the cross-validation accuracies for all classifiers is provided at the end.

Uploaded by

pnagakalyan.aiml

Comparison_of_Classifiers

November 5, 2024

[1]: # Performance analysis of classification algorithms on the Iris dataset

[2]: import numpy as np

import pandas as pd
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

import matplotlib.pyplot as plt

import seaborn as sns
from sklearn.model_selection import cross_val_score

[3]: # The Iris dataset is a classic dataset in machine learning and statistics,
# widely used for classification tasks.

# Features of the Iris Dataset

# The dataset consists of four features measured for each flower sample:
# 1. Sepal Length: The length of the sepal in centimeters.
# 2. Sepal Width: The width of the sepal in centimeters.
# 3. Petal Length: The length of the petal in centimeters.
# 4. Petal Width: The width of the petal in centimeters.

# Species of Iris Flowers

# The dataset includes three species of the Iris flower:

# 1. Iris Setosa: Typically has the shortest and narrowest petals; its sepals are
#    shorter but wider than those of the other two species.
# 2. Iris Versicolor: Intermediate flower size compared to Setosa and Virginica.
# 3. Iris Virginica: Usually has the longest sepals and the largest petals among
#    the three species.

# - Total Samples: 150

# - Samples per Species: 50 for each species
# - Total Features: 4 (Sepal Length, Sepal Width, Petal Length, Petal Width)
# - Target Variable: Species of the Iris flower (Setosa, Versicolor, Virginica)

[4]: # Load the Iris dataset
data = load_iris()
X = data.data # Features
y = data.target # Target labels
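
A quick sanity check of the loaded data can confirm the figures quoted in the description above (150 samples, 4 features, 50 samples per species). This is a minimal sketch, not part of the original notebook; the variable name `class_counts` is introduced here for illustration.

```python
import numpy as np
from sklearn.datasets import load_iris

data = load_iris()
X, y = data.data, data.target

# Shape of the feature matrix: (n_samples, n_features)
print("Feature matrix shape:", X.shape)

# Names shipped with the dataset object
print("Features:", data.feature_names)
print("Classes:", list(data.target_names))

# Number of samples per class (labels are the integers 0, 1, 2)
class_counts = np.bincount(y)
print("Samples per class:", class_counts)
```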

[5]: # Dictionary to store accuracy results for final comparison

accuracy_results = {}

[6]: # 1. Decision Tree Classifier

decision_tree = DecisionTreeClassifier()

# Perform 5-fold cross-validation

dt_scores = cross_val_score(decision_tree, X, y, cv=5)
dt_accuracy = dt_scores.mean()
accuracy_results["Decision Tree"] = dt_accuracy

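# Fit on the full dataset and predict on the same data. The report below
# therefore measures training-set performance, not generalization.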

decision_tree.fit(X, y)
dt_y_pred = decision_tree.predict(X)

# Print results
print("\nDecision Tree Results:")
print(f"Cross-Validation Accuracy: {dt_accuracy:.2f}")
print("Classification Report:\n",
      classification_report(y, dt_y_pred, target_names=data.target_names, zero_division=1))

# Confusion matrix and plot

dt_conf_matrix = confusion_matrix(y, dt_y_pred)
plt.figure(figsize=(6, 4))
sns.heatmap(dt_conf_matrix, annot=True, fmt="d", cmap="Blues",
            xticklabels=data.target_names, yticklabels=data.target_names)
plt.title("Decision Tree - Confusion Matrix")
plt.xlabel("Predicted")
plt.ylabel("Actual")
plt.show()

Decision Tree Results:
Cross-Validation Accuracy: 0.96
Classification Report:
               precision    recall  f1-score   support

      setosa       1.00      1.00      1.00        50
  versicolor       1.00      1.00      1.00        50
   virginica       1.00      1.00      1.00        50

    accuracy                           1.00       150
   macro avg       1.00      1.00      1.00       150
weighted avg       1.00      1.00      1.00       150
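
The cross-validation accuracy (0.96) and the classification report (1.00 everywhere) disagree because the report is computed on the same samples the tree was fitted on. One way to get a report that matches the cross-validated estimate is to collect out-of-fold predictions. The sketch below assumes `cross_val_predict` from scikit-learn; the names `oof_pred`, `oof_accuracy`, and `train_accuracy`, and the fixed `random_state=0`, are choices made for this illustration and do not appear in the original notebook.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score, classification_report
from sklearn.model_selection import cross_val_predict
from sklearn.tree import DecisionTreeClassifier

data = load_iris()
X, y = data.data, data.target

# random_state is an assumption added here so repeated runs give the same tree
tree = DecisionTreeClassifier(random_state=0)

# Each sample is predicted by a model that never saw it during fitting
oof_pred = cross_val_predict(tree, X, y, cv=5)
oof_accuracy = accuracy_score(y, oof_pred)

# For comparison: accuracy on the data the tree was fitted on
train_accuracy = tree.fit(X, y).score(X, y)

print(f"Out-of-fold accuracy: {oof_accuracy:.2f}")
print(f"Training accuracy:    {train_accuracy:.2f}")
print(classification_report(y, oof_pred, target_names=data.target_names))
```

The same pattern applies to the other two classifiers: pass the out-of-fold predictions to `classification_report` and `confusion_matrix` instead of predictions made on the training data.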

[7]: # 2. K-Nearest Neighbors Classifier

knn = KNeighborsClassifier(n_neighbors=3)

# Perform 5-fold cross-validation

knn_scores = cross_val_score(knn, X, y, cv=5)
knn_accuracy = knn_scores.mean()
accuracy_results["K-Nearest Neighbors"] = knn_accuracy

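# Fit on the full dataset and predict on the same data. The report below
# therefore measures training-set performance, not generalization.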

knn.fit(X, y)
knn_y_pred = knn.predict(X)

# Print results
print("\nK-Nearest Neighbors Results:")
print(f"Cross-Validation Accuracy: {knn_accuracy:.2f}")
print("Classification Report:\n",
      classification_report(y, knn_y_pred, target_names=data.target_names, zero_division=1))

# Confusion matrix and plot

knn_conf_matrix = confusion_matrix(y, knn_y_pred)
plt.figure(figsize=(6, 4))
sns.heatmap(knn_conf_matrix, annot=True, fmt="d", cmap="Blues",
            xticklabels=data.target_names, yticklabels=data.target_names)
plt.title("K-Nearest Neighbors - Confusion Matrix")
plt.xlabel("Predicted")
plt.ylabel("Actual")
plt.show()

K-Nearest Neighbors Results:
Cross-Validation Accuracy: 0.97
Classification Report:
               precision    recall  f1-score   support

      setosa       1.00      1.00      1.00        50
  versicolor       0.94      0.94      0.94        50
   virginica       0.94      0.94      0.94        50

    accuracy                           0.96       150
   macro avg       0.96      0.96      0.96       150
weighted avg       0.96      0.96      0.96       150
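
The notebook fixes `n_neighbors=3` without comparing alternatives. A small sweep over several values of k, scored with the same 5-fold cross-validation, shows how sensitive the result is to that choice. This is a sketch only; the candidate list `k_values` and the names `mean_scores` and `best_k` are assumptions made for illustration.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# Candidate neighbor counts (an arbitrary illustrative selection)
k_values = [1, 3, 5, 7, 9, 11]

mean_scores = {}
for k in k_values:
    scores = cross_val_score(KNeighborsClassifier(n_neighbors=k), X, y, cv=5)
    mean_scores[k] = scores.mean()
    print(f"k={k:2d}: mean CV accuracy = {mean_scores[k]:.3f}")

# Value of k with the highest mean cross-validation accuracy
best_k = max(mean_scores, key=mean_scores.get)
print("Best k in this sweep:", best_k)
```

Selecting k on the same folds that are used to report accuracy gives a slightly optimistic estimate; a held-out test set or nested cross-validation would avoid that.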

[8]: # 3. Logistic Regression Classifier
logistic_regression = LogisticRegression(max_iter=200)

# Perform 5-fold cross-validation

lr_scores = cross_val_score(logistic_regression, X, y, cv=5)
lr_accuracy = lr_scores.mean()
accuracy_results["Logistic Regression"] = lr_accuracy

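# Fit on the full dataset and predict on the same data. The report below
# therefore measures training-set performance, not generalization.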

logistic_regression.fit(X, y)
lr_y_pred = logistic_regression.predict(X)

# Print results
print("\nLogistic Regression Results:")
print(f"Cross-Validation Accuracy: {lr_accuracy:.2f}")
print("Classification Report:\n",
      classification_report(y, lr_y_pred, target_names=data.target_names, zero_division=1))

# Confusion matrix and plot

lr_conf_matrix = confusion_matrix(y, lr_y_pred)
plt.figure(figsize=(6, 4))
sns.heatmap(lr_conf_matrix, annot=True, fmt="d", cmap="Blues",
            xticklabels=data.target_names, yticklabels=data.target_names)
plt.title("Logistic Regression - Confusion Matrix")
plt.xlabel("Predicted")
plt.ylabel("Actual")
plt.show()

Logistic Regression Results:
Cross-Validation Accuracy: 0.97
Classification Report:
               precision    recall  f1-score   support

      setosa       1.00      1.00      1.00        50
  versicolor       0.98      0.94      0.96        50
   virginica       0.94      0.98      0.96        50

    accuracy                           0.97       150
   macro avg       0.97      0.97      0.97       150
weighted avg       0.97      0.97      0.97       150
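
Passing `cv=5` with a classifier makes `cross_val_score` use stratified folds without shuffling. The sketch below makes the splitter explicit and also standardizes the features inside a pipeline, so that scaling is re-fitted on each training fold. Both the shuffled `StratifiedKFold` with `random_state=0` and the `StandardScaler` step are assumptions added for illustration; the original notebook uses neither, so the scores may differ from the 0.97 reported above.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)

# Scaling lives inside the pipeline, so each fold is scaled using
# statistics from its own training portion only
pipeline = make_pipeline(StandardScaler(), LogisticRegression(max_iter=200))

# Explicit splitter: 5 stratified folds, shuffled with a fixed seed (assumption)
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

scores = cross_val_score(pipeline, X, y, cv=cv)
print("Per-fold accuracy:", scores.round(3))
print(f"Mean accuracy: {scores.mean():.3f}")
```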

[9]: # Print a summary of cross-validation accuracies for all classifiers
print("\nFinal Comparison of Cross-Validation Accuracies:")
for name, accuracy in accuracy_results.items():
    print(f"{name}: {accuracy:.2f}")

Final Comparison of Cross-Validation Accuracies:

Decision Tree: 0.96
K-Nearest Neighbors: 0.97
Logistic Regression: 0.97
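
The three means differ by only 0.01, which on 150 samples corresponds to roughly one or two flowers. Reporting the spread across folds alongside the mean helps judge whether such a gap is meaningful. The sketch below reuses the three classifiers with the same settings as the notebook; the `models` and `results` dictionaries are names introduced here for illustration.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Same classifiers and hyperparameters as in the cells above
models = {
    "Decision Tree": DecisionTreeClassifier(),
    "K-Nearest Neighbors": KNeighborsClassifier(n_neighbors=3),
    "Logistic Regression": LogisticRegression(max_iter=200),
}

results = {}
for name, model in models.items():
    scores = cross_val_score(model, X, y, cv=5)
    # Store mean and standard deviation of the five fold accuracies
    results[name] = (scores.mean(), scores.std())

for name, (mean, std) in results.items():
    print(f"{name}: {mean:.3f} +/- {std:.3f}")
```

If the standard deviations are comparable to or larger than the gaps between the means, the ranking of the three classifiers on this dataset should be treated as inconclusive.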
