1) Download the binary classification dataset for...
NAME: PRATHAM
ROLL NO: 23126039
OBJECTIVE: The assignment covers the following key aspects:
Model Development: Build Logistic Regression, SVM, and KNN models.
Performance Comparison: Compare the performance of these models.
Dataset: Use the personal loan default prediction dataset.
Hyperparameter Tuning & Regularization: Explore hyperparameter tuning and regularization for each model.
Model Evaluation: Use the F1 score as the primary evaluation metric.
Model Selection: Select the best model based on the evaluation results.
Drive already mounted at /content/drive; to attempt to forcibly remount, call drive.mount("/content/drive", force_remount=True).
import pandas as pd
from sklearn.model_selection import train_test_split
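The printout does not show the remaining imports or the line that loads the dataset into df; a minimal sketch, assuming the CSV sits on the mounted Drive (the file path below is a placeholder, not the original one):
# Remaining imports used later in the notebook
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler, OneHotEncoder
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import f1_score
# Load the dataset from the mounted Drive (hypothetical path)
df = pd.read_csv('/content/drive/MyDrive/loan_data.csv')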
# Dataset Explanation
print(df.head())
print(df.info())
print(df.describe())
# Explanation:
# The dataset contains information about personal loans, including:
# - person_age: Age of the borrower.
# - person_gender: Gender of the borrower.
# - person_education: Education level of the borrower.
# - person_income: Annual income of the borrower.
# - person_emp_exp: Employment experience in years.
# - person_home_ownership: Home ownership status.
# - loan_amnt: Loan amount.
# - loan_intent: Purpose of the loan.
# - loan_int_rate: Interest rate of the loan.
# - loan_percent_income: Loan amount as a percentage of income.
# - cb_person_cred_hist_length: Credit history length.
# - credit_score: Credit score of the borrower.
# - previous_loan_defaults_on_file: Whether the borrower has previous loan defaults on file.
# - loan_status: Loan default status (0 = No default, 1 = Default). This is the target variable.
       credit_score   loan_status
count  45000.000000  45000.000000
mean     632.608756      0.222222
std       50.435865      0.415744
min      390.000000      0.000000
25%      601.000000      0.000000
50%      640.000000      0.000000
75%      670.000000      0.000000
max      850.000000      1.000000
# Train-Test Split
X = df.drop('loan_status', axis=1)
y = df['loan_status']
# The split call is not visible in the printout; an 80/20 split matches the shapes below
# (random_state=42 is assumed, consistent with the other cells)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
print("Train set shape:", X_train.shape, y_train.shape)
print("Test set shape:", X_test.shape, y_test.shape)
Train set shape: (36000, 13) (36000,)
Test set shape: (9000, 13) (9000,)
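The describe output above shows a mean loan_status of about 0.22, i.e. roughly 22% defaults, so the classes are imbalanced and F1 is a sensible headline metric. A quick check (sketch):
# Class distribution of the target (roughly 78% non-default, 22% default)
print(y.value_counts(normalize=True))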
# Preprocessing
numerical_features = X.select_dtypes(include=['int64', 'float64']).columns
categorical_features = X.select_dtypes(include=['object']).columns

numerical_transformer = Pipeline(steps=[
    ('scaler', StandardScaler())
])

categorical_transformer = Pipeline(steps=[
    ('onehot', OneHotEncoder(handle_unknown='ignore'))
])

preprocessor = ColumnTransformer(
    transformers=[
        ('num', numerical_transformer, numerical_features),
        ('cat', categorical_transformer, categorical_features)
    ])
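The cell that builds and fits logistic_pipeline is not visible in the printout; a minimal sketch, mirroring the pipeline pattern used for the other models (the liblinear solver and random_state=42 are assumptions based on the regularization section below):
logistic_pipeline = Pipeline(steps=[
    ('preprocessor', preprocessor),
    ('classifier', LogisticRegression(solver='liblinear', random_state=42))
])
logistic_pipeline.fit(X_train, y_train)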
# Predictions
y_train_pred = logistic_pipeline.predict(X_train)
y_test_pred = logistic_pipeline.predict(X_test)
# F1 Score
train_f1 = f1_score(y_train, y_train_pred)
test_f1 = f1_score(y_test, y_test_pred)
print(f"Train F1 Score: {train_f1}")
print(f"Test F1 Score: {test_f1}")
4. Regularization
# The l1_pipeline cell is not visible in the printout; it is assumed to mirror l2_pipeline with penalty='l1'
l1_pipeline = Pipeline(steps=[
    ('preprocessor', preprocessor),
    ('classifier', LogisticRegression(solver='liblinear', penalty='l1', random_state=42))
])
l2_pipeline = Pipeline(steps=[
    ('preprocessor', preprocessor),
    ('classifier', LogisticRegression(solver='liblinear', penalty='l2', random_state=42))
])
l1_pipeline.fit(X_train, y_train)
l2_pipeline.fit(X_train, y_train)
l1_test_pred = l1_pipeline.predict(X_test)
l2_test_pred = l2_pipeline.predict(X_test)
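The printout does not show the F1 comparison for the two penalties; a minimal sketch of how it would be computed:
l1_test_f1 = f1_score(y_test, l1_test_pred)
l2_test_f1 = f1_score(y_test, l2_test_pred)
print(f"L1 (Lasso) Test F1 Score: {l1_test_f1}")
print(f"L2 (Ridge) Test F1 Score: {l2_test_f1}")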
results = []
C_values = [0.001, 0.01, 0.1, 1, 10, 100]
for C in C_values:
    pipeline = Pipeline(steps=[
        ('preprocessor', preprocessor),
        ('classifier', LogisticRegression(solver='liblinear', C=C, random_state=42))
    ])
    pipeline.fit(X_train, y_train)
    y_pred = pipeline.predict(X_test)
    f1 = f1_score(y_test, y_pred)
    results.append({'C': C, 'Test F1 Score': f1})
results_df = pd.DataFrame(results)
print(results_df)
         C  Test F1 Score
0    0.001       0.714126
1    0.010       0.751487
2    0.100       0.757252
3    1.000       0.758393
4   10.000       0.757814
5  100.000       0.757814
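The manual loop above is a grid search over C. As an aside (not part of the original notebook), the same sweep could be expressed with GridSearchCV, scoring directly on F1 via cross-validation; a sketch:
from sklearn.model_selection import GridSearchCV
grid = GridSearchCV(
    Pipeline(steps=[('preprocessor', preprocessor),
                    ('classifier', LogisticRegression(solver='liblinear', random_state=42))]),
    param_grid={'classifier__C': [0.001, 0.01, 0.1, 1, 10, 100]},
    scoring='f1',
    cv=5
)
grid.fit(X_train, y_train)
print(grid.best_params_, grid.best_score_)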
# inbuilt_pipeline is not shown in the printout; assumed to use LogisticRegression with default settings
inbuilt_pipeline = Pipeline(steps=[
    ('preprocessor', preprocessor),
    ('classifier', LogisticRegression())
])
inbuilt_pipeline.fit(X_train, y_train)
inbuilt_test_pred = inbuilt_pipeline.predict(X_test)
inbuilt_test_f1 = f1_score(y_test, inbuilt_test_pred)
print(f"Inbuilt Logistic Regression Test F1 Score: {inbuilt_test_f1}")
# The deviation is likely due to the different default solver and regularization settings used by the inbuilt model compared to the tuned pipelines above.
svm_pipeline = Pipeline(steps=[
    ('preprocessor', preprocessor),
    ('classifier', SVC(random_state=42))
])
svm_pipeline.fit(X_train, y_train)
svm_test_pred = svm_pipeline.predict(X_test)
svm_test_f1 = f1_score(y_test, svm_test_pred)
print(f"SVM Test F1 Score: {svm_test_f1}")
svm_results = []
C_values_svm = [0.1, 1, 10, 100]
for C in C_values_svm:
    svm_pipeline_tuned = Pipeline(steps=[
        ('preprocessor', preprocessor),
        ('classifier', SVC(C=C, random_state=42))
    ])
    svm_pipeline_tuned.fit(X_train, y_train)
    y_pred_svm = svm_pipeline_tuned.predict(X_test)
    f1_svm = f1_score(y_test, y_pred_svm)
    svm_results.append({'C': C, 'Test F1 Score': f1_svm})
svm_results_df = pd.DataFrame(svm_results)
print(svm_results_df)
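The sweep above only varies C with the default RBF kernel. As an extension (not in the original notebook), the kernel could be tuned as well; a sketch, noting that each SVC fit on 36,000 rows is fairly slow:
for kernel in ['linear', 'rbf', 'poly']:
    svm_kernel_pipeline = Pipeline(steps=[
        ('preprocessor', preprocessor),
        ('classifier', SVC(kernel=kernel, random_state=42))
    ])
    svm_kernel_pipeline.fit(X_train, y_train)
    print(kernel, f1_score(y_test, svm_kernel_pipeline.predict(X_test)))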
8. KNN Implementation
knn_pipeline = Pipeline(steps=[
    ('preprocessor', preprocessor),
    ('classifier', KNeighborsClassifier())
])
knn_pipeline.fit(X_train, y_train)
knn_test_pred = knn_pipeline.predict(X_test)
knn_test_f1 = f1_score(y_test, knn_test_pred)
print(f"KNN Test F1 Score: {knn_test_f1}")
# Assuming X_train, X_test, y_train, y_test, and preprocessor are already defined from previous steps
knn_results = []
neighbors = [3, 5, 7, 9]
distance_metrics = ['euclidean', 'manhattan', 'minkowski']
for n in neighbors:
    for metric in distance_metrics:
        knn_pipeline_tuned = Pipeline(steps=[
            ('preprocessor', preprocessor),
            ('classifier', KNeighborsClassifier(n_neighbors=n, metric=metric))
        ])
        knn_pipeline_tuned.fit(X_train, y_train)
        y_pred_knn = knn_pipeline_tuned.predict(X_test)
        f1_knn = f1_score(y_test, y_pred_knn)
        knn_results.append({'Neighbors': n, 'Distance Metric': metric, 'Test F1 Score': f1_knn})
knn_results_df = pd.DataFrame(knn_results)
print(knn_results_df)
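The KNN results table is not reproduced in the printout; either way, the best configuration can be read off the DataFrame directly, for example (sketch):
# Highest-F1 combination of n_neighbors and distance metric
best_knn = knn_results_df.sort_values('Test F1 Score', ascending=False).iloc[0]
print(best_knn)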
# 10. Conclusion
svm_pipeline.fit(X_train, y_train)
svm_test_pred = svm_pipeline.predict(X_test)
svm_test_f1 = f1_score(y_test, svm_test_pred)
knn_pipeline.fit(X_train, y_train)
knn_test_pred = knn_pipeline.predict(X_test)
knn_test_f1 = f1_score(y_test, knn_test_pred)
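A minimal sketch of printing the final side-by-side comparison, reusing the test_f1, svm_test_f1, and knn_test_f1 variables computed above:
print(f"Logistic Regression Test F1: {test_f1}")
print(f"SVM Test F1:                 {svm_test_f1}")
print(f"KNN Test F1:                 {knn_test_f1}")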
# Conclusion:
# Based on the F1 scores, we can compare the performance of the three models:
# - Logistic Regression: [logistic_test_f1 value]
# - Support Vector Machine (SVM): [svm_test_f1 value]
# - K-Nearest Neighbors (KNN): [knn_test_f1 value]
# Based on the results obtained, the best-performing models for this dataset are usually SVM or Logistic Regression; the KNN pipeline performed comparatively worse.
# Generally, SVM provided the highest F1 score in most of the cases. Logistic Regression also provided good scores and is much faster to train.
The SVM (Support Vector Machine) model has the highest F1 score (0.8013716697441309), making it the best-performing model among the three.