
IBM- Supervised ML-Classification-PeerEval

Main Objective, Brief Description about dataset and its attributes


For this project, I am using the Heart Disease UCI dataset from Kaggle. Many factors influence the
development of heart disease in a patient, and the dataset captures 14 explanatory variables describing
aspects of each patient. The OBJECTIVE of this report is to build a machine learning model capable of
predicting whether or not someone has heart disease based on their medical attributes. Since the target
variable is categorical, this is a classification problem. The data contains the following columns:

• age
• sex
• chest pain type (4 values)
• resting blood pressure
• serum cholesterol in mg/dl
• fasting blood sugar > 120 mg/dl
• resting electrocardiographic results (values 0,1,2)
• maximum heart rate achieved
• exercise induced angina
• oldpeak = ST depression induced by exercise relative to rest
• the slope of the peak exercise ST segment
• number of major vessels (0-3) colored by fluoroscopy
• thal: 3 = normal; 6 = fixed defect; 7 = reversible defect

Figure 1: First 5 rows of the DataFrame
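Loading and previewing the data might look like the sketch below. The file name `heart.csv` and the short column names (`cp`, `trestbps`, etc.) are assumptions based on the common Kaggle version of this dataset; here a tiny stand-in DataFrame with those columns replaces the real `pd.read_csv` call so the snippet is self-contained.

```python
import pandas as pd

# In practice the Kaggle file would be loaded with pd.read_csv("heart.csv").
# This tiny stand-in frame only illustrates the 14-column layout.
df = pd.DataFrame({
    "age": [63, 37, 41], "sex": [1, 1, 0], "cp": [3, 2, 1],
    "trestbps": [145, 130, 130], "chol": [233, 250, 204],
    "fbs": [1, 0, 0], "restecg": [0, 1, 0], "thalach": [150, 187, 172],
    "exang": [0, 0, 0], "oldpeak": [2.3, 3.5, 1.4], "slope": [0, 0, 2],
    "ca": [0, 0, 0], "thal": [1, 2, 2], "target": [1, 1, 1],
})
print(df.head())        # first rows, as in Figure 1
print(df.shape)         # 14 columns: 13 features plus the target
```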


Data Exploration, Data Cleaning, & Feature Engineering
The data was checked for duplicates, null values, shape, and data types.
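These checks are one-liners in pandas; a minimal sketch (using a small stand-in frame in place of the real dataset):

```python
import pandas as pd

# Stand-in for the loaded heart-disease DataFrame.
df = pd.DataFrame({"age": [63, 37, 41], "chol": [233, 250, 204], "target": [1, 1, 0]})

print(df.shape)               # (rows, columns)
print(df.dtypes)              # data type of each column
print(df.isnull().sum())      # null count per column
print(df.duplicated().sum())  # number of duplicated rows
```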

Figure 2: Checking for Null,Datatypes,Duplicates

Figure 3: Distribution of Dataset

Figure 4 : Visualization of data for more in depth analysis


Figure 5: Correlation Table

Figure 6: Feature importances used to identify the most influential variables


Since the data is already clean, no further data cleaning was needed. Since no single variable has an
overwhelming influence on scale, no scaling was applied to the data. The data was then split into training and
test sets using the train_test_split function from the scikit-learn library. The next step is to apply each
classification algorithm to the training set and calculate the accuracy of its predictions.
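The split step can be sketched as follows; the 80/20 split ratio, the random seed, and the synthetic feature matrix are illustrative assumptions, not values taken from the report.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the feature matrix and binary target.
rng = np.random.default_rng(0)
X = pd.DataFrame(rng.normal(size=(100, 4)),
                 columns=["age", "trestbps", "chol", "thalach"])
y = rng.integers(0, 2, size=100)

# Hold out 20% for testing; stratify keeps class proportions similar.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y)
print(X_train.shape, X_test.shape)
```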
Classification Algorithms

Three models are used in this assignment namely:

• Support Vector Machines (SVM)


Support vector machines (SVMs) are a set of supervised learning methods used for classification,
regression, and outlier detection. Their advantages include: effectiveness in high-dimensional spaces,
even when the number of dimensions exceeds the number of samples; memory efficiency, since the decision
function uses only a subset of the training points (the support vectors); and flexibility, since different
kernel functions can be specified for the decision function. Common kernels are provided, and it is also
possible to specify custom kernels.
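A minimal sketch of fitting an SVM with scikit-learn, using a synthetic dataset in place of the real one; the RBF kernel and C=1.0 are default choices, not the values used in the report.

```python
from sklearn.svm import SVC
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

# Synthetic stand-in with 13 features, like the heart dataset.
X, y = make_classification(n_samples=200, n_features=13, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0)

svm = SVC(kernel="rbf", C=1.0)   # untuned defaults
svm.fit(X_tr, y_tr)
print("SVM test accuracy:", svm.score(X_te, y_te))
print("support vectors used:", svm.n_support_.sum())
```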
• K-Nearest Neighbours
k-NN is a type of classification where the function is only approximated locally and all computation is
deferred until function evaluation. Since this algorithm relies on distance for classification, if the
features represent different physical units or come in vastly different scales then normalizing the
training data can improve its accuracy dramatically. A peculiarity of the k-NN algorithm is that it is
sensitive to the local structure of the data.
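Because of the distance-scaling issue noted above, k-NN is often wrapped in a scaling pipeline; a sketch under the same synthetic-data assumption, with k=5 as an illustrative (not reported) choice:

```python
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=200, n_features=13, random_state=1)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=1)

# Standardizing first keeps any one feature from dominating the distances.
knn = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5))
knn.fit(X_tr, y_tr)
print("k-NN test accuracy:", knn.score(X_te, y_te))
```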
• Random Forest
Random forest is a flexible, easy-to-use machine learning algorithm that produces a good result most of
the time, even without hyperparameter tuning. It is also one of the most widely used algorithms because
of its simplicity and versatility. Random forest is a supervised learning algorithm: the "forest" it builds
is an ensemble of decision trees, usually trained with the "bagging" method. The general idea of bagging
is that combining multiple learning models improves the overall result.
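Fitting the ensemble follows the same pattern as the other two models; the tree count of 100 is scikit-learn's default, not a tuned value from the report.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=200, n_features=13, random_state=2)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=2)

# 100 bagged decision trees; no hyperparameter tuning, as the text notes.
rf = RandomForestClassifier(n_estimators=100, random_state=2)
rf.fit(X_tr, y_tr)
print("Random Forest test accuracy:", rf.score(X_te, y_te))
```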

Key Findings

From the accuracy table we can see that Random Forest is the best model for our data, as it achieves the
highest accuracy. Thus, since our objective is to best predict the presence of heart disease, we use the
third model, the Random Forest classifier.

To make predictions about a patient's heart condition with this Random Forest model, we input the 13
medical attributes described above. From the feature importances, we observe that chest pain type has a
high importance value, meaning that chest pain type plays a very important role in explaining whether or
not a patient suffers from a heart condition.

Figure 7: Accuracy scores for 3 Classifiers
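The chest-pain finding can be illustrated by reading a fitted forest's `feature_importances_` (Random Forest exposes importances rather than linear coefficients). The data below is a synthetic stand-in deliberately built so the `cp` column drives the target; the real importances are those in Figure 6.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(3)
# One informative column ("cp") plus three noise columns.
cp = rng.integers(0, 4, size=300)
noise = rng.normal(size=(300, 3))
X = np.column_stack([cp, noise])
y = (cp >= 2).astype(int)  # target driven entirely by chest pain type

rf = RandomForestClassifier(n_estimators=100, random_state=3).fit(X, y)
names = ["cp", "age", "chol", "thalach"]
for name, imp in sorted(zip(names, rf.feature_importances_), key=lambda t: -t[1]):
    print(f"{name}: {imp:.3f}")   # "cp" should rank first
```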


Proposed Future Work

• Hyperparameter Tuning
• Cross-Validation
• Classification report
• Try different encoders
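The first two items above could be combined in a single step with scikit-learn's GridSearchCV; a sketch on synthetic data, where the parameter grid values are illustrative placeholders rather than a recommended search space.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=200, n_features=13, random_state=4)

# Grid of illustrative values only; a real search would be broader.
grid = GridSearchCV(
    RandomForestClassifier(random_state=4),
    param_grid={"n_estimators": [50, 100], "max_depth": [3, None]},
    cv=5,                 # 5-fold cross-validation
    scoring="accuracy",
)
grid.fit(X, y)
print("best params:", grid.best_params_)
print("best CV accuracy:", round(grid.best_score_, 3))
```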
