JETIR2008396
JETIR2008396
org (ISSN-2349-5162)
Abstract : In this modern days, there are many changes occur in our daily life. It will mainly impacting on health system. As a result of this
various changes, health diseases are rapidly increasing in our day to day life. Here is some of diseases are more affected in our life. Such as
cardiovascular Diseases, Stress Depression, Cancer and many more diseases are present in our today’s life. Mainly, cardiovascular disease
is more commonly affected in our life. It will affect in any age group persons. The main cause of this cardiovascular disease is changes in
the Blood Pressure, Cholesterol, increasing Heartbeat etc.. It may lead to risk for life and death also. Coronary Heart disease is caused by
fatty plaque deposits on narrowed arteries walls supply to the heart and it will reduce the flowing of blood in heart. The main aim of this
project is to predict the heart disease with machine learning algorithms and diagnose in early stages. In this research, we are implementing
different machine learning algorithms with UCI dataset to find the best accuracy in different algorithms. Then i got best accuracy in
Artificial Neural Network. So ANN classification algorithm is used to know the possibilities of getting heart disease and diagnose in initial
stage.
Index Terms - Support Vector Machine, Decision Tree, Logistic Regression, Random Forest, KNN, Artificial Neural Network,
Machine Learning, Keras, Tensor Flow, Cardiovascular Disease Prediction.
I. INTRODUCTION
Nowadays, there are so many diseases affected in our life. Mainly Cardiovascular heart disease is most commonly affected in our
human life. And these are the most common types of cardiovascular diseases is cardiac arrest, hypertension, Heart strokes, coronary Heart
disease and many more. Coronary heart disease can be identified with different medical test and diagnosed it. But medical tests are taking
more time to detect and quite difficult. Sometimes it leads to mortality. According to World Health Organization, cardiovascular disease is
main cause of deaths in worldwide. Around 17.9 million people are mortality due to cardiovascular disease in the year 2016. Out of these
85% deaths occurs due to heart stroke and heart failure [1] and in India also around 1.7 million peoples are died due to Heart disease [2].
The rate of mortality are rapidly increased due to changes of human behavior habits like smoking, stress depression and drinking alcohol
[4] in day to day life. If we predict the heart disease in early stage, lot of patients can take proper treatment and prevention from these
diseases without taking any risk to the life. And it will automatically decrease the rate of mortality and increase the recovery rate. So
mainly prediction of heart disease are quite difficult in early stages and it plays crucial role to predict. Heart is the most important organ in
the human body.
In this days, it is important to develop the medical diagnosed system to identify the possibilities of getting heart disease in a short
time. So that, we can prevention the heart disease on early stage, and we can decrease the risk from heart disease and death. When we
talking about diagnose system, the machine learning(ML) algorithm are most popular technique to predict the risk of heart disease with
maximum accuracy in less time[3]. In machine learning algorithms, artificial neural network is the one of the best machine learning
technique which is used to predict the possibilities of heart disease with best accuracy and efficient results in a short period. In this
proposed system, we are using different algorithms has been implemented to predict the risk of heart disease. In this research, we are used
UCI dataset to implement the prediction of heart disease by using different machine learning techniques to get best accuracy. In the further
studies we are discussed about II. Literature survey III. Proposed system IV. Data source V. Methodology VI. Result VII. Conclusion and
References.
JETIR2008396 Journal of Emerging Technologies and Innovative Research (JETIR) www.jetir.org 683
© 2020 JETIR August 2020, Volume 7, Issue 8 www.jetir.org (ISSN-2349-5162)
In the existing system, some of disadvantages are obtained in machine learning techniques. Many researchers are used decision
tree technique. There are some drawbacks in this technique are sometimes it takes so much time to train and calculation the model
compared with other machine learning techniques. Some researchers are used SVM technique but here also some drawback are svm
doesn’t work on large dataset very well and sometimes it overlaps the classes. In knn technique it doesn’t learn the any data before training
the model and continues to training the model. Some researchers are uses large data with less attributes to predict the disease.
1. Random Forest
2. K-nearest Neighbor
3. Support Vector Machine
4. Logistic Regression
5. Decision Tree
6. Artificial Neural Network
1. Age Integer
2. Sex Integer
3. Chest Pain Integer
4. Blood Pressure Integer
5. Cholesterol Integer
6. Fbs(sugar) Integer
7. ECG(Electrocardiographic) Integer
8. Thalach (Maximum heart Rate) Integer
9. ST Depression(Oldpeak) Float
10. Slope Integer
11. Exercise angina(exang) Integer
12. Ca(Number of vessels) Integer
13. Thalassemia Integer
14. Target Integer
JETIR2008396 Journal of Emerging Technologies and Innovative Research (JETIR) www.jetir.org 684
© 2020 JETIR August 2020, Volume 7, Issue 8 www.jetir.org (ISSN-2349-5162)
V. METHODOLOGY
The main aim of this project is to diagnose the cardiovascular disease with the help of machine learning algorithms and it will
be the fastest way to predict and cure the disease at right time. We are showing different machine techniques to diagnose the heart disease
with some selected parameters are tested. In this methodology we are structured in different stages to predict [Fig.3].
In this proposed system, we are used three main various tools and different machine learning algorithms.
1) Tools:
Tensor Flow:It is the most popular machine learning library in the globe. Google artificial intelligence organisation and brain team
are developed tensor flow library. It is used to speed up the machine learning and AI system. It can use in different language like
python, java[12]
Keras: It is an application programming interface designed for user. It is frontend library for Tensor flow. Keras and tensor flow is
jointly works as one tool.[12]
Jupyter Notebook: It is open source application are available in anaconda. It is easy to create any python code and data visualization.
In anaconda, there are nearly 1400 packages are already available in environment in the repository.
2) Algorithms:
2.1. Random Forest:
Random forest is a one of the best algorithm in machine learning and it is member of Supervised Learning. Random forest runs same as
Essemble learning. It is nothing different that more decision trees are collected in one forest like that only it will solves the problem with
best accuracy of these model. In this algorithm, a tree can splits into nodes and it predicts the each node are available in tree.
The main advantage of random forest algorithms are it is very easy to solve the algorithm with good accuracy and it can able to
handle the large number of datasets. It is proficient in both the classification and regression.
JETIR2008396 Journal of Emerging Technologies and Innovative Research (JETIR) www.jetir.org 685
© 2020 JETIR August 2020, Volume 7, Issue 8 www.jetir.org (ISSN-2349-5162)
2.2. Decision Tree:
Decision tree is another type of machine learning algorithm and it is also comes under supervised learning. It uses the structured
type of tree to solve the predictive model. In which each tree having parent and child nodes to represents a structured tree.
The main advantage of Decision Tree algorithms are it takes less time and less attempt for pre-processing the data in a model
comparing with other algorithms in a machine learning. It is very attractive and easy to explain the model to clients.
In machine learning, the classification report is used to calculate the standard of predicted values. With this report we can get
four types of results as precision, f1 score, recall, support values (Table.2). With this results we can calculate the f-measures, specificity,
and sensitivity. So that we may estimate the predicted result is true or false.
VI. RESULT :
In this study, we are showing the Classification report of different Machine Learning algorithms using UCI dataset.
A) Classification Report:
Classifications
Algorithms Precision F-Measure Sensitivity Specificity
B ) ACCURACY REPORT:
In below, we are showing accuracy report of different ML algorithms using UCI dataset.
1) Accuracy = (true negative + true positive) / (true negative + true positive + false negative + false positive)
Finally, In this Research Paper, We got highest accuracy in Artificial Neural Network with 96.00% accuracy using dataset [fig.
5]. In UCI dataset classification report, the Precision, F-Measure, Sensitivity, Specificity are 94.73%, 97.23%, 100% and 85.71%
respectively[Table.2] with Artificial Neural Network algorithm. And second highest accuracy got in Logistic Regression with 88.52%
accuracy with dataset. So in this research ANN is more accurate to prediction the Heart disease in early stages.
VII. CONCULSION :
Mainly In this paper, we are implemented six Machine learning techniques are SVM, KNN, Random Forest, Logistic
Regression, Decision Tree, ANN algorithms to find the risk of heart disease prediction with best accuracy. In this research, we are used
nearly 303 patients record from UCI dataset. We used the dataset and trained with all the ML algorithms separately. After that, we are
achieved best accuracy and result with artificial neural network algorithm in UCI dataset are 96.00% and followed by logistic regression
are 88.52% respectively. In this proposed system, we got artificial neural network is better method than the remaining algorithms. Finally,
JETIR2008396 Journal of Emerging Technologies and Innovative Research (JETIR) www.jetir.org 687
© 2020 JETIR August 2020, Volume 7, Issue 8 www.jetir.org (ISSN-2349-5162)
the artificial neural network is best technique to find the possibilities of cardiovascular disease by using different parameters in early stages.
In Future study, researcher can use some advance deep learning algorithms like Conventional neural network, Long short-term memory,
Generative Adversarial Network, RNN etc.,. To identify the heart disease with maximum accuracy using more number of parameters.
VIII. REFERENCE :
1. “Cardiovascular diseases”, World Health Organization, 2020. [Online].
Available:https://www.who.int/en/news-room/fact-sheets/detail/cardiovascular-diseases (cvds) [accessed: 15-june-2020].
2. “Heart disease in India “Global Burden of disease, 2017. [Online]. Available: http://www.healthdata.org/india [accessed:
15-june-2020]
3. S. Singh, S.Bharti, “Analytical study of heart disease prediction comparing with different algorithms”, International Conference on
Computing, Communication & Automation, 2015, IEEE
4. T. Karaylan and. Kl, “Prediction of heart disease using neural network”, International Conference on Computer Science and
Engineering (UBMK), 2017, IEEE.
5. M. Raihan, S. Mondal, A. More, P. Boni, “Smartphone Based Heart Attack Risk Prediction System with Statistical Analysis and Data
Mining Approaches”, Advances in Science,Technology and Engineering Systems Jaurnal, 2017.
6. Parichay Kumar Mandal, Muhammad Muinul Islam, Tanvir Hossain, Promila Ghosh, Shekel Ahmed Shaj, Abdullah Anik, Mubtasim
Rafid Chowdhury, M. Raihan, Saikat Mondal and Arun More “Risk Prediction of Ischemic Heart Disease Using Artificial Neural
Network” Internatianal Conference on ECCE, 2019, IEEE.
7. R. Thanigaivel, and K. Ramesh Kumar. "Boosted Apriori: an Effective Data Mining Association Rules for Heart Disease Prediction
System." Middle-Eost Journal of Scientific Research, 2016.
8. Monika Gandhi, Shailendra Narayanan Singh, “Predictions in heart disease using techniques of data mining”, International
Conference on futuristic trends on computational analysis and knowladge management, 2015, IEEE.
9. M. Marimuthu, S.Deivarani, Gayathri.R,”Analysis of Heart Disease prediction using various Machine LearningTechniques”,
International Conference on Artificial Intelligence, Smart Grid and Smart City Applications, 2019, IEEE.
10. Shashikant Ghumbre, Chetan Patil, Ashok Ghatol, “Heart Disease Diagnosis using Suport Vector Machine”, Internatianal C0nference
on Information Technology, 2011, ICCIT.
11. Resul Das, Ibrahim Turkoglu, and Abdulkadir Sengur, ``Effective diagnosis of heart disease through neural networks ensembles'',
Expert System with Applications: An Internati0nal Journal, 2009.
12. “Tensor Flow”[Online].Available: https://www.tensorflow.org/ [accessed: 15- June- 2020].
13. “UCI dataset” [Online]. Available: https://archive.ics.uci.edu/ml/datasets/heart+Disease/ [accesed: 15-febuary-2020].
JETIR2008396 Journal of Emerging Technologies and Innovative Research (JETIR) www.jetir.org 688