Base Paper
Learning
Rohit Kumar, Prince Thakur, SPS Chauhan
School of Computing Science and Engineering, Galgotias University
Greater Noida, UP, India-201310
rohitraks583@gmail.com
princekumarth@gmail.com
sps.chauhan@galgotiasuniversity.edu.in
Abstract— The growing use of computer-based technologies in the healthcare industry has led to the accumulation of large volumes of electronic data. Faced with this mass of information, medical professionals find it challenging to interpret symptoms accurately and to identify diseases at an early stage. In medicine, misdiagnosis is a major cause of poor treatment and of diseases being diagnosed only once they have become serious. Supervised machine learning techniques, however, have demonstrated the potential to surpass conventional diagnostic procedures and to assist medical professionals in diagnosing high-risk diseases. Many people are reluctant to visit a hospital or consult a doctor about a minor complaint, yet such a small problem can pose a significant medical risk, and online medical advice is readily available. The system evaluates the symptoms that a person provides as input and gives the predicted disease as output. A Naive Bayes classifier is used in the system. Our system focuses on accuracy: the more symptoms the person provides as input, the better the disease prediction. This work aims to help improve the healthcare industry.

Keywords— Supervised Machine Learning, Health care, Disease Prediction, Naïve Bayes classifier

I. INTRODUCTION
Machine learning (ML) algorithms use a range of mathematical, statistical, and probabilistic techniques to learn from data generated by past events and to apply what they learn in decision making. Advances in AI have enabled computer systems to perceive, reason, and act intelligently, much as human beings do. Supervised ML (sML) algorithms are used in a wide range of fields, including network intrusion detection, customer purchase behaviour detection, production process improvement, credit card fraud detection, and disease prediction. Many of these applications are built using the supervised variant of ML. In this approach, labeled records are used to train predictive models, which then predict the outcomes of unlabeled examples. This suggests that medical doctors can use supervised learning as a robust tool for diagnosing diseases effectively.

According to our study, reliance on general statistical techniques and on the work experience and intuition of medical professionals has led to undesirable decisions and errors when detecting disease-related risks. With the rapid proliferation of electronic health data, medical doctors face the challenge of diagnosing disease accurately at an early stage. For this reason, advanced computational methods such as supervised ML algorithms have been introduced to detect meaningful patterns and hidden information in data, which can be used to make critical decisions. As a result, the burden on medical staff is expected to decrease, patient mortality to fall, and survival rates to improve.

II. PROPOSED SOLUTION
Fig.1. Block Diagram

The classical process is one in which a patient visits a doctor, receives various treatments, and only at the end gets the results. This procedure takes a lot of time.

To save the time and money required for the initial diagnostic process, this program offers an automatic diagnostic system based on user input. The program takes input from the user and provides a list of possible diseases. Fig.1 shows our proposed model.

The purpose of this study was to test the hypothesis that supervised ML algorithms can improve health care through accurate and rapid diagnosis. In this study, we investigate studies that use more than one supervised ML model for each diagnostic problem. This approach provides more understanding and accuracy because the performance testing of a single
Authorized licensed use limited to: Florida Institute of Technology. Downloaded on February 05,2024 at 07:44:25 UTC from IEEE Xplore. Restrictions apply.
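As a rough illustration of the proposed solution, the sketch below shows how a multinomial Naive Bayes classifier could turn a set of user-selected symptoms into a ranked list of possible diseases. It is a minimal sketch under stated assumptions, not the authors' implementation: the toy records, the symptom and disease names, the smoothing value alpha, and the function names fit and rank_diseases are all hypothetical, and only the Python standard library is used.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy records (symptoms -> disease); NOT the paper's dataset.
TRAIN = [
    ({"fever", "cough", "fatigue"}, "flu"),
    ({"fever", "cough", "sore_throat"}, "flu"),
    ({"sneezing", "runny_nose", "sore_throat"}, "common_cold"),
    ({"sneezing", "runny_nose", "cough"}, "common_cold"),
    ({"itching", "skin_rash"}, "allergy"),
    ({"itching", "skin_rash", "sneezing"}, "allergy"),
]


def fit(records, alpha=1.0):
    """Estimate multinomial Naive Bayes parameters with Laplace smoothing."""
    class_counts = Counter(disease for _, disease in records)
    symptom_counts = defaultdict(Counter)
    vocab = set()
    for symptoms, disease in records:
        symptom_counts[disease].update(symptoms)
        vocab.update(symptoms)
    # log P(disease): relative frequency of each disease in the records.
    log_prior = {d: math.log(n / len(records)) for d, n in class_counts.items()}
    # log P(symptom | disease): smoothed so unseen pairs keep a nonzero probability.
    log_lik = {}
    for d in class_counts:
        total = sum(symptom_counts[d].values()) + alpha * len(vocab)
        log_lik[d] = {
            s: math.log((symptom_counts[d][s] + alpha) / total) for s in vocab
        }
    return log_prior, log_lik


def rank_diseases(model, symptoms):
    """Return (disease, posterior) pairs, most probable first."""
    log_prior, log_lik = model
    scores = {}
    for d in log_prior:
        # Symptoms that never appeared in training are ignored.
        scores[d] = log_prior[d] + sum(
            log_lik[d][s] for s in symptoms if s in log_lik[d]
        )
    # Normalise log scores into posteriors (log-sum-exp for numerical stability).
    top = max(scores.values())
    norm = top + math.log(sum(math.exp(v - top) for v in scores.values()))
    ranked = [(d, math.exp(v - norm)) for d, v in scores.items()]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


model = fit(TRAIN)
for disease, prob in rank_diseases(model, {"fever", "cough"}):
    print(f"{disease}: {prob:.2f}")
```

Each additional symptom adds another likelihood term to the score, which is one way to see why supplying more symptoms tends to separate the candidate diseases more clearly.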
Contains five drop-down menus, each with a list of all the symptoms. The user can select symptoms from these menus and, on clicking the button, the predicted disease is displayed in the text box. Fig.2 gives some detail of the dataset.

Fig.2. Datasets

C. GUI
Tkinter has been used to build the GUI in our project. It is used for creating GUIs for desktop-based applications. Tkinter is easy to learn, requires very little code for an application, is portable across operating systems, and comes preinstalled with the standard Python distribution. Combining Python with Tkinter is a fast way to create a GUI. The developed GUI is shown in Fig.3.

Fig.3. Developed GUI

IV. RESULT AND DISCUSSION
The classical process is one in which a patient visits a doctor, receives various treatments, and only at the end gets the results. This procedure takes a lot of time.

To save the time and money required for the initial diagnostic process, this program offers an automatic diagnostic system based on user input. The program takes symptoms as input from the user and displays the predicted disease as output in the text box.

The system predicts the disease from the symptoms given, that is, signs of infection or anything adverse that makes the user uncomfortable. The disease is predicted using the Naive Bayes algorithm. According to the literature, this algorithm achieves high accuracy on large datasets. The GUI lists the symptoms of all possible diseases as labels; symptoms are chosen accordingly and the prediction is then made. 70% of the dataset is used for training and 30% for testing. Training and testing are run from the GUI, where the results obtained are displayed.
• Diseases are predicted using the Naïve Bayes algorithm, mainly in its Multinomial NB form, since many symptoms are involved.
• According to the literature, this algorithm achieves high accuracy on large datasets.
• The GUI contains diseases as labels and gives the symptoms for each disease.

V. CONCLUSION
The system built so far takes symptoms from the user as input and generates a disease prediction as output. The user can select from one to five symptoms. Accuracy is lower if only one symptom is selected; the greater the number of inputs, the greater the accuracy. In this paper we have proposed a compact machine learning model based on the Naïve Bayes algorithm. We also tried to reduce the number of features in the dataset. In this process we were able to obtain sufficient accuracy for all datasets using our machine learning model. The best accuracy obtained, for most diseases, was approximately 78.6%.

ACKNOWLEDGMENT
It is an honor to work with Professor SPS Chauhan (Program Chair), who guided us through every obstacle; we are thankful for his support and for his views on the paper. We are also thankful to our reviewer, who encouraged us to work further on this and showed keen interest in our paper. We also place on record our gratitude to one and all who, directly or indirectly, lent a hand during this journey.

REFERENCES
[1] M. Marimuthu, M. Abinaya, K. S., K. Madhankumar, and V. Pavithra, “A Review on cardiovascular disease Prediction using Machine Learning and Data Analytics Approach,” International Journal of Computer Applications, vol. 181, no. 18, pp. 20–25, 2018.
[2] J. Joshi, D. Rinal, and J. Patel, “Diagnosis And Prognosis of Breast Cancer Using Classification Rules,” International Journal of Engineering Research and General Science, vol. 2, no. 6, pp. 315–323, Oct. 2014.
[3] V. Chaurasia and S. Pal, “Data Mining Approach to Detect Heart Diseases,” International Journal of Advanced applied Science and Information Technology (IJACSIT), vol. 2, no. 4, pp. 56–66, 2013.
[4] K. Dwivedi, “Performance evaluation of various machine learning techniques for prediction of heart condition,” Neural Computing and Applications, vol. 29, no. 10, pp. 685–693, 2018.
[5] A. V. Solanki, “Data Mining Techniques using WEKA Classification for Sickle Cell Disease,” International Journal of Computer Science and Information Technology, vol. 5, no. 4, pp. 5857–5860, 2014.
[6] A. F. Otoom, E. E. Abdallah, Y. Kilani, A. Kefaye, and M. Ashour, “Effective Diagnosis and Monitoring of Heart Disease,” International Journal of Software Engineering and Its Applications, vol. 9, no. 1, pp. 143–156, 2015.
[7] P. P. Sengar, M. J. Gaikwad, and A. S. Nagdive, “Comparative study of machine learning algorithms for breast cancer prediction,” Proceedings of the 3rd International Conference on Smart Systems and Inventive Technology, ICSSIT 2020, pp. 796–801, 2020.
[8] Chen, Y. Hao, K. Hwang, L. Wang, and L. Wang, “Disease Prediction by Machine Learning Over Big Data From Healthcare Communities,” IEEE Access, vol. 5, pp. 8869–8879, 2017.
[9] M. L. Raymer, W. F. Punch, E. D. Goodman, L. A. Kuhn, and A. K. Jain, “Dimensionality Reduction Using Genetic Algorithms,” IEEE Transactions on Evolutionary Computation, vol. 4, no. 2, pp. 164–171, 2000.
[10] D. Asir Antony Gnana Singh, E. Jebamalar Leavline, R. Priyanka, and P. Padma Priya, “Dimensionality Reduction using Genetic Algorithm for Improving Accuracy in Medical Diagnosis,” I.J. Intelligent Systems and Applications, MECS, no. 1, pp. 67–73, 2016.
[11] S. Jadhav, R. Kasar, N. Lade, M. Patil, and S. Kolte, “Disease Prediction by Machine Learning from Healthcare Communities,” International Journal of Scientific Research in Science and Technology, pp. 29–35, 2019.