Breast Cancer Detectionusing Artificial Neural Networks
Breast Cancer Detectionusing Artificial Neural Networks
net/publication/355982962
CITATIONS READS
2 2,691
1 author:
SEE PROFILE
All content following this page was uploaded by Md Haris Uddin Sharif on 21 November 2021.
https://doi.org/10.22214/ijraset.2021.38582
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.429
Volume 9 Issue X Oct 2021- Available at www.ijraset.com
Abstract: Early detection of disease has emerged as a critical issue in recent years due to the fast population increase seen in
medical research. The chance of dying from breast cancer increases dramatically as the world’s population continues to increase
at an alarming rate. Compared to other cancers discovered thus far, breast cancer is the second most severe. In addition to
assisting medical staff in disease diagnosis, an automated disease detection system also provides reliable, effective, and fast
intervention, which reduces the likelihood of mortality. In this research study, the Artificial Neural Network is employed for
breast cancer classification. The model is validated on well-known dataset comprised from UCI machine learning repository.
The results reveal that the ANNs obtained the highest accuracy i.e. 98.24%.
Keywords: Machine Learning, Neural Network, Algorithm, Artificial Intelligence.
I. INTRODUCTION
The correct identification of some essential information is a significant problem in bioinformatics and medical research, among
other fields [1]. In the field of medicine, the diagnosis of a disease is a physically demanding and challenging task. Thousands of
diagnostic centres, hospitals, and research institutes, in addition to countless websites, provide a wealth of medical diagnosis
information to the public at large. It is scarcely required to categorize them to make the system automated and fast in identifying
medical conditions. The expertise and ability of the medical planning officer in the medical field are often used to diagnose a
requirement in most cases. Consequently, there are situations in which mistakes and undesirable biases occur. It also takes a long
time to get an accurate diagnosis of the illness.
According to the American Cancer Society [2], women are more likely than men to be affected with breast cancer than all other
cancers discovered. Approximately one-third of the female population is infected with invasive breast cancer, according to
estimations. Breast cancer is the most common kind of cancer in women all over the world. Breast cancer develops as a result of the
abnormal development of specific cells inside the breast. Several methods have been developed to ensure that breast cancer is
diagnosed correctly. Breast screening, often known as mammography [3], is used to detect and diagnose breast cancer. By using X-
rays, it is possible to determine the nipple status of a woman. Breast cancer is difficult to detect in its early stages in most cases,
owing to the tiny size of the cancer cell when seen from the outside. It is possible to detect cancer in its early stages using
mammography, and the procedure takes just a few minutes. When it comes to detecting breast cancer, ultrasound [4] is a well-
known method in which a sound wave is delivered into the body to examine the situation on the inside. A transducer that provides
sound waves is located on the skin, and the bounce of the sound waves records the echoes of the tissues of the body as they travel
through the body. It is required to convert the echoes into a greyscale, a binary value represented in a computer.
Positron emission tomography (PET) [5], which uses F-fluorodeoxyglucose to image the human body, allows physicians to
determine the location of a tumour in the body. It is based on the detection of radiolabel cancer cells in the body. -tracers that are
specific. Breast distortions may be detected using dynamic magnetic resonance imaging (MRI), which has been developed [6]. The
modality predicts the pace of contrast enhancement in cancer by increasing the rate of angiogenesis in the cancerous tissue. The
presence of metastases on magnetic reasoning imaging in breast cancer patients is associated with increased contrast enhancement.
As a consequence of advancements in imaging technology, the method known as Elastography [7] has just been created. Breast
cancer tissue that is larger than the normal parenchyma may be removed using this method. This method uses a colour map of probe
compression to distinguish between benign and malignant tumours.
Medical prognosis has significantly benefited from the application of machine learning [8–11], deep learning [12, 13], and bio-
inspired computing [14]. There have been many methods shown, but none of them have provided an accurate and reliable result.
Doctors must interpret a large amount of imaging data during mammography, which decreases accuracy. This technique is highly
time-consuming, and in some cases, it incorrectly diagnoses the illness. This paper proposed a machine learning-based technique
(Artificial Neural Networks) to detect the disease from the input features.
The remaining part of the paper is prepared as follows. The following section outlines the current review of state of the art in this
field, followed by which the methods and materials used for the study are illustrated. Section III presented the details of the dataset,
and section IV presented the experimental protocol, while section V concluded the paper.
IV. METHODOLOGY
To perform breast cancer detection, we performed several steps.
V. DATA PRE-PROCESSING
Data pre-processing is the first step in filling in the gaps left by missing data, detecting and eliminating outliers, and resolving self-
contradiction problems. In the dataset, there are 16 missing values for characteristics that are not present. The mean takes the place
of the missing attributes for that class. Additionally, the dataset is subjected to random selection to ensure that the data is adequately
circulated. After data pre-processing, the dataset was divided into the training and testing phase. The training phase is used to extract
the features from the dataset, and the testing phase is used to evaluate how the suitable model performs when it comes to predicting
from the dataset. Each component of the dataset is split into two parts, Training and Testing. Cross-validation using K folds indicate
that a single fold is used for testing, with the remaining K1 folds being used for training cyclically. Cross-validation is used to
prevent over fitting in the data collection process. Specifically, a ten-fold cross-validation method is used to partition data, with
nine-fold of the data utilized for training and one fold used for testing in each iteration of the research.
VI. CLASSIFICATION
Classification is the process of dividing a collection of data into categories. It may be done on both structured and unstructured data.
Predicting the class of supplied data points is the first step in the procedure. Target labels and categories are all terms used to
describe the course. Estimating the mapping function from the discrete input variables to the discrete output variable is classified as
predictive modelling. The principal purpose is to figure out which category or class the new data belongs to. In this research study,
we employed an Artificial Neural Network (ANNs) for breast cancer classification. The detail of the employed architecture is given
below.
Figure 3: Model Training As a function of epochs Figure 4: Classification accuracy as a function of epochs
VIII. CONCLUSIONS
This paper presented a machine learning technique for the prediction of breast cancer. It is incredibly costly and time-consuming to
conduct a medical diagnostic process in the area of medicine. According to the system’s recommendations, machine learning
techniques may be used as a clinical assistant to detect breast cancer, which will be very beneficial for new doctors of a physician in
the event of a misdiagnosis. The model produced by ANN is more consistent than any other method previously mentioned, and it
has the potential to make essential advancements in breast cancer prediction. Based on the research findings, we can infer that
machine learning techniques can automatically detect the disease with high accuracy.
REFERENCES
[1] Park SH, Han K. Methodological guide for evaluating clinical performance and effect of artificial intelligence technology for medical diagnosis and prediction.
Radiol Soc N Am. 2018;286(3):800–9.
[2] Breast Cancer: Statistics, Approved by the Cancer.Net Editorial Board, 04/2017. [Online]. Available: http://www.cancer.net/cance r-types/breast-
cancer/statistics. Accessed 26 Aug 2018.
[3] Mori M, Akashi-Tanaka S, Suzuki S, Daniels MI, Watanabe C, Hirose M, Nakamura S. Diagnostic accuracy of contrast-enhanced spectral mammography in
comparison to conventional full-feld digital mammography in a population of women with dense breasts. Springer. 2016;24(1):104–10.
[4] Kurihara H, Shimizu C, Miyakita Y, Yoshida M, Hamada A, Kanayama Y, Tamura K. Molecular imaging using PET for breast cancer. Springer.
2015;23(1):24–32
[5] Azar AT, El-Said SA. Probabilistic neural network for breast cancer classifcation. Neural Comput Appl. 2013;23(6):1737–51.
[6] Nagashima T, Suzuki M, Yagata H, Hashimoto H, Shishikura T, Imanaka N, Miyazaki M. Dynamic-enhanced MRI predicts metastatic potential of invasive
ductal breast cancer. Springer. 2002;9(3):226–30
[7] Park CS, Kim SH, Jung NY, Choi JJ, Kang BJ, Jung HS. Interobserver variability of ultrasound elastography and the ultrasound BI-RADS lexicon of breast
lesions. Springer. 2013;22(2):153–60.
[8] Ayon SI, Islam MM, Hossain MR. Coronary artery heart disease prediction: a comparative study of computational intelligence techniques. IETE J Res. 2020;.
https://doi.org/10.1080/03772 063.2020.1713916.
[9] Muhammad LJ, Islam MM, Usman SS, Ayon SI. Predictive data mining models for novel coronavirus (COVID-19) infected patients’ recovery. SN Comput Sci.
2020;1(4):206.
[10] Islam MM, Iqbal H, Haque MR, Hasan MK. Prediction of breast cancer using support vector machine and K-Nearest neighbors. In: Proc. IEEE Region 10
Humanitarian Technology Conference (R10-HTC), Dhaka, 2017, pp. 226–229
[11] Haque MR, Islam MM, Iqbal H, Reza MS, Hasan MK. Performance evaluation of random forests and artifcial neural networks for the classifcation of liver
disorder. In: Proc. International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2), Rajshahi, 2018, pp. 1–5.
[12] Ayon SI, Islam MM. Diabetes prediction: a deep learning approach. Int J Inf Eng Electron Bus (IJIEEB). 2019;11(2):21–7.
[13] Islam MZ, Islam MM, Asraf A. A combined deep CNN-LSTM network for the detection of novel coronavirus (COVID-19) using X-ray images, 2020. pp. 1–
20.
[14] Hasan MK, Islam MM, Hashem MMA. Mathematical model development to detect breast cancer using multigene genetic programming. In: 2016 5th
International Conference on Informatics, Electronics and Vision (ICIEV), pp. 574–579, 2016.
[15] Sakri SB, Rashid NBA, Zain ZM. Particle swarm optimization feature selection for breast cancer recurrence prediction. IEEE Access. 2018;6:29637–47.
[16] Juneja K, Rana C. An improved weighted decision tree approach for breast cancer prediction. In: International Journal of Information Technology, 2018
[17] Yue W, et al. Machine learning with applications in breast cancer diagnosis and prognosis. Designs. 2018;2(2):13
[18] Azar AT, El-Metwally SM. Decision tree classifiers for automated medical diagnosis. Neural Comput Appl. 2012;23(7–8):2387–403.
[19] Senapati MR, Mohanty AK, Dash S, Dash PK. Local linear wavelet neural network for breast cancer recognition. Neural Comput Appl. 2013;22(1):125–31
[20] Azar AT, El-Said SA. Performance analysis of support vector machines classifers in breast cancer mammography recognition. Neural Comput Appl.
2013;24(5):1163–77.
[21] Ferreira P, Dutra I, Salvini R, Burnside E. Interpretable models to predict Breast Cancer. In: Proc. IEEE International Conference on Bioinformatics and
Biomedicine (BIBM), Shenzhen, 2016, pp. 1507–1511.
[22] Jhajharia S, Verma S, Kumar R. A cross-platform evaluation of various decision tree algorithms for prognostic analysis of breast cancer data. In: Proc.
International Conference on Inventive Computation Technologies (ICICT), Coimbatore, 2016, pp. 1–7.
[23] Breast Cancer Wisconsin (Original) Data Set, [Online]. https:// archive.ics.uci.edu/ml/machine-learning-databases/breast-cance r-wisconsin/breast-cancer-
wisconsin.data. Accessed 25 Aug 2018