Presentation 3
Presentation 3
◦ Unlabeled dataset
Categorization of machine
learning-
•ML is categorized in four part- Classification
• Supervised learning Supervised
learning
• Used labeled data set.
Regression
• Find the relationships between the feature set and the label set.
• Unsupervised learning
• Used unlabeled data set. Clustering
• Output is not given Unsupervised
learning
• Reinforcement learning
Association
• Used to solve the problem of decision making.
• Semi-supervised learning
• Used labeled and unlabeled data.
Introduction about cancer
Cancer is derived from the word ”CRAB”
The term cancer specifically refers to a new growth which has the ability to invade surrounding
tissues, metastasize (spread to other organs).
Fig: Cancer
Types of Tumors
Benign
Benign tumors are non-
malignant/non-cancerous tumor
Tumor Malignant
Malignant tumors are cancerous
growths.
A cancer is another word for a
malignant tumor
Question
Test, Prediction
Training Machine
Patterns + Learning Classifier
class labels Algorithm
Feature
selection Model Predicted
selection Class
Performance
Feature Training/Testing
Data set Pre-processing evaluation Deployment
extraction model
matrices
Performance Measure Definition
TP (True Positive) Correctly classified Benign
tuples.
1. Confusion matrix
2. Sensitivity=TP/(TP+FN)
FP (False Positive) False classified Malignant
tuples that are marked Benign. 3. Specificity=TN/(FP+TN).
4. Accuracy=(TP+TN)/(TP+FP+FN+TN)
TN (True Negative) Correctly classified Malignant
tuples.
Pre-processing:-
• To enhance the image
• Noise are removed from a mammogram.
Segmentation :-
• To find the region of interest
Fig- Breast cancer prediction system[1]
2. Rana et al. [3]
Author used UCI depository (WDBC data set)