Chapter 3: Model Evaluation

Uploaded by

Lusi ሉሲ
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
53 views30 pages

Chapter 3 Model Evaluation Final

Uploaded by

Lusi ሉሲ
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 30

Evaluating Classification & Predictive Performance of Data Mining Models

Data Mining and Warehousing

By Gaddisa Olani (Ph.D)
1
Why Evaluate?

 Multiple methods are available to classify or predict.
 For each method, multiple choices are available for settings.
 To choose the best model, we need to assess each model’s performance.
2
Misclassification error

 Error = classifying a record as belonging to one class when it actually belongs to another class.
3
Naïve Rule

 Naïve rule: classify all records as belonging to the most prevalent class.
 Often used as a benchmark: we hope to do better than that.
 Ignore your model if it’s not better than the naïve rule.

4
How to evaluate DM models?

We need:

 Train dataset: used to fit the machine learning model (X_train, Y_train).
 Test dataset: used to evaluate the fitted machine learning model (X_test, Y_test).
5
How to evaluate DM models?

 During training:
 Give both X_train and Y_train to the algorithm. After training you will have a model.
6
How to evaluate DM models?

 During testing:
 The goal is to assess the performance of your model by giving it only X_test. The model then predicts the value of Y_test.
 Finally, we compare the Y_test predicted by the model with the true Y_test that we hid from the model.
7
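A minimal sketch of this train-then-test loop, assuming scikit-learn; the decision tree is only an example classifier (the slides do not prescribe a specific model), and X_train, Y_train, X_test, Y_test are assumed to come from the split described on the next slide:

# Minimal sketch of the training/testing loop (assumes scikit-learn).
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

model = DecisionTreeClassifier(random_state=0)
model.fit(X_train, Y_train)            # training: the model sees both X_train and Y_train

Y_pred = model.predict(X_test)         # testing: the model sees only X_test
print(accuracy_score(Y_test, Y_pred))  # compare predictions with the held-out Y_test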
Partitioning the dataset

a) Old-fashioned train-test split

 Divide your dataset into an 80/20 split:
 80% of your data is used for training, and
 20% of your data is used for testing, hence called test data (a dataset that is independent of the training dataset).
8
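For example, an 80/20 split can be produced with scikit-learn's train_test_split; here X is the feature matrix and y the label vector, both assumed to be loaded already:

# Example 80/20 hold-out split with scikit-learn.
from sklearn.model_selection import train_test_split

X_train, X_test, Y_train, Y_test = train_test_split(
    X, y,               # features and labels (assumed already loaded)
    test_size=0.2,      # 20% of the records are held out as the test set
    random_state=42,    # fixed seed so the split is reproducible
    stratify=y,         # keep class proportions similar in both parts
)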
Partitioning the dataset cont…

b) K-fold cross validation (most commonly used)
 The data is split into K equal folds; each fold is used once as the test set while the remaining K-1 folds are used for training, and the results are averaged.
 Example when the value of K = 8.

9
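One common way to run k-fold cross-validation in code, assuming scikit-learn and k = 8 to match the example above (the classifier is again only illustrative):

# Sketch of 8-fold cross-validation with scikit-learn.
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

model = DecisionTreeClassifier(random_state=0)
scores = cross_val_score(model, X, y, cv=8)  # 8 folds: train on 7, test on the held-out one, rotate

print(scores)         # score on each of the 8 held-out folds
print(scores.mean())  # average performance across the folds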
Confusion Matrix

 A confusion matrix is a table that is often used to describe the performance of a classification model (or "classifier") on a set of test data for which the true values are known.
11
Confusion Matrix
True Positive (TP):
 Cases where the model claims that something has happened and it actually has happened, e.g. the patient has cancer and the model also predicts cancer.

12
Confusion Matrix
True Negative (TN):
 Cases where the model claims that nothing has happened and nothing actually has happened, e.g. the patient doesn’t have cancer and the model also doesn’t predict cancer.

13
Confusion Matrix
False Positive (Type-1 error):
 Cases where the model claims that something has happened when it actually hasn’t, e.g. the patient doesn’t have cancer but the model predicts cancer.

14
Confusion Matrix
False Negative (Type-2 error):
 Cases where the model claims that nothing has happened when something actually has, e.g. the patient has cancer but the model doesn’t predict cancer.

15
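In scikit-learn these four counts can be read off with confusion_matrix; the label vectors below are made-up values purely for illustration (1 = has cancer, 0 = no cancer):

# Illustrative confusion matrix for a binary problem.
from sklearn.metrics import confusion_matrix

y_true = [1, 0, 1, 1, 0, 0, 1, 0]   # hypothetical actual labels
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]   # hypothetical model predictions

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print("TP:", tp, "TN:", tn, "FP:", fp, "FN:", fn)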
Commonly used Classification Model Evaluation Metrics

16
Accuracy
Accuracy is the fraction of predictions our model got right.

Accuracy = (TP + TN) / (TP + TN + FP + FN)

Accuracy should be considered when TP and TN are more important and the dataset is balanced, because in that case the model will not be biased by the class distribution. But in real-life classification problems, imbalanced class distributions are common.
17
PRECISION
Precision quantifies, out of the total predicted positive values, how many were actually positive.

Precision = TP / (TP + FP)

Take the use case of spam detection: suppose a mail is not spam (0), but the model predicts it as spam (1), which is an FP. In this scenario, one can miss an important mail. So here we should focus on reducing FP and must consider precision.

18
RECALL
Recall quantifies, out of the total actual positive values, how many were correctly predicted as positive.

Recall = TP / (TP + FN)

When should recall be considered?

 In cancer detection, suppose a person has cancer (1), but the model does not predict it (0), which is an FN. This could be a disaster. So in this scenario, we should focus on reducing FN and must consider recall.

19
F-1 Score
 Use the F-1 score when FP and FN are both equally important. It allows the model to be judged on precision and recall equally, using a single score.

F1-score = 2 * (Precision * Recall) / (Precision + Recall)

20
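The four metrics can be computed directly from the confusion-matrix counts; the counts below are hypothetical, chosen only to show the formulas in code:

# Accuracy, precision, recall and F1 from hypothetical confusion-matrix counts.
tp, tn, fp, fn = 80, 900, 20, 40

accuracy  = (tp + tn) / (tp + tn + fp + fn)
precision = tp / (tp + fp)
recall    = tp / (tp + fn)
f1        = 2 * precision * recall / (precision + recall)

print(f"Accuracy:  {accuracy:.3f}")   # 0.942
print(f"Precision: {precision:.3f}")  # 0.800
print(f"Recall:    {recall:.3f}")     # 0.667
print(f"F1-score:  {f1:.3f}")         # 0.727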
Exercise 1: How do you read the following confusion matrix?

Classification Confusion Matrix
                    Predicted Class
Actual Class        1          0
1                   201        85
0                   25         2689

 201 1’s correctly classified as “1”
 85 1’s incorrectly classified as “0”
 25 0’s incorrectly classified as “1”
 2689 0’s correctly classified as “0”

21
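For instance, from these counts the overall accuracy would be (201 + 2689) / (201 + 85 + 25 + 2689) = 2890 / 3000 ≈ 96.3%, while recall for class “1” would be 201 / (201 + 85) ≈ 70.3%.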
Exercise

22
Exercise 2: You build a data mining model for e-mail spam detection. The result of your model is summarized in the confusion matrix below. Calculate the Accuracy, Precision, Recall and F1-score of your model, and write a short description of your findings.

23
Problem
 Imbalanced classification is the problem of classification when there is an unequal distribution of classes in the training dataset.

 E.g. there are 3000 HIV-negative samples and 200 positive samples in your training data.

24
Problem
 Accuracy will fail as a metric when there is an imbalanced class distribution.

 This happens in problems such as fault detection or fraud detection (and other rare events).

 Many machine learning models are designed around the assumption of a balanced class distribution, and often learn simple rules (explicit or otherwise) such as always predicting the majority class. This can give them an accuracy of, say, 99 percent, while in practice they perform no better than an unskilled majority-class classifier.

25
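For instance, with the earlier example of 3000 HIV-negative and 200 HIV-positive samples, a model that always predicts “negative” already reaches 3000 / 3200 ≈ 93.8% accuracy while detecting none of the positive cases (recall = 0).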
Exercise 3: The confusion matrix summarized below depicts the performance of a Cyber Attack Detection Model (a kind of antivirus) based on data mining. Calculate the Accuracy of this AV and state your findings. Note: Category A means not a virus, Category B means a virus.

26
Solutions to deal with the class imbalance problem

 Re-sampling techniques: over-sample the minority class and/or under-sample the majority class.

27
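One simple re-sampling approach is random over-sampling of the minority class. Below is a sketch using scikit-learn's resample utility; the DataFrame df and its binary "label" column are assumptions made only for illustration:

# Random over-sampling of the minority class with sklearn.utils.resample.
import pandas as pd
from sklearn.utils import resample

# df is assumed to be a DataFrame with a binary "label" column (1 = minority class).
minority = df[df["label"] == 1]
majority = df[df["label"] == 0]

minority_upsampled = resample(
    minority,
    replace=True,              # sample with replacement
    n_samples=len(majority),   # grow the minority class to the majority class size
    random_state=42,
)

balanced_df = pd.concat([majority, minority_upsampled])
print(balanced_df["label"].value_counts())   # classes are now balanced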
Reporting performance results in your thesis or paper

 Compare your work using different models
 Compare your work with other works

28
Reporting performance results in your thesis or paper

29
Demo

30
