
Department of CSE
MACHINE LEARNING (21CS2226F)

Topic: Performance Metrics

Session - 02
AIM OF THE SESSION

To build accurate and efficient machine learning models by measuring their performance on both classification and regression tasks.

INSTRUCTIONAL OBJECTIVES

This session is designed to:

1. Understand the metrics used to monitor and measure the performance of a machine learning model.
2. Apply these metrics to solve classification and regression problems.

LEARNING OUTCOMES

At the end of this session, you should be able to:

1. Describe the different metrics used to monitor and measure the performance of a machine learning model, and
2. Apply these metrics to validate the performance of the output generated by a machine learning model.
Performance metrics

• How do we validate the performance of the output generated by a machine learning model?
• Metrics are needed to monitor and measure the performance of a model.
• Machine learning problems are generally divided into regression and classification problems.
• Hence, metrics are divided into:
  • Regression metrics
  • Classification metrics
Regression metrics

• Regression models generate continuous output.
• Hence, a distance-based calculation between the predicted output and the ground truth data is essential.
• The most popular metrics to evaluate regression models are:
  • Mean Absolute Error (MAE)
  • Mean Squared Error (MSE)
  • Root Mean Squared Error (RMSE)
  • R² (R-Squared)
  • Adjusted R²
Regression metrics
• Mean Absolute Error (MAE)

• Mean Absolute Error is the average of the absolute differences between the ground truth and the predicted values.
• Mathematically, it is represented as:

  $\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n} \lvert y_i - \hat{y}_i \rvert$

• It is more robust to outliers than squared-error metrics, since large errors are not magnified.
• The error is in the same units as the target, so interpretation is straightforward.
• It gives us a measure of how far the predictions were from the actual output.
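To make the computation concrete, here is a minimal NumPy sketch (the names mae, y_true, and y_pred are illustrative, not from the slides):

```python
import numpy as np

def mae(y_true, y_pred):
    """Mean Absolute Error: average absolute difference between truth and prediction."""
    y_true, y_pred = np.asarray(y_true, dtype=float), np.asarray(y_pred, dtype=float)
    return np.mean(np.abs(y_true - y_pred))

# Example: absolute errors of 0.5, 0.0, and 2.0 average to ~0.83
print(mae([3.0, 5.0, 2.0], [2.5, 5.0, 4.0]))
```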
Regression metrics
• Mean Squared Error (MSE)

• Mean Squared Error is the average of the squared differences between the ground truth and the values predicted by the regression model.
• Mathematically, it is represented as:

  $\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n} (y_i - \hat{y}_i)^2$

• It is more sensitive to outliers than other metrics, because squaring magnifies large errors.
• It is differentiable; hence it can be optimized better.
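A matching sketch under the same assumptions (illustrative function and array names):

```python
import numpy as np

def mse(y_true, y_pred):
    """Mean Squared Error: average of squared differences."""
    y_true, y_pred = np.asarray(y_true, dtype=float), np.asarray(y_pred, dtype=float)
    return np.mean((y_true - y_pred) ** 2)

# The error of 2.0 from the MAE example dominates once squared: (0.25 + 0 + 4) / 3 ≈ 1.42
print(mse([3.0, 5.0, 2.0], [2.5, 5.0, 4.0]))
```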
Regression metrics
• Root Mean Squared Error (RMSE)

• Root Mean Squared Error is the square root of the average of the squared differences between the ground truth and the values predicted by the regression model.
• Mathematically, it is represented as:

  $\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n} (y_i - \hat{y}_i)^2}$

• The square root brings the error back to the units of the target, so interpretation is straightforward.
• It is less sensitive to outliers than MSE.
• It is differentiable; hence it can be optimized better.
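The same sketch, with the square root applied (names again illustrative):

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root Mean Squared Error: square root of the MSE."""
    y_true, y_pred = np.asarray(y_true, dtype=float), np.asarray(y_pred, dtype=float)
    return np.sqrt(np.mean((y_true - y_pred) ** 2))

print(rmse([3.0, 5.0, 2.0], [2.5, 5.0, 4.0]))  # ≈ 1.19, back in target units
```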
Regression metrics
• R-Squared (R²)

• The R-squared metric compares the model against a constant baseline (always predicting the mean of the target) to determine the performance of the regression model.
• Mathematically, it is represented as:

  $R^2 = 1 - \frac{\sum_{i}(y_i - \hat{y}_i)^2}{\sum_{i}(y_i - \bar{y})^2}$

• If the sum of squared errors of the regression line is small, R² will be close to 1 (the ideal), meaning the regression captured most of the variance in the target variable.
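A minimal sketch of the formula above, assuming illustrative names:

```python
import numpy as np

def r_squared(y_true, y_pred):
    """R²: 1 minus the ratio of residual variance to baseline variance."""
    y_true, y_pred = np.asarray(y_true, dtype=float), np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)          # model's squared error
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)   # mean-predictor baseline's squared error
    return 1.0 - ss_res / ss_tot

print(r_squared([3.0, 5.0, 2.0], [2.5, 5.0, 4.0]))  # close to 0: barely better than the mean
```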
Regression metrics
• Adjusted R²

• R² can look high even when the model overfits: adding independent variables never decreases it, whether or not the model has actually learned anything. To overcome this problem, R² is adjusted for the number of independent variables.
• Mathematically, it is represented as:

  $R^2_{\text{adj}} = 1 - \frac{(1 - R^2)(n - 1)}{n - k - 1}$

  n = number of observations,
  k = number of independent variables

• The adjusted R² is always lower than (or equal to) R².
• It increases only when a newly added variable brings a real improvement.
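The adjustment is a one-line formula; this sketch (illustrative name adjusted_r_squared) shows the penalty in action:

```python
def adjusted_r_squared(r2, n, k):
    """Adjusted R²: penalizes R² for the number of independent variables k."""
    return 1.0 - (1.0 - r2) * (n - 1) / (n - k - 1)

# An R² of 0.90 from 50 observations and 5 predictors shrinks slightly:
print(adjusted_r_squared(0.90, n=50, k=5))  # ≈ 0.889
```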
Classification metrics

• Classification models generate discrete output.
• Hence, a metric is required that compares discrete classes.
• The most popular metrics to evaluate classification models are:
  • Confusion Matrix
  • Precision and Recall
  • F1-score
  • AU-ROC
  • Accuracy
Classification metrics
• Confusion Matrix

• The Confusion Matrix is the easiest way to measure the performance of a classification model.
• TP signifies how many positive-class samples your model predicted correctly.
• TN signifies how many negative-class samples your model predicted correctly.
• FP signifies how many negative-class samples your model predicted incorrectly.
• FN signifies how many positive-class samples your model predicted incorrectly.

                          True value
                          1                      0
  Predicted    1    True Positive (TP)    False Positive (FP)
  value        0    False Negative (FN)   True Negative (TN)
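A minimal sketch of the four counts for binary 0/1 labels (the function and variable names are illustrative):

```python
import numpy as np

def confusion_counts(y_true, y_pred):
    """Return (TP, FP, FN, TN) for binary labels, where 1 is the positive class."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))
    tn = int(np.sum((y_true == 0) & (y_pred == 0)))
    return tp, fp, fn, tn

print(confusion_counts([1, 0, 1, 1, 0], [1, 0, 0, 1, 1]))  # (2, 1, 1, 1)
```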
Classification metrics
• Precision and Recall

• Precision is defined as the ratio of TP to the total number of samples predicted as positive.
• Mathematically, it is represented as:

  $\text{Precision} = \frac{TP}{TP + FP}$

• Recall is defined as the ratio of TP to the total number of actual positives.
• Mathematically, it is represented as:

  $\text{Recall} = \frac{TP}{TP + FN}$
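Both follow directly from the confusion-matrix counts; a short sketch (illustrative names):

```python
def precision_recall(tp, fp, fn):
    """Precision and recall from confusion-matrix counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return precision, recall

# Using the counts from the confusion-matrix example above:
print(precision_recall(tp=2, fp=1, fn=1))  # (0.666..., 0.666...)
```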
Classification metrics
• F1-score

• The F1-score is the harmonic mean of precision and recall.
• Mathematically, it is represented as:

  $F_1 = 2 \cdot \frac{\text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}}$

• It gives equal importance to precision and recall.
• It represents a good balance between precision and recall and gives good results on imbalanced classification problems.
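The harmonic mean in one line (illustrative name f1_score):

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)

print(f1_score(precision=2/3, recall=2/3))  # 0.666..., equal weight to both
```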
Classification metrics
• AU-ROC (Area Under the Receiver Operating Characteristic Curve)

• AU-ROC makes use of the True Positive Rate (TPR) and the False Positive Rate (FPR) to visualize the performance of the classification model.
• Mathematically, they are represented as:

  $TPR = \frac{TP}{TP + FN} \qquad FPR = \frac{FP}{FP + TN}$

• A high AUC means that a randomly chosen positive example is ranked above a randomly chosen negative example with high probability.
• ROC curves aren't a good choice when your problem has a huge class imbalance.
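As a sketch, scikit-learn's roc_auc_score computes the area directly from predicted scores (the toy labels and scores here are illustrative):

```python
from sklearn.metrics import roc_auc_score

y_true = [0, 0, 1, 1]            # true binary labels
y_score = [0.1, 0.4, 0.35, 0.8]  # model's predicted probabilities for class 1
print(roc_auc_score(y_true, y_score))  # 0.75
```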
Classification metrics
• Accuracy

• Accuracy tells the overall effectiveness of the classifier.
• Mathematically, it is represented as:

  $\text{Accuracy} = \frac{TP + TN}{N}$

  N is the total sample size.

• It is the simplest metric to use and implement.
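A minimal sketch, equivalent to (TP + TN) / N for binary labels (names illustrative):

```python
import numpy as np

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    return np.mean(y_true == y_pred)

print(accuracy([1, 0, 1, 1, 0], [1, 0, 0, 1, 1]))  # 3 of 5 correct -> 0.6
```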
Self-Assessment Questions

1. Which among the following evaluation metrics would you NOT use to measure the
performance of a classification model?

(a) Precision
(b) Recall
(c) Mean Squared Error
(d) F1-score

2. The true-positive rate is also referred to as

(a) Recall
(b) Accuracy
(c) Precision
(d) F1-score

Self-Assessment Questions

3. A single metric which combines both precision and recall is the

(a) Precision
(b) Recall
(c) Mean Squared Error
(d) F1-score

4. Which metric is the average squared difference between the classifier's predicted output and the actual output?

(a) Mean Squared Error
(b) Mean Absolute Error
(c) Root Mean Squared Error
(d) Mean Relative Error

REFERENCES FOR FURTHER LEARNING OF THE SESSION

Text Books:
1. Mitchell, Tom. Machine Learning. New York, NY: McGraw-Hill, 1997. ISBN: 9780070428072.
2. MacKay, David. Information Theory, Inference, and Learning Algorithms. Cambridge, UK: Cambridge University Press, 2003. ISBN: 9780521642989.

Reference Books:
3. Alpaydin, Ethem. Introduction to Machine Learning. The MIT Press, 2010.
4. Marsland, Stephen. Machine Learning: An Algorithmic Perspective. CRC Press, 2009.

Sites and Web Links:
5. Data Science and Machine Learning: https://www.edx.org/course/data-science-machinelearning
6. Machine Learning: https://www.ocw.mit.edu/courses/6-867-machine-learning-fall-2006/
THANK YOU

Team – MACHINE LEARNING

