F1 - Score
VKC
AK
KK
AGENDA
Introducing the F1 score
Accuracy
Imbalanced data example
Solving imbalanced data
Precision and Recall: foundations of the F1 score
The F1 score: combining Precision and Recall
Conclusion
INTRODUCING THE F1 SCORE
The F1 score is a machine learning metric used to evaluate classification models. Although many metrics exist for classification models, in what follows you will discover how the F1 score is calculated and when it adds value.
Suppose that classifier A has a higher recall and classifier B has a higher precision. In this case, the F1 scores of the two classifiers can be used to determine which one produces better results.
ACCURACY
• Accuracy is a metric for classification models that measures the number of correct predictions as a percentage of the total number of predictions made.
• As an example, if 90% of your predictions are correct, your accuracy is simply 90%.
• Accuracy is a useful metric only when the classes in your data are roughly equally distributed.
• This means that if you have a use case in which you observe far more data points of one class than of another, accuracy is no longer a useful metric. A sketch of the computation follows, and the next section illustrates the problem with an example.
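As a quick illustration, here is a minimal Python sketch of the accuracy computation; the label vectors are made up for the example:

import numpy as np

# Hypothetical true labels and predictions (1 = positive class, 0 = negative class)
y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0, 0, 1])
y_pred = np.array([1, 0, 0, 1, 0, 0, 1, 1, 0, 1])

# Accuracy = correct predictions / total predictions
accuracy = np.mean(y_true == y_pred)
print(f"Accuracy: {accuracy:.0%}")  # 80%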
IMBALANCED DATA EXAMPLE
• Imagine you are working on the sales data of a
website. You know that 99% of website visitors don’t
buy and that only 1% of visitors buy something. You
are building a classification model to predict which
website visitors are buyers and which are just lookers.
• Now imagine a model that doesn’t work very well: it predicts that 100% of your visitors are just lookers and that 0% are buyers. It is clearly a useless model.
What would happen if we used the accuracy formula on this model? The model got only 1% of its predictions wrong: all the buyers were misclassified as lookers. The percentage of correct predictions is therefore 99%.
The problem here is that an accuracy of 99% sounds like a great result, whereas your model performs very poorly.
In conclusion: accuracy is not a good metric to use when you have class imbalance.
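To make this concrete, here is a small sketch with simulated labels (not real sales data) showing how the all-lookers model still scores roughly 99% accuracy:

import numpy as np

# Simulated visitor labels: roughly 1% buyers (1), 99% lookers (0).
# The numbers are illustrative, not real sales data.
rng = np.random.default_rng(0)
y_true = (rng.random(10_000) < 0.01).astype(int)

# A useless model that predicts "looker" for every single visitor
y_pred = np.zeros_like(y_true)

accuracy = np.mean(y_true == y_pred)
print(f"Buyers in the data: {y_true.sum()}")
print(f"Accuracy of the all-lookers model: {accuracy:.1%}")  # roughly 99%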
SOLVING IMBALANCED DATA
Solving imbalanced data through resampling
One way to address class imbalance is to work on your sample: with specific resampling methods, you can resample your data set so that it is no longer imbalanced, as the sketch below illustrates.
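As an illustrative sketch (the data here is synthetic, and oversampling is just one option), the minority class can be oversampled with scikit-learn's resample utility:

import numpy as np
from sklearn.utils import resample

# Hypothetical imbalanced data set: 990 lookers (0) and 10 buyers (1)
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 3))
y = np.array([0] * 990 + [1] * 10)

# Oversample the minority class (with replacement) to match the majority
X_minority, y_minority = X[y == 1], y[y == 1]
X_up, y_up = resample(X_minority, y_minority, replace=True, n_samples=990, random_state=0)

X_balanced = np.vstack([X[y == 0], X_up])
y_balanced = np.concatenate([y[y == 0], y_up])
print(np.bincount(y_balanced))  # [990 990]

Undersampling the majority class works the same way with replace=False, and dedicated libraries such as imbalanced-learn offer more advanced resampling methods.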
FOUNDATIONS OF THE F1 SCORE

PRECISION: THE FIRST PART OF THE F1 SCORE
• Precision is the first part of the F1 score. It can also be used as an individual machine learning metric. Its formula is:

Precision = TP / (TP + FP)

RECALL: THE SECOND PART OF THE F1 SCORE
• Recall is the second component of the F1 score, although recall can also be used as an individual machine learning metric. The formula for recall is:

Recall = TP / (TP + FN)

where TP, FP, and FN are the counts of true positives, false positives, and false negatives.
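A minimal sketch of both formulas in Python, using hypothetical prediction vectors:

import numpy as np

# Hypothetical prediction vectors (1 = positive, 0 = negative)
y_true = np.array([1, 1, 1, 0, 0, 0, 0, 1, 0, 0])
y_pred = np.array([1, 0, 1, 1, 0, 0, 0, 1, 0, 1])

tp = np.sum((y_pred == 1) & (y_true == 1))  # true positives
fp = np.sum((y_pred == 1) & (y_true == 0))  # false positives
fn = np.sum((y_pred == 0) & (y_true == 1))  # false negatives

precision = tp / (tp + fp)  # of all predicted positives, how many are correct?
recall = tp / (tp + fn)     # of all actual positives, how many were found?
print(f"Precision: {precision:.2f}, Recall: {recall:.2f}")  # 0.60, 0.75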
THE F1 SCORE FORMULA
The F1 score is defined as the harmonic mean of precision and recall. As a short reminder, the harmonic mean is an alternative to the more common arithmetic mean and is often useful when averaging rates.
In the F1 score, we average precision and recall. Since both are rates, the harmonic mean is a logical choice. The F1 score formula is:

F1 = 2 * (Precision * Recall) / (Precision + Recall)
• A model will obtain a high F1 score if both Precision and Recall are high
• A model will obtain a low F1 score if both Precision and Recall are low
• A model will obtain a medium F1 score if one of Precision and Recall is low and the other is high; because the harmonic mean is pulled toward the lower value, the score ends up closer to the weaker of the two, as the sketch below shows
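A short sketch that computes the F1 score by hand, checks it against scikit-learn's f1_score using the vectors from the previous sketch, and shows how one low value drags the score down:

from sklearn.metrics import f1_score
import numpy as np

# F1 by hand: the harmonic mean of precision and recall
precision, recall = 0.60, 0.75
f1_manual = 2 * precision * recall / (precision + recall)
print(f"F1 (by hand): {f1_manual:.2f}")  # 0.67

# The same value from scikit-learn, using the vectors from the previous sketch
y_true = np.array([1, 1, 1, 0, 0, 0, 0, 1, 0, 0])
y_pred = np.array([1, 0, 1, 1, 0, 0, 0, 1, 0, 1])
print(f"F1 (sklearn): {f1_score(y_true, y_pred):.2f}")  # 0.67

# The harmonic mean punishes imbalance: one low value drags the F1 score down
print(2 * 0.9 * 0.1 / (0.9 + 0.1))  # 0.18, far below the arithmetic mean of 0.5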
SUMMARY
In conclusion, whenever possible, you should look at multiple metrics for each of the models that you try out. Each metric has advantages and disadvantages, and each will give you specific information on the strengths and weaknesses of your model.
The real difficulty arises when doing automated model training, or when using Grid Search to tune models. In those cases, you'll have to specify a single metric to optimize, as in the sketch below.
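A minimal sketch of optimizing a single metric in a scikit-learn Grid Search, assuming you have settled on the F1 score; the classifier, parameter grid, and synthetic data are placeholders:

from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

# Placeholder data with a 90/10 class imbalance
X, y = make_classification(n_samples=500, weights=[0.9, 0.1], random_state=0)

search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid={"C": [0.01, 0.1, 1, 10]},
    scoring="f1",  # the single metric that Grid Search will optimize
    cv=5,
)
search.fit(X, y)
print(search.best_params_, f"best F1: {search.best_score_:.2f}")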
In this case, my advice would be to have a good look at multiple different metrics of one or a
few sample models. Then, when you understand the implications for your specific use case, you
can choose one metric for optimization or tuning.
If you move your model to production for long-term use, you should regularly come back for model maintenance and verify that the model is still behaving as it should.
THANK YOU