Cross Validation
Chandan B K
By
Mrs. Sridevi S
Asst. Professor,
Department of Computer Science Engineering
Contents
Cross validation
Types of cross validation
Leave-one-out cross validation
Hold-out method
K-fold cross validation
Stratified K-fold cross validation
Time series cross validation
Cross Validation
Cross-validation is a resampling technique used to assess how well a model will perform, in terms of efficiency and accuracy, on unseen data.
It evaluates a Machine Learning model by training several models on different subsets of the available input data and evaluating each of them on the complementary subset.
It involves reserving a particular sample of the dataset on which the model is not trained; the model is later tested on this sample before being finalized.
Steps Involved In Cross Validation
Shuffle the dataset randomly.
Split the dataset into k groups.
For each unique group:
Take the group as a hold-out (test) data set.
Take the remaining groups as a training data set.
Fit a model on the training set and evaluate it on the test set.
Retain the evaluation score and discard the model.
Summarize the skill of the model using the sample of evaluation scores.
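A minimal sketch of these steps, assuming NumPy arrays and a scikit-learn-style estimator with fit/score methods; the names k_fold_cv and model_factory are illustrative and not part of the slides:

import numpy as np

def k_fold_cv(model_factory, X, y, k=5, seed=42):
    # Step 1: shuffle the dataset randomly (via a permutation of the row indices).
    rng = np.random.default_rng(seed)
    indices = rng.permutation(len(X))
    # Step 2: split the shuffled indices into k roughly equal groups.
    folds = np.array_split(indices, k)
    scores = []
    # Step 3: each group takes a turn as the hold-out (test) set,
    # with the remaining groups forming the training set.
    for i in range(k):
        test_idx = folds[i]
        train_idx = np.concatenate([folds[j] for j in range(k) if j != i])
        model = model_factory()                    # fresh model for every fold
        model.fit(X[train_idx], y[train_idx])
        scores.append(model.score(X[test_idx], y[test_idx]))  # retain the score, discard the model
    # Step 4: summarize the model's skill using the sample of evaluation scores.
    return np.mean(scores), np.std(scores)

# Example usage with a placeholder dataset and estimator:
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)
print(k_fold_cv(lambda: LogisticRegression(max_iter=1000), X, y, k=5))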
Advantages and Disadvantages of Cross Validation
Pros
Reduces overfitting.
Supports hyperparameter tuning.
Cons
Increases training time.
Requires more computation.
Cross Validation Techniques
Exhaustive Methods
Leave-one-out cross validation
Leave-p-out cross validation
Non-exhaustive Methods
Hold-out method
K-fold cross validation
Stratified K-fold cross validation
Time series cross validation
Leave One Out Cross Validation
Leave-one-out cross-validation is a special case of cross-
validation where the number of folds equals the number
of instances in the data set.
In the general leave-p-out scheme, if there are n data points, n − p points are used for training in each iteration and the remaining p points are used for validation.
In leave-one-out, only a single data point is used as the test set, i.e. p = 1.
Leave One Out Cross Validation
Pros and Cons:
A large number of iterations when the data set is large.
Low-bias approach.
Requires more computational power.
No randomness in the train/test splits.
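As a small illustration, scikit-learn's LeaveOneOut splitter can be plugged into cross_val_score; the iris dataset and logistic regression below are placeholders, not part of the slides:

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import LeaveOneOut, cross_val_score

X, y = load_iris(return_X_y=True)
loo = LeaveOneOut()                          # number of folds == number of instances (p = 1)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=loo)
print(len(scores))                           # one score per data point (150 for iris)
print(scores.mean())                         # average accuracy over all n iterations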
Hold Out Method
In this approach we divide the entire dataset into two parts, viz. training data and testing data.
The training data is usually more than twice the size of the testing data, so the data is commonly split in a 70:30 or 80:20 ratio.
The data is shuffled randomly before splitting, so the model is trained on a different combination of data points each time the split is made.
Hold Out Method
Pros and Cons:
The model can give different results every time we train it, and this can be a cause of instability.
We can never be sure that the train set we picked is representative of the whole dataset.
When the dataset is not very large, there is a high chance that the testing data contains important information that the model never sees, since we do not train on the testing set.
The hold-out method is a good choice when you have a very large dataset or when you are building an initial model in your data science project.
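A minimal sketch of the hold-out method with scikit-learn's train_test_split, assuming a 70:30 split; the dataset and estimator are placeholders:

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
# Shuffle (on by default) and split once in a 70:30 ratio.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print(model.score(X_test, y_test))           # accuracy on the reserved 30% of the data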
K -Fold Cross Validation
In K-fold cross-validation, the data is divided into k subsets.
Each time, one of the k subsets is used as the validation set and the other k − 1 subsets together form the training set.
The evaluation metric is averaged over all k trials to estimate the overall performance of the model.
K -Fold Cross Validation
A large value of k gives an estimate with less bias but higher variance; it also means more data samples are used for training in each fold, which tends to give a better, more precise estimate.
The true error is estimated as the average error rate on the test folds.
K -Fold Cross Validation
Pros and Cons:
Computation time is reduced compared to leave-one-out, as the process is repeated only k times (e.g. 10 times when k = 10).
Reduced bias.
The variance of the estimate is reduced as k increases.
Training is computationally intensive, as the algorithm has to be rerun from scratch k times.
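For reference, a small sketch using scikit-learn's KFold splitter with k = 10; the iris data and logistic regression are stand-ins:

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold, cross_val_score

X, y = load_iris(return_X_y=True)
kf = KFold(n_splits=10, shuffle=True, random_state=42)       # k = 10 folds
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=kf)
print(scores)                                # one accuracy score per fold
print(scores.mean(), scores.std())           # averaged over all k trials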
Stratified K-Fold Cross Validation
Stratified sampling is a sampling technique where samples are selected in the same proportions as they appear in the population.
Stratified K-fold is used when random shuffling and splitting alone are not sufficient and we want the correct distribution of data in each fold.
For regression problems, folds are selected so that the mean response value is approximately equal in all folds.
For classification problems, folds are selected to have the same proportion of class labels.
Stratified K-Fold Cross Validation
Pros and Cons:
It can improve different models through hyperparameter tuning.
Helps us compare models.
It helps in reducing both bias and variance.
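A short sketch with scikit-learn's StratifiedKFold, which preserves the class proportions in every fold; the dataset and estimator are placeholders:

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score

X, y = load_iris(return_X_y=True)
skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
# The splitter uses the labels y so that each fold keeps the same class proportions.
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=skf)
print(scores.mean())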
Time Series Cross Validation
Cross-validating a time-series model means cross-validating on a rolling basis.
We start with a small subset of the data for training, forecast the later data points, and then check the accuracy of the forecasts.
The forecasted data points are then included in the next training dataset, and the subsequent data points are forecasted, and so on.
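A rolling-origin sketch using scikit-learn's TimeSeriesSplit on a toy trending series; the data and linear model are illustrative assumptions:

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import TimeSeriesSplit

# Toy series with a linear trend plus noise; earlier points always precede later ones.
X = np.arange(100).reshape(-1, 1)
y = 0.5 * np.arange(100) + np.random.default_rng(0).standard_normal(100)

tscv = TimeSeriesSplit(n_splits=5)
for train_idx, test_idx in tscv.split(X):
    model = LinearRegression().fit(X[train_idx], y[train_idx])
    score = model.score(X[test_idx], y[test_idx])
    # Each split's training window grows to include the points forecasted in the previous split.
    print(f"train size={len(train_idx):3d}  test size={len(test_idx):2d}  R^2={score:.3f}")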
Thank You