lecture slide 12

Ensemble techniques combine multiple models to improve prediction accuracy over individual classifiers, utilizing methods such as bagging and boosting. Bagging involves creating multiple datasets through sampling with replacement and building classifiers on each, while boosting adaptively adjusts the weights of training data to focus on misclassified records. The document provides detailed examples and algorithms for both methods, illustrating their effectiveness in classification tasks.

Uploaded by

2023aib1008

Ensemble Techniques

Ensemble Methods

Ensemble methods combine multiple models and can perform better than the individual members.

Construct a set of classifiers from the training data.

Predict the class label of test records by combining the predictions made by multiple classifiers.
General Approach

Step 1: Create multiple data sets D1, D2, ..., Dt-1, Dt from the original training data D.

Step 2: Build multiple classifiers C1, C2, ..., Ct-1, Ct, one on each data set.

Step 3: Combine the classifiers into a single classifier C*.
Why Ensemble Methods work?

Suppose there are 25 base classifiers
– Each classifier has error rate ε = 0.35
– Assume errors made by classifiers are uncorrelated
– Probability that the ensemble classifier makes a wrong prediction, i.e. that at least 13 of the 25 base classifiers are wrong under majority voting:
$$P(X \ge 13) = \sum_{i=13}^{25} \binom{25}{i} \varepsilon^{i} (1-\varepsilon)^{25-i} = 0.06$$
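The binomial tail above can be checked numerically. The following is a minimal sketch, assuming independent errors and simple majority voting as stated on the slide; the function name `ensemble_error` is illustrative and not part of the lecture.

```python
from math import comb

def ensemble_error(n_classifiers: int, eps: float) -> float:
    """Probability that a majority of independent base classifiers are wrong."""
    majority = n_classifiers // 2 + 1  # 13 when there are 25 classifiers
    return sum(
        comb(n_classifiers, i) * eps**i * (1 - eps) ** (n_classifiers - i)
        for i in range(majority, n_classifiers + 1)
    )

# Setting from the slide: 25 base classifiers, each with error rate 0.35.
print(f"{ensemble_error(25, 0.35):.2f}")
```

With ε = 0.5 the same function returns 0.5, which illustrates that the ensemble only helps when the base classifiers are better than random guessing.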
Types of Ensemble Methods

Manipulate data distribution

– Example: bagging, boosting
Manipulate input features
– Example: random forests
Manipulate class labels
– Example: error-correcting output coding
Bagging

Sampling with replacement

Original Data 1 2 3 4 5 6 7 8 9 10
Bagging (Round 1) 7 8 10 8 2 5 10 10 5 9
Bagging (Round 2) 1 4 9 1 2 3 2 7 3 2
Bagging (Round 3) 1 8 5 10 5 5 9 6 3 7

Build classifier on each bootstrap sample

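Each record has probability 1 – (1 – 1/n)^k of being selected at least once in a bootstrap sample of k draws; (1 – 1/n)^k is the probability that it is never drawn.
If k = n, this is the standard case used in bagging, and the selection probability approaches 1 – 1/e ≈ 0.632 for large n.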
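As a quick check of the selection probability, here is a small sketch that evaluates 1 – (1 – 1/n)^k for the standard case k = n. The helper name `inclusion_probability` is an assumption made for this illustration.

```python
def inclusion_probability(n: int, k: int) -> float:
    """Chance that a given record appears at least once in k draws with replacement from n records."""
    return 1.0 - (1.0 - 1.0 / n) ** k

# Standard bagging case k = n: the value settles near 1 - 1/e as n grows.
for n in (10, 100, 1000):
    print(n, round(inclusion_probability(n, n), 4))
```

For the 10-record example used in these slides, roughly 65% of the original records are expected to appear in any one bootstrap sample.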
Bagging Algorithm
Bagging Example

Consider 1-dimensional data set:

Original Data:
x 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
y 1 1 1 -1 -1 -1 -1 1 1 1

Classifier is a decision stump

– Decision rule: x ≤ k versus x > k
– Split point k is chosen based on entropy
– If x ≤ k is true, predict y_left; if false, predict y_right
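One way to fit such a stump is sketched below. It assumes that candidate split points are the midpoints between consecutive distinct x values and that each side predicts its majority label; the slides do not spell out these details, and the names `entropy`, `majority` and `fit_stump` are illustrative. The sample passed in is the Bagging Round 1 training set from the example.

```python
from math import log2

def entropy(labels):
    """Shannon entropy in bits of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    h = 0.0
    for c in set(labels):
        p = labels.count(c) / n
        h -= p * log2(p)
    return h

def majority(labels, default=1):
    """Most frequent label; `default` is returned for an empty side."""
    return max(set(labels), key=labels.count) if labels else default

def fit_stump(xs, ys):
    """Return (k, y_left, y_right) minimising the size-weighted entropy of the split."""
    values = sorted(set(xs))
    if len(values) < 2:  # no split possible: both sides predict the majority label
        return values[0], majority(ys), majority(ys)
    # Assumed candidates: midpoints between consecutive distinct x values.
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]
    best = None
    for k in candidates:
        left = [y for x, y in zip(xs, ys) if x <= k]
        right = [y for x, y in zip(xs, ys) if x > k]
        score = (len(left) * entropy(left) + len(right) * entropy(right)) / len(ys)
        if best is None or score < best[0]:
            best = (score, k, majority(left), majority(right))
    return best[1:]

# Bagging Round 1 sample from the example.
xs = [0.1, 0.2, 0.2, 0.3, 0.4, 0.4, 0.5, 0.6, 0.9, 0.9]
ys = [1, 1, 1, 1, -1, -1, -1, -1, 1, 1]
print(fit_stump(xs, ys))
```

On this sample the sketch picks a split near 0.35 with left class 1 and right class -1. Ties between equally good splits are broken in favour of the smaller k, which is a choice of this sketch rather than something the slides specify.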
Bagging Example

Bagging Round 1:
x 0.1 0.2 0.2 0.3 0.4 0.4 0.5 0.6 0.9 0.9
y 1 1 1 1 -1 -1 -1 -1 1 1
Learned stump: x <= 0.35 → y = 1; x > 0.35 → y = -1

Bagging Round 2:
x 0.1 0.2 0.3 0.4 0.5 0.5 0.9 1 1 1
y 1 1 1 -1 -1 -1 1 1 1 1
Learned stump: x <= 0.7 → y = 1; x > 0.7 → y = 1

Bagging Round 3:
x 0.1 0.2 0.3 0.4 0.4 0.5 0.7 0.7 0.8 0.9
y 1 1 1 -1 -1 -1 -1 -1 1 1
Learned stump: x <= 0.35 → y = 1; x > 0.35 → y = -1

Bagging Round 4:
x 0.1 0.1 0.2 0.4 0.4 0.5 0.5 0.7 0.8 0.9
y 1 1 1 -1 -1 -1 -1 -1 1 1
Learned stump: x <= 0.3 → y = 1; x > 0.3 → y = -1

Bagging Round 5:
x 0.1 0.1 0.2 0.5 0.6 0.6 0.6 1 1 1
y 1 1 1 -1 -1 -1 -1 1 1 1
Learned stump: x <= 0.35 → y = 1; x > 0.35 → y = -1
Bagging Example

Bagging Round 6:
x 0.2 0.4 0.5 0.6 0.7 0.7 0.7 0.8 0.9 1
y 1 -1 -1 -1 -1 -1 -1 1 1 1
Learned stump: x <= 0.75 → y = -1; x > 0.75 → y = 1

Bagging Round 7:
x 0.1 0.4 0.4 0.6 0.7 0.8 0.9 0.9 0.9 1
y 1 -1 -1 -1 -1 1 1 1 1 1
Learned stump: x <= 0.75 → y = -1; x > 0.75 → y = 1

Bagging Round 8:
x 0.1 0.2 0.5 0.5 0.5 0.7 0.7 0.8 0.9 1
y 1 1 -1 -1 -1 -1 -1 1 1 1
Learned stump: x <= 0.75 → y = -1; x > 0.75 → y = 1

Bagging Round 9:
x 0.1 0.3 0.4 0.4 0.6 0.7 0.7 0.8 1 1
y 1 1 -1 -1 -1 -1 -1 1 1 1
Learned stump: x <= 0.75 → y = -1; x > 0.75 → y = 1

Bagging Round 10:
x 0.1 0.1 0.1 0.1 0.3 0.3 0.8 0.8 0.9 0.9
y 1 1 1 1 1 1 1 1 1 1
Learned stump: x <= 0.05 → y = 1; x > 0.05 → y = 1
Bagging Example

Summary of Training sets:

Round Split Point Left Class Right Class

1 0.35 1 -1
2 0.7 1 1
3 0.35 1 -1
4 0.3 1 -1
5 0.35 1 -1
6 0.75 -1 1
7 0.75 -1 1
8 0.75 -1 1
9 0.75 -1 1
10 0.05 1 1
Bagging Example

Assume test set is the same as the original data

Use majority vote to determine class of ensemble
classifier
Round x=0.1 x=0.2 x=0.3 x=0.4 x=0.5 x=0.6 x=0.7 x=0.8 x=0.9 x=1.0
1 1 1 1 -1 -1 -1 -1 -1 -1 -1
2 1 1 1 1 1 1 1 1 1 1
3 1 1 1 -1 -1 -1 -1 -1 -1 -1
4 1 1 1 -1 -1 -1 -1 -1 -1 -1
5 1 1 1 -1 -1 -1 -1 -1 -1 -1
6 -1 -1 -1 -1 -1 -1 -1 1 1 1
7 -1 -1 -1 -1 -1 -1 -1 1 1 1
8 -1 -1 -1 -1 -1 -1 -1 1 1 1
9 -1 -1 -1 -1 -1 -1 -1 1 1 1
10 1 1 1 1 1 1 1 1 1 1
Sum 2 2 2 -6 -6 -6 -6 2 2 2
Predicted class (sign of sum) 1 1 1 -1 -1 -1 -1 1 1 1
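The vote table can be reproduced from the ten stumps in the summary. The sketch below copies the split points and leaf classes from that summary and uses the original data as the test set, as the slide assumes; the helper names are illustrative.

```python
# (split point, left class, right class) for rounds 1-10, copied from the summary table.
STUMPS = [
    (0.35, 1, -1), (0.7, 1, 1), (0.35, 1, -1), (0.3, 1, -1), (0.35, 1, -1),
    (0.75, -1, 1), (0.75, -1, 1), (0.75, -1, 1), (0.75, -1, 1), (0.05, 1, 1),
]
XS = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0]
YS = [1, 1, 1, -1, -1, -1, -1, 1, 1, 1]

def stump_predict(stump, x):
    """Apply one decision stump: left class if x <= k, otherwise right class."""
    k, y_left, y_right = stump
    return y_left if x <= k else y_right

def vote_sum(x):
    """Sum of the +1/-1 votes cast by all stumps for input x."""
    return sum(stump_predict(s, x) for s in STUMPS)

def bagged_predict(x):
    """Majority vote over the stumps, expressed as the sign of the vote sum."""
    return 1 if vote_sum(x) > 0 else -1

sums = [vote_sum(x) for x in XS]
preds = [bagged_predict(x) for x in XS]
print(sums)   # [2, 2, 2, -6, -6, -6, -6, 2, 2, 2]
print(preds)  # [1, 1, 1, -1, -1, -1, -1, 1, 1, 1]
```

No single stump can represent the +1, -1, +1 pattern in the data, yet the combined vote matches every label in the original data set.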
Bagging and Other Ensemble Methods

A cartoon depiction of how bagging works. Suppose we train an ‘8’ detector on the dataset depicted above, containing an ‘8’, a ‘6’ and a
‘9’. Suppose we make two different resampled datasets. The bagging training procedure is to construct each of these datasets by
sampling with replacement. The first dataset omits the ‘9’ and repeats the ‘8’. On this dataset, the detector learns that a loop on top of the
digit corresponds to an ‘8’. On the second dataset, we repeat the ‘9’ and omit the ‘6’. In this case, the detector learns that a loop on the
bottom of the digit corresponds to an ‘8’. Each of these individual classification rules is brittle, but if we average their output then the
detector is robust, achieving maximal confidence only when both loops of the ‘8’ are present.
Boosting

An iterative procedure to adaptively change the distribution of training data by focusing more on previously misclassified records
– Initially, all N records are assigned equal weights
– Unlike bagging, weights may change at the end of each boosting round
Boosting

Records that are wrongly classified will have their weights increased

Records that are classified correctly will have their weights decreased

Original Data 1 2 3 4 5 6 7 8 9 10
Boosting (Round 1) 7 3 2 8 7 9 4 10 6 3
Boosting (Round 2) 5 4 9 4 2 5 1 7 4 2
Boosting (Round 3) 4 4 8 10 4 5 4 6 3 4

• Example 4 is hard to classify

• Its weight is increased, therefore it is more likely to be chosen again in subsequent rounds
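The effect of raising a record's weight on resampling can be sketched as follows. The weight values here are hypothetical, chosen only to show record 4 dominating the sample; they are not the weights behind the table above.

```python
import random
from collections import Counter

def weighted_resample(weights, k, rng):
    """Draw k record ids (1-based) with replacement, with probability proportional to weights."""
    ids = list(range(1, len(weights) + 1))
    return rng.choices(ids, weights=weights, k=k)

rng = random.Random(0)   # fixed seed so repeated runs give the same draws
uniform = [0.1] * 10     # initial boosting round: all records equally likely
boosted = [0.05] * 10
boosted[3] = 0.55        # hypothetical: record 4 has been misclassified repeatedly

print(weighted_resample(uniform, 10, rng))
print(Counter(weighted_resample(boosted, 10, rng)).most_common(3))
```

Under the uniform weights this is ordinary bootstrap sampling as in bagging; boosting differs only in that the weights are revised after each round.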
AdaBoost

The AdaBoost model consists of T weak classifiers: C1, C2, …, CT

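Error rate of classifier C_i over the N training records:

$$\varepsilon_i = \frac{1}{N} \sum_{j=1}^{N} w_j \, \delta\big(C_i(x_j) \ne y_j\big)$$

where δ(p) = 1 if p is true and 0 otherwise.

Importance of a classifier:

$$\alpha_i = \frac{1}{2} \ln\left(\frac{1 - \varepsilon_i}{\varepsilon_i}\right)$$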
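The two formulas can be evaluated directly. The sketch below follows the error rate exactly as written here, including the 1/N factor, and applies it to the 10-record data set of the AdaBoost example with equal initial weights and a stump that predicts -1 for x ≤ 0.75 and +1 otherwise. The function names are illustrative.

```python
from math import log

def weighted_error(weights, predictions, labels):
    """epsilon = (1/N) * sum of w_j over the records that the classifier gets wrong."""
    n = len(labels)
    return sum(w for w, p, y in zip(weights, predictions, labels) if p != y) / n

def importance(eps):
    """alpha = 0.5 * ln((1 - eps) / eps); only defined for 0 < eps < 1."""
    return 0.5 * log((1.0 - eps) / eps)

labels = [1, 1, 1, -1, -1, -1, -1, 1, 1, 1]          # y for x = 0.1 ... 1.0
predictions = [-1, -1, -1, -1, -1, -1, -1, 1, 1, 1]  # stump: x <= 0.75 -> -1, else +1
weights = [0.1] * 10                                 # equal initial weights

eps = weighted_error(weights, predictions, labels)
alpha = importance(eps)
print(round(eps, 3), round(alpha, 3))
```

The stump misclassifies the first three records, so ε = 0.03 and α ≈ 1.738, which agrees with the round 1 value listed in the example summary. A lower error gives a larger α, so more accurate classifiers get a bigger say in the final vote.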
AdaBoost Algorithm

Weight update:

$$w_i^{(j+1)} = \frac{w_i^{(j)}}{Z_j} \times \begin{cases} \exp(-\alpha_j) & \text{if } C_j(x_i) = y_i \\ \exp(\alpha_j) & \text{if } C_j(x_i) \ne y_i \end{cases}$$

where Z_j is the normalization factor

If any intermediate rounds produce error rate higher than 50%, the weights are reverted back to 1/n and the resampling procedure is repeated

Classification:

$$C^{*}(x) = \arg\max_{y} \sum_{j=1}^{T} \alpha_j \, \delta\big(C_j(x) = y\big)$$
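A minimal sketch of the weight update and the final vote is given below. It assumes labels in {-1, +1} and leaves out the reset to 1/n when the error exceeds 50%; the names `update_weights` and `classify` are illustrative. The usage lines apply one update with α = 1.738 to equal weights of 0.1, with the first three of ten records misclassified.

```python
from math import exp

def update_weights(weights, alpha, correct):
    """Scale each weight by exp(-alpha) if correct, exp(+alpha) if wrong, then normalise by Z."""
    scaled = [w * exp(-alpha if ok else alpha) for w, ok in zip(weights, correct)]
    z = sum(scaled)  # normalisation factor Z_j
    return [s / z for s in scaled]

def classify(alphas, votes, classes=(-1, 1)):
    """Return the class y that maximises the sum of alpha_j over classifiers voting for y."""
    score = {y: sum(a for a, v in zip(alphas, votes) if v == y) for y in classes}
    return max(classes, key=lambda y: score[y])

# One update step: records 1-3 were misclassified, records 4-10 were correct.
new_weights = update_weights([0.1] * 10, 1.738, [False] * 3 + [True] * 7)
print([round(w, 3) for w in new_weights])

# Final vote for one input, given three classifiers' importances and their votes.
print(classify([1.738, 2.7784, 4.1195], [-1, 1, 1]))
```

The misclassified records end up with weight of about 0.311 each and the rest with about 0.01, so the next round concentrates on the records the previous classifier got wrong.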
AdaBoost Algorithm
AdaBoost Example

Consider 1-dimensional data set:

Original Data:
x 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
y 1 1 1 -1 -1 -1 -1 1 1 1

Classifier is a decision stump

– Decision rule: x ≤ k versus x > k
– Split point k is chosen based on entropy
– If x ≤ k is true, predict y_left; if false, predict y_right
AdaBoost Example

Training sets for the first 3 boosting rounds:

Boosting Round 1:
x 0.1 0.4 0.5 0.6 0.6 0.7 0.7 0.7 0.8 1
y 1 -1 -1 -1 -1 -1 -1 -1 1 1

Boosting Round 2:
x 0.1 0.1 0.2 0.2 0.2 0.2 0.3 0.3 0.3 0.3
y 1 1 1 1 1 1 1 1 1 1

Boosting Round 3:
x 0.2 0.2 0.4 0.4 0.4 0.4 0.5 0.6 0.6 0.7
y 1 1 -1 -1 -1 -1 -1 -1 -1 -1

Summary:
Round Split Point Left Class Right Class alpha
1 0.75 -1 1 1.738
2 0.05 1 1 2.7784
3 0.3 1 -1 4.1195
AdaBoost Example

Weights
Round x=0.1 x=0.2 x=0.3 x=0.4 x=0.5 x=0.6 x=0.7 x=0.8 x=0.9 x=1.0
1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1
2 0.311 0.311 0.311 0.01 0.01 0.01 0.01 0.01 0.01 0.01
3 0.029 0.029 0.029 0.228 0.228 0.228 0.228 0.001 0.001 0.001

Classification
Round x=0.1 x=0.2 x=0.3 x=0.4 x=0.5 x=0.6 x=0.7 x=0.8 x=0.9 x=1.0
1 -1 -1 -1 -1 -1 -1 -1 1 1 1
2 1 1 1 1 1 1 1 1 1 1
3 1 1 1 -1 -1 -1 -1 -1 -1 -1
Sum 5.16 5.16 5.16 -3.08 -3.08 -3.08 -3.08 0.397 0.397 0.397
Predicted class (sign of sum) 1 1 1 -1 -1 -1 -1 1 1 1
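The weighted sums in the classification table can be recomputed from the three stumps and their α values in the summary. The sketch below takes those numbers as given from the slides and uses the original data as the test set; the names are illustrative.

```python
# (split point, left class, right class, alpha) for boosting rounds 1-3, from the summary table.
ROUNDS = [(0.75, -1, 1, 1.738), (0.05, 1, 1, 2.7784), (0.3, 1, -1, 4.1195)]
XS = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0]
YS = [1, 1, 1, -1, -1, -1, -1, 1, 1, 1]

def weighted_sum(x):
    """Sum over rounds of alpha times the stump's +1/-1 prediction for x."""
    return sum(alpha * (y_left if x <= k else y_right) for k, y_left, y_right, alpha in ROUNDS)

sums = [round(weighted_sum(x), 2) for x in XS]
preds = [1 if s > 0 else -1 for s in sums]
print(sums)
print(preds)
```

For labels in {-1, +1}, taking the sign of this sum is equivalent to the arg max rule in the classification formula. The margin for x = 0.8 to 1.0 is small (about 0.4), so those predictions depend on the round 1 stump outweighing the gap between rounds 2 and 3.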
