
ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

4.2 ENSEMBLE LEARNING

"An ensembled model is a machine learning model that combines the predictions from two or
more models.”

There are three common ensemble learning methods in machine learning:
• Bagging
• Boosting
• Stacking
The idea of ensemble learning is to employ multiple learners and combine their predictions. If we have a committee of M models with uncorrelated errors, simply averaging their predictions can reduce the average error of a single model by a factor of M.

• Unfortunately, the key assumption that the errors of the individual models are uncorrelated is unrealistic; in practice the errors are typically highly correlated, so the reduction in overall error is generally small.
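
A brief sketch of the averaging argument behind this claim (the notation y_m, ε_m, h is introduced here for illustration and is not part of the original notes): if each committee member predicts y_m(x) = h(x) + ε_m(x), where h is the true function, then the committee prediction and its expected squared error are

\[
y_{\mathrm{COM}}(x) = \frac{1}{M}\sum_{m=1}^{M} y_m(x), \qquad
E_{\mathrm{COM}} = \mathbb{E}_x\!\left[\left(\frac{1}{M}\sum_{m=1}^{M}\varepsilon_m(x)\right)^{2}\right]
= \frac{1}{M^{2}}\sum_{m=1}^{M}\mathbb{E}_x\!\left[\varepsilon_m(x)^{2}\right]
= \frac{1}{M}\,E_{\mathrm{AV}}
\]

where E_AV is the average error of the individual models and the middle equality holds only if the errors have zero mean and are uncorrelated, i.e. E_x[ε_m(x) ε_l(x)] = 0 for m ≠ l.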

• Ensemble modeling is the process of running two or more related but different analytical models and then synthesizing the results into a single score or spread, in order to improve the accuracy of predictive analytics and data mining applications.

• An ensemble of classifiers is a set of classifiers whose individual decisions are combined in some way to classify new examples.

• Ensemble methods combine several decision tree classifiers to produce better predictive performance than a single decision tree classifier. The main principle behind the ensemble model is that a group of weak learners comes together to form a strong learner, thus increasing the accuracy of the model.

• Why do ensemble methods work? They rely on one of two basic observations:

1. Variance reduction: If the training sets are completely independent, it always helps to average an ensemble, because this reduces variance without affecting bias (e.g. bagging) and reduces sensitivity to individual data points.


2. Bias reduction: For simple models, an average of models has much greater capacity than a single model. Averaging models can reduce bias substantially by increasing capacity, while variance is controlled by fitting one component at a time (e.g. boosting).

4.2.1 Bagging

• Bagging is also called Bootstrap Aggregating. Bagging and boosting are meta-algorithms that pool decisions from multiple classifiers. Bagging creates ensembles by repeatedly resampling the training data at random.

• Bagging is an ensemble learning technique that helps to improve the performance and accuracy of machine learning algorithms.

• This meta-algorithm, which is a special case of model averaging, was originally designed for classification and is usually applied to decision tree models, but it can be used with any type of model for classification or regression.

Bootstrapping is the method of randomly drawing samples of data from a population with replacement in order to estimate a population parameter.
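
As a minimal illustration (the data values are hypothetical and NumPy is assumed to be available), a bootstrap sample is simply a draw of n indices with replacement:

import numpy as np

rng = np.random.default_rng(0)
data = np.array([2.3, 4.1, 3.8, 5.0, 4.4, 3.1])  # hypothetical observations

# One bootstrap sample: draw len(data) indices with replacement
indices = rng.integers(0, len(data), size=len(data))
bootstrap_sample = data[indices]

# Estimate a population parameter (here, the mean) from many bootstrap samples
bootstrap_means = [data[rng.integers(0, len(data), size=len(data))].mean()
                   for _ in range(1000)]
print("bootstrap estimate of the mean:", np.mean(bootstrap_means))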

Bagging Steps:


1. Suppose there are N observations and M features in the training data set. A sample is taken from the training data set randomly with replacement.

2. A subset of the M features is selected at random, and whichever feature gives the best split is used to split the node; this is repeated iteratively.

3. The tree is grown to its largest possible depth.

4. The above steps are repeated n times, and the prediction is given by aggregating the predictions from the n trees (a code sketch follows these steps).
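
The steps above describe a random-forest-style bagging of decision trees. A minimal sketch under the assumption that scikit-learn and NumPy are available (the dataset and parameter values are illustrative, not from the original notes):

import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
rng = np.random.default_rng(0)
n_trees = 25
trees = []

for _ in range(n_trees):
    # Step 1: bootstrap sample of the training data (with replacement)
    idx = rng.integers(0, len(X), size=len(X))
    # Steps 2-3: grow a full-depth tree, choosing from a random feature
    # subset at each split (max_features="sqrt")
    tree = DecisionTreeClassifier(max_features="sqrt", random_state=0)
    tree.fit(X[idx], y[idx])
    trees.append(tree)

# Step 4: aggregate the n_trees predictions by majority vote
votes = np.stack([t.predict(X) for t in trees])   # shape (n_trees, n_samples)
y_pred = np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)
print("training accuracy:", (y_pred == y).mean())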

Advantages of Bagging

1. Reduces over-fitting of the model.

2. Handles higher-dimensional data very well.

3. Maintains accuracy in the presence of missing data.

Disadvantages of Bagging:

1. Since the final prediction is based on the mean of the predictions from the subset trees, it won't give precise values for classification and regression models.

4.2.2 Boosting

Boosting is an ensemble modeling technique that attempts to build a strong classifier from a number of weak classifiers. It is done by building models from weak learners in series. First, a model is built from the training data. Then a second model is built which tries to correct the errors present in the first model. This procedure is continued, and models are added until either the complete training data set is predicted correctly or the maximum number of models has been added.

Boosting is a very different method of generating multiple predictions (function estimates) and combining them linearly. Boosting refers to a general and provably effective method of producing a very accurate classifier by combining rough and moderately inaccurate rules of thumb.

Boosting is a bias reduction technique. It typically improves the performance of a single tree model.

To begin, we define an algorithm for finding the rules of thumb, which we call a weak learner. The boosting algorithm repeatedly calls this weak learner, each time feeding it a different distribution over the training data. Each call generates a weak classifier, and we must combine all of these into a single classifier that, hopefully, is much more accurate than any one of the rules.

Train a set of weak hypotheses h1, ..., hT. The combined hypothesis H is a weighted majority vote of the T weak hypotheses. During training, focus on the examples that are misclassified.
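
In the usual AdaBoost notation this weighted majority vote can be written as follows (the vote weights α_t and weighted errors ε_t are introduced here for illustration; they are not defined in the original notes):

\[
H(x) = \operatorname{sign}\!\left(\sum_{t=1}^{T} \alpha_t\, h_t(x)\right),
\qquad
\alpha_t = \frac{1}{2}\ln\frac{1-\varepsilon_t}{\varepsilon_t}
\]

where ε_t is the weighted training error of the weak hypothesis h_t; more accurate weak hypotheses therefore receive larger votes, and the example weights are increased on misclassified points before the next round.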


Boosting Steps:

1. Draw a random subset of training samples d1 without replacement from the training set D to train a weak learner C1.

2. Draw a second random training subset d2 without replacement from the training set, add 50 percent of the samples that were previously misclassified, and use it to train a weak learner C2.

3. Find the training samples d3 in the training set D on which C1 and C2 disagree, and use them to train a third weak learner C3.

4. Combine all the weak learners via majority voting.

AdaBoost:

AdaBoost was the first really successful boosting algorithm developed for the purpose of
binary classification. AdaBoost is short for Adaptive Boosting and is a very popular boosting
technique that combines multiple “weak classifiers” into a single “strong classifier”. It was
formulated by Yoav Freund and Robert Schapire. They also won the 2003 Gödel Prize for their
work.
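
A minimal sketch of AdaBoost on a toy dataset, assuming scikit-learn is available (the dataset and the number of estimators are illustrative choices only):

from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# By default AdaBoostClassifier boosts shallow decision trees (decision stumps),
# reweighting the training examples that earlier rounds misclassified.
clf = AdaBoostClassifier(n_estimators=50, random_state=0)
clf.fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))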

Advantages of AdaBoost:

1. Very simple to implement

2. Fairly good generalization

3. The prior error need not be known ahead of time.

Disadvantages of AdaBoost:

1. Suboptimal solution

2. Can overfit in the presence of noise.

4.2.3 Stacking

There are many ways to ensemble models in machine learning, such as bagging, boosting, and stacking. Stacking is one of the most popular ensemble machine learning techniques: it combines the predictions of multiple models to build a new model and improve performance. Stacking enables us to train multiple models to solve similar problems and, based on their combined output, to build a new model with improved performance.

• Stacking, sometimes called stacked generalization, is an ensemble machine learning method that combines multiple heterogeneous base or component models via a meta-model.
• The base models are trained on the complete training data, and then the meta-model is trained on the predictions of the base models. The advantage of stacking is the ability to explore the solution space with different models on the same problem.
• The stacking-based model can be visualized in levels and has at least two levels of models. The first level typically trains two or more base learners (which can be heterogeneous), and the second level is usually a single meta-learner that takes the base models' predictions as input and gives the final result as output. A stacked model can have more than two such levels, but increasing the number of levels doesn't always guarantee better performance.
• In classification tasks, logistic regression is often used as the meta-learner, while linear regression is more suitable as the meta-learner for regression tasks.

Stacking is concerned with combining multiple classifiers generated by different learning algorithms L1, ..., LN on a single dataset S, which is composed of feature vectors Si = (xi, ti).

• The stacking process can be broken into two phases:

1. Generate a set of base-level classifiers C1, ..., CN, where Ci = Li(S).

2. Train a meta-level classifier to combine the outputs of the base-level classifiers.

Fig. shows the stacking framework.
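
A minimal stacking sketch in this spirit, with heterogeneous base learners and a logistic regression meta-learner, assuming scikit-learn is available (the particular base models and dataset are illustrative choices, not those of the original figure):

from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Level 0: heterogeneous base-level classifiers C1, ..., CN
base_learners = [
    ("tree", DecisionTreeClassifier(max_depth=5, random_state=0)),
    ("knn", KNeighborsClassifier(n_neighbors=5)),
]

# Level 1: a logistic regression meta-learner trained on the base models' predictions
stack = StackingClassifier(estimators=base_learners,
                           final_estimator=LogisticRegression())
stack.fit(X_train, y_train)
print("test accuracy:", stack.score(X_test, y_test))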


Difference between Bagging and Boosting

In brief: bagging trains its base learners independently (in parallel) on bootstrap samples drawn with replacement and combines them by simple, equal-weight voting or averaging, so it is mainly a variance reduction technique; boosting trains its base learners sequentially, reweighting or reselecting the training examples so that each new learner focuses on the mistakes of the previous ones, combines them by a weighted vote, and is mainly a bias reduction technique.

