Unit 4 Part 1

The document discusses the need for ensemble learning, emphasizing that no single algorithm is universally the best due to the No Free Lunch Theorem. It outlines methods for generating diverse learners and various model combination schemes, including voting, bagging, boosting, and stacking, each with its own advantages and use cases. Additionally, it touches on clustering as an unsupervised learning technique, highlighting the k-means algorithm and distance measures.

Need of Ensemble Learning

● The No Free Lunch Theorem states that there is no single learning algorithm that always induces the most accurate learner in every domain.
● Each learning algorithm dictates a certain model that comes with a set of
assumptions. This inductive bias leads to error if the assumptions do not
hold for the data.
● Learning is an ill-posed problem and with finite data, each algorithm
converges to a different solution and fails under different circumstances.
● By suitably combining multiple base-learners, accuracy can be improved.
Generating Diverse Learners
● The aim is to find a set of diverse learners that differ in their decisions so that they complement each other.
● At the same time, there cannot be a gain in overall success unless the
learners are accurate, at least in their domain of expertise.
● We therefore have this double task of maximizing individual accuracies and
the diversity between learners.

HOW TO GENERATE DIVERSE LEARNERS?
1. Different Algorithms
2. Different Hyperparameters
3. Different Input Representations (multi-view learning)
4. Different Training Sets
1. Different Algorithms
a. We can use different learning algorithms to train different base-learners.
b. Different algorithms make different assumptions about the data and lead to different classifiers.
c. For example, one base-learner may be parametric and another may be nonparametric.
d. When we decide on a single algorithm, we give emphasis to a single method and ignore all others.
e. By combining multiple learners based on multiple algorithms, we free ourselves from committing to a single method and no longer put all our eggs in one basket.
2. Different Hyperparameters
a. We can use the same learning algorithm but use it with different hyperparameters.
b. Examples are the number of hidden units in a multilayer perceptron, k in k-nearest neighbor, error
threshold in decision trees, the kernel function in support vector machines, and so forth.
3. Different Input Representations
a. Separate base-learners may be using different representations of the same input object or event,
making it possible to integrate different types of sensors/measurements/modalities.
b. Different representations make different characteristics explicit, allowing better identification.
c. In many applications, there are multiple sources of information, and it is desirable to use all of these
data to extract more information and achieve higher accuracy in prediction.
d. For example, in speech recognition, to recognize the uttered words, in addition to the acoustic
input, we can also use the video image of the speaker’s lips and shape of the mouth as the words are
spoken.
4. Different Training Sets
a. Another possibility is to train different base-learners on different subsets of the training set.
b. This can be done by drawing random training sets from the given sample; this is called bagging.
c. Or, the learners can be trained serially so that instances on which the preceding base-learners are not accurate are given more emphasis in training later base-learners; examples are boosting and cascading (see the sketch below).
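A minimal sketch of the last two ideas, assuming scikit-learn and NumPy are available: the same k-NN algorithm is trained with different values of k, each on its own bootstrap resample of an illustrative synthetic dataset. The data and the particular k values are assumptions made for the example, not part of these notes.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.neighbors import KNeighborsClassifier

# Illustrative synthetic dataset (not from the notes).
X, y = make_classification(n_samples=300, n_features=5, random_state=0)

rng = np.random.default_rng(0)
base_learners = []
for k in (1, 5, 15):                        # different hyperparameters: k in k-NN
    idx = rng.integers(0, len(X), len(X))   # different training set: bootstrap resample
    base_learners.append(KNeighborsClassifier(n_neighbors=k).fit(X[idx], y[idx]))

# The base-learners differ in some of their decisions and can complement each other.
predictions = np.array([m.predict(X[:10]) for m in base_learners])
print(predictions)  # one row per learner, one column per data point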
Model Combination Schemes

1. MULTIEXPERT COMBINATION
a. Multiexpert combination methods have base-learners that work in parallel.
b. These methods can in turn be divided into two:
i. In the global approach, also called learner fusion, given an input, all base-learners
generate an output and all these outputs are used. Examples are voting and stacking.
ii. In the local approach, or learner selection, for example, in mixture of experts, there is
a gating model, which looks at the input and chooses one (or very few) of the learners
as responsible for generating the output.
2. MULTISTAGE COMBINATION
a. Multistage combination methods use a serial approach where the next base-learner is trained with or
tested on only the instances where the previous base-learners are not accurate enough.
b. The idea is that the base-learners (or the different representations they use) are sorted in
increasing complexity so that a complex base-learner is not used (or its complex
representation is not extracted) unless the preceding simpler base-learners are not
confident. An example is cascading.
METHOD 1: VOTING
[Figure: three base models - a Decision Tree, a Random Forest, and a Support Vector Machine - each make a prediction, and the predictions are combined by voting]
Max Voting
The max voting method is generally used for classification problems. In this technique,
multiple models are used to make predictions for each data point. The predictions by
each model are considered as a ‘vote’. The prediction made by the majority of the models is used as the final prediction.

Averaging
Similar to the max voting technique, multiple predictions are made for each data point in
averaging. In this method, we take an average of predictions from all the models and use
it to make the final prediction. Averaging can be used for making predictions in
regression problems or while calculating probabilities for classification problems.

Weighted Average
This is an extension of the averaging method. Each model is assigned a weight that defines its importance in the final prediction.
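A hedged sketch of all three combination rules using scikit-learn's VotingClassifier; the choice of base models (logistic regression, decision tree, k-NN) and the weights are illustrative assumptions, not prescribed by the notes.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
base = [("lr", LogisticRegression(max_iter=1000)),
        ("dt", DecisionTreeClassifier(random_state=0)),
        ("knn", KNeighborsClassifier())]

# Max voting: each model casts one vote and the majority class wins.
hard = VotingClassifier(estimators=base, voting="hard").fit(X, y)

# Averaging: the class probabilities of all models are averaged.
soft = VotingClassifier(estimators=base, voting="soft").fit(X, y)

# Weighted average: models contribute with different importance.
weighted = VotingClassifier(estimators=base, voting="soft",
                            weights=[2, 1, 1]).fit(X, y)

print(hard.predict(X[:3]), soft.predict(X[:3]), weighted.predict(X[:3]))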
BAGGING
Steps in Bagging
● Bagging is also known as Bootstrap aggregating. It consists of two steps:
bootstrapping and aggregation.
● Bootstrapping
○ Involves resampling subsets of data with replacement from an initial
dataset. In other words, subsets of data are taken from the initial dataset.
○ These subsets of data are called bootstrapped datasets or, simply,
bootstraps.
○ Resampled ‘with replacement’ means an individual data point can be
sampled multiple times. Each bootstrap dataset is used to train a weak
learner.
● Aggregating
○ The individual weak learners are trained independently from each other.
Each learner makes independent predictions.
○ The results of those predictions are aggregated at the end to get the overall
prediction. The predictions are aggregated using either max voting or
averaging.
Detailed Steps

The steps of bagging are as follows:

● We have an initial training dataset containing n instances.


● We create m subsets of data from the training set. Each subset contains n sample points drawn from the
initial dataset with replacement, which means that a specific data point can be sampled more than once.
● For each subset of data, we train the corresponding weak learners independently. These
models are homogeneous, meaning that they are of the same type.
● Each model makes a prediction.
● The predictions are aggregated into a single prediction. For this, either max voting or averaging
is used.
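The same procedure can be sketched with scikit-learn's BaggingClassifier, assuming scikit-learn >= 1.2 (where the base model argument is named estimator; older versions call it base_estimator). The dataset and parameter values are illustrative.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

bag = BaggingClassifier(
    estimator=DecisionTreeClassifier(),  # homogeneous weak learners
    n_estimators=10,                     # m bootstrapped subsets / models
    max_samples=1.0,                     # each subset contains n sample points
    bootstrap=True,                      # sampled with replacement
    random_state=0,
).fit(X_tr, y_tr)

# Predictions of the individual trees are aggregated by voting.
print("bagged accuracy:", bag.score(X_te, y_te))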
Boosting
Steps
Boosting works with the following steps:

● We sample m subsets from an initial training dataset.
● Using the first subset, we train the first weak learner.
● We test the trained weak learner using the training data. As a result of the testing, some data points will be incorrectly predicted.
● Each data point with a wrong prediction is added to the second subset of data, so the subset is updated to emphasize the errors.
● Using this updated subset, we train and test the second weak learner.
● We continue with the following subsets until the total number of subsets is reached.
● We now have the total prediction. The overall prediction has already been aggregated at each step, so there is no need to calculate it separately.
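A sketch of this serial idea using AdaBoost, one standard boosting algorithm. The notes describe boosting generically; AdaBoost and the parameter values below are illustrative choices, and scikit-learn >= 1.2 is assumed for the estimator keyword.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

boost = AdaBoostClassifier(
    estimator=DecisionTreeClassifier(max_depth=1),  # weak learner (a decision stump)
    n_estimators=50,      # number of sequential weak learners
    learning_rate=1.0,    # how strongly each learner's vote is weighted
    random_state=0,
).fit(X_tr, y_tr)

# Misclassified points get larger weights, so each new stump focuses on the
# previous stumps' mistakes; the final prediction is the accumulated weighted vote.
print("boosted accuracy:", boost.score(X_te, y_te))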


Stacking

[Figure: interview-process analogy for stacking, showing an Aptitude Round, Technical Round 1, Technical Round 2, and a final decision by the CEO]
Steps

● We use the initial training data to train m different algorithms.
● Using the output of each algorithm, we create a new training set.
● Using the new training set, we train a meta-model.
● Using the results of the meta-model, we make the final prediction. The results are combined using weighted averaging.
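A corresponding sketch with scikit-learn's StackingClassifier; the base models and the logistic-regression meta-model are illustrative choices, not prescribed by the notes.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import StackingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

base = [("rf", RandomForestClassifier(random_state=0)),
        ("knn", KNeighborsClassifier()),
        ("svm", SVC(probability=True, random_state=0))]

# The base models' cross-validated outputs form a new training set,
# on which the meta-model (final_estimator) is trained.
stack = StackingClassifier(estimators=base,
                           final_estimator=LogisticRegression(max_iter=1000),
                           cv=5).fit(X, y)

print(stack.predict(X[:5]))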
Key Points
● If you want to reduce the overfitting or variance of your model, you use bagging. If
you are looking to reduce underfitting or bias, you use boosting. If you want to
increase predictive accuracy, use stacking.
● Bagging and boosting both work with homogeneous weak learners. Stacking works with heterogeneous, stronger learners.
● All three of these methods can work with either classification or regression
problems.
● One disadvantage of boosting is that it is prone to variance or overfitting. It is thus not advisable to use boosting for reducing variance; boosting does a worse job of reducing variance than bagging does.
● Conversely, it is not advisable to use bagging to reduce bias or underfitting, because bagging is more prone to bias and does not help reduce it.
● Stacked models have the advantage of better prediction accuracy than bagging or boosting. But
because they combine bagged or boosted models, they have the disadvantage of needing much
more time and computational power.
CLUSTERING ALGORITHM
Key Points
1. Clustering is the task of dividing unlabeled data points into clusters such that similar data points fall in the same cluster and dissimilar points fall in different clusters.
2. In simple words, the aim of the clustering process is to segregate groups with similar traits and assign them into clusters.
3. Clustering is important because it determines the intrinsic grouping among the unlabelled data.
4. There is no single criterion for good clustering; it depends on the user and on which criteria satisfy their need.
5. It is an unsupervised learning technique.
k-Means Algorithm

Step 1: Choose k initial centroids (in this example k = 3, with C1(2,10), C2(5,8), and C3(1,2)).
Step 2: Compute the distance of every point from the centroids.
Step 2 (continued): distance of each point from each centroid and the assigned (nearest) cluster.

Point     | Distance to C1 (2,10) | Distance to C2 (5,8) | Distance to C3 (1,2) | Assigned Cluster
A1 (2,10) |                       |                      |                      |
A2 (2,5)  |                       |                      |                      |
A3 (8,4)  |                       |                      |                      |
A4 (5,8)  |                       |                      |                      |
A5 (7,5)  |                       |                      |                      |
A6 (6,4)  |                       |                      |                      |
A7 (1,2)  |                       |                      |                      |
A8 (4,9)  |                       |                      |                      |

(The distance cells are filled in with the chosen distance measure; the sketch below computes them using Euclidean distance.)
Step 3: Update the centroids (each centroid becomes the mean of the points assigned to its cluster).
Step 4: Repeat Step 2 with the new centroids, until the cluster assignments no longer change.
Steps (summary): (1) choose k initial centroids; (2) assign each point to its nearest centroid; (3) recompute each centroid as the mean of its assigned points; (4) repeat from (2) until the assignments no longer change.
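A minimal from-scratch sketch of these steps in NumPy, applied to the points and initial centroids from the example above. Euclidean distance is assumed here as the distance measure; the notes list several alternatives.

```python
import numpy as np

# Points A1..A8 and initial centroids C1, C2, C3 from the example above.
points = np.array([[2, 10], [2, 5], [8, 4], [5, 8],
                   [7, 5], [6, 4], [1, 2], [4, 9]], dtype=float)
centroids = np.array([[2, 10], [5, 8], [1, 2]], dtype=float)

for iteration in range(10):
    # Step 2: Euclidean distance of every point from every centroid.
    dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
    assigned = dists.argmin(axis=1)          # nearest centroid for each point

    # Step 3: update each centroid to the mean of its assigned points.
    new_centroids = np.array([points[assigned == k].mean(axis=0)
                              for k in range(len(centroids))])

    # Step 4: repeat until the centroids stop moving.
    if np.allclose(new_centroids, centroids):
        break
    centroids = new_centroids

print("assignments:", assigned + 1)   # cluster number (1-based) per point
print("centroids:\n", centroids)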
DISTANCE BETWEEN TWO POINTS

1. Euclidean
2. Manhattan
3. Cosine
4. Jaccard
Euclidean Distance: d(p, q) = √((p1 − q1)² + (p2 − q2)² + … + (pn − qn)²)
Manhattan Distance: d(p, q) = |p1 − q1| + |p2 − q2| + … + |pn − qn|
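Minimal Python sketches of the four distance measures listed above; the function names and sample vectors are illustrative.

```python
import numpy as np

def euclidean(p, q):
    """Straight-line distance: square root of the sum of squared coordinate differences."""
    p, q = np.asarray(p, float), np.asarray(q, float)
    return np.sqrt(np.sum((p - q) ** 2))

def manhattan(p, q):
    """City-block distance: sum of absolute coordinate differences."""
    p, q = np.asarray(p, float), np.asarray(q, float)
    return np.sum(np.abs(p - q))

def cosine_distance(p, q):
    """1 minus the cosine of the angle between the two vectors."""
    p, q = np.asarray(p, float), np.asarray(q, float)
    return 1.0 - np.dot(p, q) / (np.linalg.norm(p) * np.linalg.norm(q))

def jaccard_distance(a, b):
    """1 minus |intersection| / |union| of two sets."""
    a, b = set(a), set(b)
    return 1.0 - len(a & b) / len(a | b)

print(euclidean([2, 10], [5, 8]))   # ~3.61 (A1 to C2 in the example above)
print(manhattan([2, 10], [5, 8]))   # 5.0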
