Random Forest
Ensemble techniques fall into two groups:
• 1. Bagging – Each model is trained on a different subset drawn from the training data with replacement, and the final output is decided by majority voting.
For example, Random Forest.
• 2. Boosting – Weak learners are combined into a strong learner by building models sequentially, with each model correcting the errors of the previous ones, so that the final model has the highest accuracy.
For example, AdaBoost, XGBoost.
Random forest works on the Bagging principle.
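As a quick contrast of the two techniques, here is a minimal sketch using scikit-learn's BaggingClassifier and AdaBoostClassifier; the synthetic dataset and parameter values are illustrative assumptions, not from the original material.

```python
# Contrast bagging (independent models, majority vote) with boosting
# (sequential weak learners). Dataset and settings are illustrative.
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier, AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Bagging: 50 decision trees, each fit on its own bootstrap sample,
# combined by majority voting (random forest works the same way).
bagging = BaggingClassifier(n_estimators=50, random_state=42)
bagging.fit(X_train, y_train)

# Boosting: 50 weak learners (decision stumps by default) trained one
# after another, each focusing on the rows the previous ones got wrong.
boosting = AdaBoostClassifier(n_estimators=50, random_state=42)
boosting.fit(X_train, y_train)

print("Bagging accuracy: ", bagging.score(X_test, y_test))
print("Boosting accuracy:", boosting.score(X_test, y_test))
```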
Bagging
• Bagging, also known as Bootstrap Aggregation, is the ensemble technique used by random forest.
• Bagging draws random samples from the data set.
• Each model is built from a sample (a bootstrap sample) drawn from the original data with replacement; this is known as row sampling.
• This step of row sampling with replacement is called the bootstrap step.
• Each model is then trained independently and generates its own result.
• The final output is obtained by majority voting after combining the results of all models.
• This step of combining all the results and generating the output by majority voting is known as aggregation (a from-scratch sketch of these two steps follows below).
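Here is a minimal from-scratch sketch of the two steps above, written against scikit-learn decision trees on a synthetic dataset; the loop structure and variable names are our own illustration, not a library API.

```python
# Bootstrap: draw rows with replacement. Aggregation: majority vote.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
rng = np.random.default_rng(0)

models = []
for _ in range(25):                       # odd count avoids vote ties
    # Bootstrap step: sample n row indices with replacement.
    idx = rng.integers(0, len(X), size=len(X))
    models.append(DecisionTreeClassifier().fit(X[idx], y[idx]))

# Aggregation step: every tree votes; the majority class wins.
votes = np.array([m.predict(X) for m in models])   # shape (25, n_rows)
final = (votes.mean(axis=0) >= 0.5).astype(int)    # works for 0/1 labels
print("Ensemble accuracy on the training rows:", (final == y).mean())
```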
Example:
• Bootstrap samples (Bootstrap sample 01, Bootstrap sample 02, and Bootstrap sample 03) are drawn from the actual data with replacement, which means each sample is likely to contain duplicate rows rather than all unique ones.
• The models (Model 01, Model 02, and Model 03) built from these bootstrap samples are trained independently. Each model generates a result as shown.
• The Happy emoji has the majority over the Sad emoji, so by majority voting the final output is the Happy emoji.
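To see the duplicate-rows point concretely, here is a tiny sampling demo; the ten-row data set is a made-up stand-in for training rows.

```python
# Sampling with replacement repeats some rows and skips others entirely
# (on average roughly a third of the rows are left out).
import numpy as np

rng = np.random.default_rng(1)
data = np.arange(10)                      # stand-in for 10 training rows
sample = rng.choice(data, size=10, replace=True)

print("Bootstrap sample:", sample)
print("Unique rows drawn:", len(np.unique(sample)), "of", len(data))
```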
• Example 2: Consider a fruit basket as the data, as shown in the figure below.
• n samples are taken from the fruit basket, and an individual decision tree is constructed for each sample.
• Each decision tree generates an output, as shown in the figure.
• The final output is decided by majority voting: in the figure below, the majority of the decision trees output an apple rather than a banana, so the final output is an apple.
This algorithm is widely used in e-commerce, banking, medicine, the stock market, etc.
For example, in the banking industry it can be used to identify which customers are likely to default on a loan.
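Below is a hedged sketch of that loan-default use case with scikit-learn's RandomForestClassifier; the data here is synthetic, so the features merely stand in for real attributes such as income or credit history.

```python
# Train a random forest on a synthetic binary task standing in for
# "will this customer default on the loan?" (labels 0/1 are made up).
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, n_features=8, random_state=7)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=7)

clf = RandomForestClassifier(n_estimators=100, random_state=7)
clf.fit(X_train, y_train)
print("Held-out accuracy:", clf.score(X_test, y_test))
```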
Advantages and Disadvantages of the Random Forest Algorithm
• Advantages
• 1. It can be used for both classification and regression problems.
• 2. It reduces overfitting, since the output is based on majority voting (classification) or averaging (regression).
• 3. It performs well even if the data contains null/missing values.
• 4. Each decision tree is created independently of the others, so the algorithm parallelizes naturally.
• 5. It is highly stable, as the final answer is averaged over a large number of trees.
• 6. It maintains diversity because not all attributes are considered while building each decision tree (though this is not true in all cases).
• 7. It is relatively immune to the curse of dimensionality: since each tree does not consider all the attributes, the effective feature space is reduced.
• 8. We don't strictly have to segregate data into train and test sets, as roughly a third of the rows (about 37% on average) are never drawn into a given tree's bootstrap sample; these out-of-bag rows serve as built-in validation data (see the sketch below).
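Point 8 is what scikit-learn exposes as the out-of-bag (OOB) score; a short sketch on synthetic data follows.

```python
# With oob_score=True each tree is evaluated on the rows its bootstrap
# sample never drew, giving a validation estimate with no test split.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=3)

clf = RandomForestClassifier(n_estimators=200, oob_score=True, random_state=3)
clf.fit(X, y)
print("Out-of-bag accuracy estimate:", clf.oob_score_)
```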
• Disadvantages