Random Forest
Contents
• What is Random Forest?
• Ensemble Methods - Bagging
• How does Random Forest work?
• Hyper-Parameters in Random Forest
• Parameter Tuning - Cross-Validation & GridSearchCV
• Building RF in Scikit-learn
• Pros and Cons
What is Random Forest?
• Random Forest is a supervised learning algorithm capable of performing both regression and classification tasks.
• As the name suggests, the Random Forest algorithm creates a forest of decision trees.
Ensemble method
• Ensemble methods use multiple learning algorithms to obtain better predictions.
• Several different models are trained, and their predictions are aggregated to improve stability and predictive power.
• For this, we need a number of models (learners) whose predictive power is only slightly better than random chance. Such learners are called weak learners.
• We combine such weak learners to make one strong learner, as sketched below.
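• A minimal sketch of this idea on a toy dataset (the dataset and settings are assumptions for illustration): several decision stumps (depth-1 trees), each only slightly better than chance, are combined into one stronger learner by majority vote.

from sklearn.datasets import make_classification
from sklearn.ensemble import VotingClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Weak learners: decision stumps, diversified by random feature subsets
stumps = [(f"stump_{i}", DecisionTreeClassifier(max_depth=1, max_features="sqrt", random_state=i))
          for i in range(5)]

# Strong learner: majority vote over the weak learners
ensemble = VotingClassifier(estimators=stumps, voting="hard")
ensemble.fit(X_train, y_train)
print("Ensemble accuracy:", ensemble.score(X_test, y_test))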
Bagging
• The idea behind bagging is to combine the results of multiple models (for instance, several decision trees) to get a generalized result.
• Bagging uses a sampling technique called bootstrapping.
• Bootstrapping is a sampling technique in which we create subsets of observations from the original dataset, with replacement; a minimal sketch follows below.
• Bagging (Bootstrap Aggregating) uses these subsets (bags) to get a fair idea of the distribution of the complete set.
• The subsets created for bagging may be the same size as or smaller than the original set.
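• A minimal bootstrapping sketch (the data and bag size are assumptions for illustration): each bag is drawn from the original observations with replacement.

import numpy as np

rng = np.random.default_rng(0)
data = np.arange(10)           # original dataset of 10 observations
n_bags, bag_size = 3, 10       # here each bag has the same size as the original set

for b in range(n_bags):
    idx = rng.integers(0, len(data), size=bag_size)   # sample indices with replacement
    print(f"bag {b}:", data[idx])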
Bagging
• Multiple subsets are created from the original dataset, selecting
observations with replacement.
Bagging
• A base model (weak model) is created on
each of these subsets.
• The models run in parallel and are
independent of each other.
• The final predictions are determined by combining the predictions from all the models, as in the sketch below.
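• A hedged sketch of bagging in scikit-learn (assuming scikit-learn 1.2+, where the base model argument is named estimator; older versions call it base_estimator): each decision tree is fit on a bootstrap sample, the trees are built in parallel, and their predictions are combined by voting.

from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, random_state=0)

bagging = BaggingClassifier(
    estimator=DecisionTreeClassifier(),   # base (weak) model
    n_estimators=10,                      # number of bags / models
    bootstrap=True,                       # sample observations with replacement
    n_jobs=-1,                            # models are independent, so fit them in parallel
    random_state=0,
)
bagging.fit(X, y)
print(bagging.predict(X[:5]))             # combined (voted) predictions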
How does Random Forest work?
• RF consists of multiple decision trees which act as base learners.
• Each decision tree is given a random subset of samples from the dataset (hence the name "random").
• The RF algorithm uses an ensemble method – Bagging (Bootstrap Aggregating).
• Random Forest then trains each base learner (i.e. decision tree) on a different sample of the data, and the sampling of data points happens with replacement; a minimal fit/predict sketch follows below.
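• A minimal fit/predict sketch on a toy dataset (the dataset is an assumption for illustration): a Random Forest is essentially bagging of decision trees, with extra randomness in the features considered at each split.

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

rf = RandomForestClassifier(n_estimators=100, random_state=42)
rf.fit(X_train, y_train)       # each tree is trained on a bootstrap sample of the training data
print("Test accuracy:", rf.score(X_test, y_test))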
Example
• Consider a training dataset: [X1, X2, X3, … X10, Y].
• Random Forest will create decision trees, each trained on a bootstrapped subset of this dataset.
Hyper-Parameters in Random Forest
• Optimization of RF depends on a few built-in parameters; an illustrative instantiation follows below.
• n_estimators – the number of decision trees the algorithm builds. As the number of trees increases, performance improves and the predictions become more stable, but computation slows down.
• max_features – the maximum number of features considered when splitting a node.
• n_jobs – the number of jobs to run in parallel. If n_jobs=1, one processor is used; if n_jobs=-1, the number of jobs is set to the number of available cores.
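• An illustrative instantiation of the three parameters above (the specific values are assumptions, not recommendations):

from sklearn.ensemble import RandomForestClassifier

rf = RandomForestClassifier(
    n_estimators=200,      # number of decision trees in the forest
    max_features="sqrt",   # features considered when splitting a node
    n_jobs=-1,             # use all available cores to build trees in parallel
    random_state=0,
)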
Hyper-Parameters in Random Forest
• max_depth is the maximum depth of each tree. The deeper the tree, the more splits it has and the more information it captures about the data.
• criterion is the function used to measure the quality of a split. Supported criteria are “gini” for Gini impurity and “entropy” for information gain. A small comparison sketch follows below.
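• A sketch comparing the two criteria with a capped tree depth (the dataset and values are assumptions for illustration):

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)

for criterion in ("gini", "entropy"):
    rf = RandomForestClassifier(max_depth=5, criterion=criterion, random_state=0)
    scores = cross_val_score(rf, X, y, cv=5)      # 5-fold CV score for each criterion
    print(criterion, round(scores.mean(), 3))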
Cross-Validation (CV)
• Cross-validation is a statistical method used to estimate the performance of
machine learning models.
• It is a resampling procedure used to evaluate machine learning models on a
limited data sample.
• The most common method is K-Fold CV.
• Normally, we split the data into train and test sets.
• In K-Fold CV, the training data is further split into K subsets, called folds.
Cross-Validation (CV)
• We then iteratively fit the model K times, each time training on K-1 of the folds and evaluating on the Kth fold (called the validation data).
• For example, suppose the training data is split into 5 folds (K = 5).
• 1st iteration – train on the first four folds and evaluate on the fifth.
• 2nd iteration – train on the first, second, third, and fifth folds and evaluate on the fourth.
• And so on for the remaining folds.
• At the end of training, we average the performance on each of the folds to arrive at the final validation metrics for the model; a minimal sketch follows below.
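• A minimal 5-fold CV sketch (the dataset is an assumption for illustration): fit the model K times, each time on K-1 folds, evaluate on the held-out fold, and average the fold scores.

import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import KFold

X, y = load_breast_cancer(return_X_y=True)
kf = KFold(n_splits=5, shuffle=True, random_state=0)

scores = []
for train_idx, val_idx in kf.split(X):
    rf = RandomForestClassifier(n_estimators=100, random_state=0)
    rf.fit(X[train_idx], y[train_idx])                  # train on K-1 folds
    scores.append(rf.score(X[val_idx], y[val_idx]))     # evaluate on the held-out fold

print("Mean CV score:", round(np.mean(scores), 3))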
Cross-Validation (CV)
• 5-Fold Cross-Validation
• For hyperparameter tuning, we perform many iterations of the entire K-Fold CV process, each time using different model settings, as in the sketch below.
• If we have 10 sets of hyperparameters and are using 5-Fold CV, that represents 50 training loops.
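• A sketch of this idea (the candidate settings below are assumptions for illustration): each of the 10 settings gets its own 5-fold CV, giving 50 model fits in total.

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)

# 2 x 5 = 10 candidate hyperparameter settings
candidates = [{"n_estimators": n, "max_depth": d}
              for n in (50, 100) for d in (3, 5, 7, 9, None)]

for params in candidates:                         # 10 settings x 5 folds = 50 fits
    rf = RandomForestClassifier(random_state=0, **params)
    score = cross_val_score(rf, X, y, cv=5).mean()
    print(params, round(score, 3))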
GridSearchCV
• Grid search is used to find the hyperparameters of a model that result in the most ‘accurate’ predictions.
• To implement grid search, import the GridSearchCV class from the sklearn.model_selection module.
• The first step is to create a dictionary of all the parameters and their corresponding candidate values that you want to test for best performance, as in the sketch below.
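• A hedged GridSearchCV sketch (the parameter grid and dataset are examples, not a recommended search space):

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

X, y = load_breast_cancer(return_X_y=True)

param_grid = {                       # dictionary of parameters and candidate values
    "n_estimators": [100, 200],
    "max_features": ["sqrt", "log2"],
    "max_depth": [5, 10, None],
}

grid = GridSearchCV(RandomForestClassifier(random_state=0),
                    param_grid, cv=5, n_jobs=-1)
grid.fit(X, y)
print("Best parameters:", grid.best_params_)
print("Best CV score:", round(grid.best_score_, 3))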
Pros & Cons
Pros:
• Random Forest reduces overfitting, because averaging over many trees smooths out the noise a single decision tree would fit.
• The same Random Forest algorithm can be used for both classification and regression tasks.
• Random Forest can be used to identify the most important features in the training dataset, which helps with feature engineering (see the sketch after this list).
Cons:
• Random Forest is difficult to interpret: because the results of many trees are averaged, it is hard to figure out why the forest makes the predictions it does.
• Random Forest takes longer to build and is computationally expensive compared with a single decision tree.
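• A small sketch of the feature-importance point in the pros above (the dataset is an assumption for illustration):

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier

data = load_breast_cancer()
rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(data.data, data.target)

# Rank features by impurity-based importance and show the top 5
ranked = sorted(zip(data.feature_names, rf.feature_importances_),
                key=lambda t: t[1], reverse=True)
for name, score in ranked[:5]:
    print(f"{name}: {score:.3f}")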