Machine Learning
Lecture 15: Ensemble Learning Methods
COURSE CODE: CSE451
2023
Course Teacher
Dr. Mrinal Kanti Baowaly
Associate Professor
Department of Computer Science and
Engineering, Bangabandhu Sheikh
Mujibur Rahman Science and
Technology University, Bangladesh.
Email: [email protected]
Ensemble Learning
A powerful way to improve the performance of your model
Construct a set of classifiers from training data
Predict class label of test data by combining the predictions made
by multiple classifiers or models
Examples: Random Forest, AdaBoost, Stochastic Gradient Boosting, Gradient Boosting Machine (GBM), XGBoost, LightGBM, CatBoost
General Approach
Step 1: Create multiple data sets D1, D2, ..., Dt-1, Dt from the original training data D.
Step 2: Build multiple classifiers C1, C2, ..., Ct-1, Ct, one on each data set.
Step 3: Combine the classifiers into a final ensemble classifier C*.
Simple Ensemble Techniques
Max Voting
Averaging
Weighted Averaging
Max Voting
Multiple models are used to make predictions for each data point
The prediction made by each model is considered a 'vote'
The prediction we get from the majority of the models is used as the final prediction
Generally used for classification problems
For example, suppose you ask 5 of your colleagues to rate your movie (out of 5): three of them rate it 4 while two of them give it a 5. Since the majority gave a rating of 4, you can take the final rating of the movie as 4.
You can consider this as taking the mode of all the predictions.
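A minimal sketch of the movie-rating example above, taking the mode of the individual ratings (scikit-learn's VotingClassifier with voting='hard' applies the same idea to full classifiers):

# Max voting: the final prediction is the most frequent individual prediction.
from statistics import mode

ratings = [4, 4, 4, 5, 5]  # "votes" from the 5 colleagues (or models)
final_rating = mode(ratings)
print(final_rating)  # 4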
Averaging
Similar to the max voting technique, multiple predictions are made for each data point
Take an average of the predictions from all the models and use it as the final prediction.
Averaging can be used in both regression and classification problems.
For example, in the previous case study of max voting, the averaging method would take the average of all the values, i.e. (5+4+5+4+4)/5 = 4.4.
Hence, the final rating of the movie is 4.4.
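A minimal sketch of the same example with averaging:

# Averaging: the final prediction is the mean of the individual predictions.
ratings = [5, 4, 5, 4, 4]
final_rating = sum(ratings) / len(ratings)
print(final_rating)  # 4.4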
Weighted Averaging
This is an extension of the averaging method.
All models are assigned different weights defining the importance
of each model for prediction.
For example, if two of your colleagues are critics while the others have no prior experience in this field, then the answers from these two colleagues are given more importance than those of the other people.
The result can be calculated as [(5*0.23) + (4*0.23) + (5*0.18) + (4*0.18) + (4*0.18)] = 4.41.
Hence, the final rating of the movie is 4.41.
Implementation: AnalyticsVidhya, GeeksForGeeks
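A minimal sketch of the weighted averaging example above (the weights are the illustrative values from the slide and must sum to 1):

# Weighted averaging: each prediction is multiplied by its model's weight.
ratings = [5, 4, 5, 4, 4]
weights = [0.23, 0.23, 0.18, 0.18, 0.18]  # higher weight for the two critics
final_rating = sum(r * w for r, w in zip(ratings, weights))
print(round(final_rating, 2))  # 4.41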
Advanced Ensemble Techniques
Bagging: The idea behind bagging is combining the results of
multiple models run in parallel (for instance, all decision trees) to
get a generalized result.
Boosting: Boosting is a sequential process, where each subsequent
model attempts to correct the errors of the previous model.
Stacking: Stacking is an ensemble learning technique that uses
multiple models’ (called base models) predictions as features to
build a new model (called meta-model).
Bagging
Multiple subsets are created from the
original dataset, selecting observations
with replacement (called bootstrapping).
A base model (weak model) is created on
each of these subsets.
The models run in parallel and are
independent of each other.
The final predictions are determined by combining the predictions from all the models.
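A minimal bagging sketch with scikit-learn, using the iris data set only for illustration; note that in scikit-learn versions before 1.2 the estimator argument is named base_estimator:

# Bagging: decision trees trained on bootstrap samples, combined by voting.
from sklearn.datasets import load_iris
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

bag = BaggingClassifier(
    estimator=DecisionTreeClassifier(),  # the base (weak) model
    n_estimators=10,                     # number of bootstrap subsets/models
    bootstrap=True,                      # sample observations with replacement
    random_state=42,
)
bag.fit(X_train, y_train)
print(accuracy_score(y_test, bag.predict(X_test)))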
Boosting
1. A base (weak) learner takes all the distributions and assigns equal weight or attention to each observation.
2. If there are prediction errors made by the base learning algorithm, then higher weight or attention is given to the observations with prediction errors.
3. Apply the next base learning algorithm.
4. Repeat steps 2 and 3 until the algorithm can correctly classify the output or the maximum number of iterations is reached.
5. The weak learners are combined to form a strong learner that predicts a more accurate outcome.
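A minimal AdaBoost sketch with scikit-learn, using the breast cancer data set only for illustration (the default base learner is a one-level decision tree, i.e. a decision stump):

# Boosting (AdaBoost): weak learners are trained sequentially; misclassified
# observations receive higher weights for the next learner.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

ada = AdaBoostClassifier(n_estimators=50, random_state=42)
ada.fit(X_train, y_train)
print(accuracy_score(y_test, ada.predict(X_test)))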
An Example of Boosting (AdaBoost)
B1 consists of 10 data points of two classes, plus (+) and minus (-): 5 of them are plus (+) and the other 5 are minus (-), and each one is initially assigned equal weight. The first model tries to classify the data points and generates a vertical separator line, but it wrongly classifies 3 pluses (+) as minuses (-).
B2 consists of the 10 data points from the previous model, in which the 3 wrongly classified pluses (+) are weighted more, so that the current model tries harder to classify these pluses (+) correctly. This model generates a vertical separator line which correctly classifies the previously misclassified pluses (+), but in this attempt it wrongly classifies three minuses (-).
B3 consists of the 10 data points from the previous model, in which the 3 wrongly classified minuses (-) are weighted more, so that the current model tries harder to classify these minuses (-) correctly. This model generates a horizontal separator line which correctly classifies the previously misclassified minuses (-).
B4 combines B1, B2 and B3 to build a strong prediction model which performs much better than any individual model.
Another Example: Dataaspirant, Detailed Implementation: AnalyticsVidhya
HW: Difference between Bagging and
Boosting
Ref: QuantDare
Stacking Ensemble Learning
Level 0: the base models, trained on the original data.
Level 1: the meta-model, trained on the base models' predictions.
Source and Implementation:
GeeksForGeeks, AnalyticsVidhya
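A minimal stacking sketch with scikit-learn; the level-0 base models and the level-1 meta-model chosen here are only illustrative:

# Stacking: level-0 base models produce predictions that become the input
# features of a level-1 meta-model.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

stack = StackingClassifier(
    estimators=[("rf", RandomForestClassifier(random_state=42)),
                ("svc", SVC())],                        # level 0: base models
    final_estimator=LogisticRegression(max_iter=1000),  # level 1: meta-model
)
stack.fit(X_train, y_train)
print(accuracy_score(y_test, stack.predict(X_test)))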
Random Forests Classifier
The random forests algorithm
How does the algorithm work?
Its advantages and disadvantages
Comparison between random forests and decision trees
Finding important features
Building a classifier with scikit-learn
Random Forests Algorithm
It is a popular supervised learning algorithm.
Random forest builds multiple decision trees (called a forest) on various random samples (or subsets) of a given dataset, takes the prediction from each tree, and predicts the final output based on the majority vote of the predictions.
It is based on the 'bagging' ensemble method, which yields a more accurate and stable prediction.
It can be used for both classification and regression.
How does the algorithm work?
Select random samples from a given
dataset (using bootstrapping).
Construct a decision tree for each
sample and get a prediction result
from each decision tree.
Final prediction is made by selecting
the prediction with the most votes
(for classification) or averaging the
predictions (for regression).
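A minimal random forest sketch with scikit-learn, using the iris data set only for illustration:

# Random forest: one decision tree per bootstrap sample, majority vote.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

rf = RandomForestClassifier(n_estimators=100, random_state=42)
rf.fit(X_train, y_train)
print(accuracy_score(y_test, rf.predict(X_test)))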
Advantages of Random Forests
Random forest is considered a highly accurate and robust method because of the number of decision trees participating in the process.
It is less likely to suffer from overfitting because it creates multiple trees on random subsets and takes the average or majority vote of their predictions, which cancels out individual biases. The randomness and the voting or averaging mechanism largely mitigate the overfitting problem.
It can handle missing data.
It can be used in both classification and regression problems.
Disadvantages of Random Forests
Random forest is slow because it builds multiple decision trees and makes the final prediction by combining the predictions of each individual tree.
The model is difficult to interpret compared to a decision tree, where you can easily make a decision by following the path in the tree.
Random Forest vs Decision Tree
Random forest is a set of multiple decision trees, whereas a decision tree is a single tree.
A deep decision tree may suffer from overfitting, but random forest prevents overfitting by creating multiple trees on random subsets.
A decision tree is computationally faster, but a random forest is slower.
A random forest is difficult to interpret, while a decision tree is easily interpretable and can be converted to rules.
Finding Important Features
Random forest offers a good feature selection indicator.
Scikit-learn provides an extra variable (feature_importances_) with the model, which shows the relative importance or contribution of each feature to the prediction.
It automatically computes the relevance score of each feature in the training phase, then scales the scores so that they sum to 1. The higher the score, the more important the feature.
These scores help you choose the most important features and drop the least important ones for model building.
Random forest uses Gini importance (impurity-based feature importance) to calculate the importance of each feature.
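A sketch of reading feature_importances_ from a fitted random forest, again using the iris data set as an illustrative example:

# feature_importances_ holds the impurity-based (Gini) importance of each
# feature; the scores sum to 1, and higher means more important.
import pandas as pd
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

iris = load_iris()
rf = RandomForestClassifier(n_estimators=100, random_state=42)
rf.fit(iris.data, iris.target)

importances = pd.Series(rf.feature_importances_, index=iris.feature_names)
print(importances.sort_values(ascending=False))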
More on Random Forest (LAB)
Build a Random Forest classifier with scikit-learn
Find important features of a Random Forest classifier with scikit-
learn
Build both Decision Tree and Random Forest classifiers and
compare their performances
Why does Random Forest model outperform the Decision Tree?
Source: DataCamp, AnalyticsVidhya
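A possible starting point for the lab exercises above: train a single decision tree and a random forest on the same split (the breast cancer data set is only an illustrative choice) and compare their test accuracies:

# Compare a single decision tree with a random forest on one train/test split.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

tree = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)
forest = RandomForestClassifier(n_estimators=100, random_state=42).fit(X_train, y_train)

print("Decision tree accuracy:", accuracy_score(y_test, tree.predict(X_test)))
print("Random forest accuracy:", accuracy_score(y_test, forest.predict(X_test)))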
Advanced Boosting Methods
What is GBM?
What is XGBoost?
What is LightGBM?
Advantages of using LightGBM and XGBoost
Build classifiers using GBM, LightGBM and XGBoost
Compare GBM, LightGBM and XGBoost
Which algorithm takes the crown: LightGBM or XGBoost?
Source: AnalyticsVidhya [1], [2]
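A minimal sketch comparing the three boosters on one split; it assumes the xgboost and lightgbm packages are installed (pip install xgboost lightgbm) and uses default hyperparameters only for illustration:

# GBM (scikit-learn), XGBoost and LightGBM expose the same fit/predict API.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier
from lightgbm import LGBMClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

models = {
    "GBM": GradientBoostingClassifier(random_state=42),
    "XGBoost": XGBClassifier(random_state=42),
    "LightGBM": LGBMClassifier(random_state=42),
}
for name, model in models.items():
    model.fit(X_train, y_train)
    print(name, accuracy_score(y_test, model.predict(X_test)))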
Advanced Boosting Methods (Cont.)
What is CatBoost?
Advantages of CatBoost library
CatBoost in comparison to other boosting algorithms
Installing CatBoost
Solving ML challenge using CatBoost
Source: AnalyticsVidhya, Dataaspirant
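A minimal CatBoost sketch, assuming the catboost package is installed (pip install catboost); the tiny toy data set is only there to show how cat_features lets CatBoost handle categorical columns without manual encoding:

# CatBoost handles categorical features natively via cat_features.
from catboost import CatBoostClassifier

X_train = [["red", 1.0], ["blue", 2.0], ["red", 3.0], ["green", 4.0]]
y_train = [0, 1, 0, 1]

model = CatBoostClassifier(iterations=100, verbose=0)
model.fit(X_train, y_train, cat_features=[0])  # column 0 is categorical
print(model.predict([["blue", 2.5]]))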
Comparison of CatBoost to other
boosting algorithms
A Comprehensive Course on Ensemble
Learning
Enroll now
Study Materials of Ensemble Methods
AnalyticsVidhya: A Comprehensive Guide to Ensemble Learning
(with Python codes)
GeeksForGeeks: Ensemble Method in Python
AnalyticsVidhya: Basics of Ensemble Learning Explained in Simple
English
Dataaspirant: How the Kaggle winners algorithm XGBoost algorithm
works