ENSEMBLE LEARNING:
Ensemble methods in machine learning combine the insights obtained from multiple
learning models to produce more accurate and reliable decisions. These methods follow the
same principle as the air-conditioner purchase example cited above.
Ensemble learning improves machine learning results by combining several models, which
generally yields better predictive performance than any single model alone. The basic idea
is to learn a set of classifiers (experts) and to allow them to vote.
Advantage: Improvement in predictive accuracy.
Disadvantage: An ensemble of classifiers is more difficult to understand than a single classifier.
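As a minimal sketch of this voting idea (assuming scikit-learn is available; the dataset and the three base classifiers are chosen only for illustration), several different models can be combined with a hard majority vote:

# A minimal voting-ensemble sketch: three different classifiers, hard majority vote.
# Assumes scikit-learn is installed; dataset and model choices are illustrative.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

ensemble = VotingClassifier(
    estimators=[
        ("lr", LogisticRegression(max_iter=5000)),
        ("dt", DecisionTreeClassifier(random_state=0)),
        ("nb", GaussianNB()),
    ],
    voting="hard",  # each classifier casts one vote; the majority class wins
)
ensemble.fit(X_train, y_train)
print("Ensemble accuracy:", ensemble.score(X_test, y_test))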
Why do ensembles work?
Dietterich (2002) showed that ensembles overcome three problems –
Statistical Problem –
The Statistical Problem arises when the hypothesis space is too large for the
amount of available data. Many hypotheses then have the same accuracy on the
training data, and the learning algorithm chooses only one of them; there is a
risk that the accuracy of the chosen hypothesis is low on unseen data.
Computational Problem –
The Computational Problem arises when the learning algorithm cannot guarantee
finding the best hypothesis.
Representational Problem –
The Representational Problem arises when the hypothesis space does not contain
any good approximation of the target class(es).
Main Challenge for Developing Ensemble Models
The main challenge is not to obtain highly accurate base models, but rather to obtain base
models that make different kinds of errors. For example, if ensembles are used for
classification, high accuracy can be achieved when the different base models misclassify
different training examples, even if the accuracy of each base classifier is low.
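As a toy illustration of this point (the numbers below are invented purely to show the arithmetic, and NumPy is assumed to be available), three classifiers that are each only 60% accurate, but that err on different examples, can reach 80% accuracy by majority vote:

import numpy as np

# Toy example with invented numbers: three weak classifiers, each 60% accurate,
# that make their mistakes on different examples.
y_true = np.ones(10, dtype=int)   # the true label of all 10 examples is 1
pred_a = y_true.copy()
pred_a[[0, 1, 2, 3]] = 0          # classifier A is wrong on examples 0-3
pred_b = y_true.copy()
pred_b[[3, 4, 5, 6]] = 0          # classifier B is wrong on examples 3-6
pred_c = y_true.copy()
pred_c[[6, 7, 8, 9]] = 0          # classifier C is wrong on examples 6-9

for name, p in [("A", pred_a), ("B", pred_b), ("C", pred_c)]:
    print(f"classifier {name} accuracy: {(p == y_true).mean():.0%}")   # 60% each

# Majority vote: predict 1 when at least 2 of the 3 classifiers predict 1.
votes = pred_a + pred_b + pred_c
majority = (votes >= 2).astype(int)
print(f"majority-vote accuracy: {(majority == y_true).mean():.0%}")    # 80%

The ensemble is wrong only on the two examples where at least two base classifiers err at the same time, which is exactly why diverse errors matter more than individually high accuracy.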
Methods for Independently Constructing Ensembles –
Majority Vote
Bagging and Random Forest
Randomness Injection
Feature-Selection Ensembles
Error-Correcting Output Coding
Methods for Coordinated Construction of Ensembles –
Boosting
Stacking
Reliable Classification: Meta-Classifier Approach
Co-Training and Self-Training
Types of Ensemble Classifier –
Bagging:
Bagging (Bootstrap Aggregation) is an ensemble meta-algorithm designed to improve the
stability and accuracy of machine learning algorithms used in statistical classification and
regression. It reduces variance, helps to avoid overfitting, is usually applied to decision
tree methods, and can be seen as a special case of the model averaging approach. Given a set
D of d tuples, at each iteration i a training set Di of d tuples is sampled with replacement
from D (i.e., a bootstrap sample, so the same tuple may appear more than once). A classifier
model Mi is then learned from each training set Di. Each classifier Mi returns its class
prediction, and the bagged classifier M* counts the votes and assigns the class with the most
votes to X (an unknown sample).
Implementation steps of Bagging –
1. Multiple subsets, each with the same number of tuples as the original data set, are
created by selecting observations with replacement.
2. A base model is created on each of these subsets.
3. Each model is learned in parallel from its training set, independently of the others.
4. The final predictions are determined by combining the predictions from all the
models.
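The following sketch follows these four steps with decision-tree base models (scikit-learn and NumPy are assumed to be available; the dataset, the number of models, and the other parameter choices are illustrative):

import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

rng = np.random.default_rng(0)
n_models = 25
models = []
for _ in range(n_models):
    # Step 1: a bootstrap sample of the same size as the training set (with replacement).
    idx = rng.integers(0, len(X_train), size=len(X_train))
    # Steps 2-3: fit an independent base model on this subset.
    tree = DecisionTreeClassifier()
    tree.fit(X_train[idx], y_train[idx])
    models.append(tree)

# Step 4: combine the predictions of all models by majority vote (labels are 0/1 here).
all_preds = np.array([m.predict(X_test) for m in models])
bagged = (all_preds.mean(axis=0) >= 0.5).astype(int)
print("Bagged accuracy:", (bagged == y_test).mean())

scikit-learn also packages the same bootstrap-train-vote procedure behind a single estimator, BaggingClassifier.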
An illustration for the concept of bootstrap aggregating (Bagging)
Example of Bagging
The Random Forest model uses bagging with decision tree base models, which individually
have high variance. It selects features at random to grow each tree, and several such random
trees together make a Random Forest.
Random Forest:
Random Forest is an extension of bagging. Each classifier in the ensemble is a
decision tree classifier generated using a random selection of attributes at each
node to determine the split. During classification, each tree votes and the most popular
class is returned.
Implementation steps of Random Forest –
1. Multiple subsets are created from the original data set, selecting
observations with replacement.
2. A subset of features is selected randomly and whichever feature gives the
best split is used to split the node iteratively.
3. The tree is grown to its largest extent.
4. Repeat the above steps; the final prediction is based on aggregating the
predictions from n trees.
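A brief sketch of these steps using scikit-learn's RandomForestClassifier (assumed to be available; the dataset and parameter values are illustrative):

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

forest = RandomForestClassifier(
    n_estimators=100,     # number of bootstrapped trees (steps 1 and 4)
    max_features="sqrt",  # random subset of features considered at each split (step 2)
    random_state=0,
)
forest.fit(X_train, y_train)
# The forest aggregates the individual trees' predictions into the final class.
print("Random Forest accuracy:", forest.score(X_test, y_test))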
Boosting
Boosting is an ensemble modeling technique that attempts to build a strong classifier from
a number of weak classifiers. It does so by building weak models in series. First, a model
is built from the training data. Then a second model is built that tries to correct the errors
of the first model. This procedure continues, and models are added until either the complete
training data set is predicted correctly or the maximum number of models has been added.
Boosting Algorithms
There are several boosting algorithms. The original ones, proposed by Robert
Schapire and Yoav Freund, were not adaptive and could not take full advantage of the weak
learners. Schapire and Freund then developed AdaBoost, an adaptive boosting algorithm that
won the prestigious Gödel Prize. AdaBoost was the first really successful boosting algorithm
developed for the purpose of binary classification. AdaBoost is short for Adaptive Boosting
and is a very popular boosting technique that combines multiple “weak classifiers” into a
single “strong classifier”.
Algorithm:
1. Initialise the dataset and assign equal weight to each data point.
2. Provide this as input to the model and identify the wrongly classified data points.
3. Increase the weights of the wrongly classified data points, decrease the weights
of the correctly classified data points, and then normalize the weights of all data
points.
4. if (got required results)
Goto step 5
else
Goto step 2
5. End
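A short sketch of this procedure using scikit-learn's AdaBoostClassifier (assumed to be available; the dataset and parameters are illustrative, and the library's default weak learner is a one-split decision tree, i.e. a decision stump):

from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Weak models are added in sequence; each is trained on data reweighted toward the
# examples the previous models got wrong (steps 2-3 above).
booster = AdaBoostClassifier(
    n_estimators=50,   # maximum number of weak models to add (step 4's stopping condition)
    random_state=0,
)
booster.fit(X_train, y_train)
print("AdaBoost accuracy:", booster.score(X_test, y_test))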
An illustration presenting the intuition behind the boosting algorithm.