Ensemble Learning
What is the meaning of Ensemble Learning?
Ensemble learning is a machine learning technique that involves combining multiple models (or
"learners") to improve the overall performance of a predictive model. The idea is that by combining
several different models, each with its own strengths and weaknesses, the ensemble model will be more
accurate and robust than any individual model.
Bagging
Bagging, short for Bootstrap Aggregating, is an ensemble learning technique that involves building
multiple models using subsets of the training data and combining their predictions through a
process of aggregation.
In bagging, several subsets of the training data are created by randomly sampling the original data
with replacement. Each subset is used to train a separate model, which is then combined with the other
models to form the final ensemble model.
The aggregation process in bagging can be done by taking the average (for regression) or the majority
vote (for classification) of the predictions made by the individual models. The resulting ensemble
model is usually more robust and accurate than any individual model.
One of the main advantages of bagging is that it can help to reduce overfitting, which occurs when a
model is too complex and captures noise in the training data rather than the underlying pattern. By
training multiple models on different subsets of the data, bagging can help to reduce the impact of noise
and increase the generalization performance of the ensemble model.
Commonly used bagging-based models include:
1. Random Forest: A decision tree-based model that uses bagging to create an ensemble of decision trees. Random Forest is a popular model for classification and regression problems.
2. Bagging meta-estimator: A general-purpose bagging model that can be used with any base
estimator, such as decision trees, SVMs, or neural networks.
3. Extra Trees: An extension of Random Forest that introduces more randomness into the tree-
building process by using random thresholds for each feature.
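As a minimal sketch of how this works in practice (assuming scikit-learn is available; the synthetic dataset and parameter values below are illustrative, not part of the original notes), the bagging meta-estimator can be used directly:

```python
# Minimal bagging sketch: BaggingClassifier's default base estimator is a
# decision tree, and each tree is fit on a bootstrap sample of the training data.
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

bagging = BaggingClassifier(
    n_estimators=100,  # number of base models, one per bootstrap sample
    bootstrap=True,    # sample the training data with replacement
    random_state=42,
)
bagging.fit(X_train, y_train)

# Predictions are aggregated by majority vote across the base models.
print("Bagging accuracy:", bagging.score(X_test, y_test))
```

Swapping in a different base estimator (for example, an SVM) only requires passing it to BaggingClassifier in place of the default decision tree.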
Random Forest
Random Forest Classifier is a popular ensemble learning algorithm that combines multiple decision
trees to make predictions for classification problems. It is a type of bagging method that creates
multiple decision trees using a random subset of the features and training data and then combines
the results of these trees to make a prediction.
In a random forest, each decision tree is built on a different bootstrap sample of the training data,
which is created by randomly sampling the training data with replacement. Additionally, at each split
of the tree, only a random subset of the features is considered for splitting, rather than all the features.
This helps to reduce the correlation between the trees and increases the diversity of the ensemble.
To make a prediction using a random forest classifier, the prediction of each decision tree is combined
through a majority vote. That is, the output class with the highest number of votes among all the decision
trees is considered the final output.
The Random Forest Classifier has several advantages over individual decision trees, including better generalization performance, reduced overfitting, and improved accuracy. It is also relatively easy to use and can handle large amounts of data and many features.
Random Forest Classifiers are commonly used in a variety of applications, such as medical diagnosis, sentiment analysis, and image classification. They are particularly effective when the data has many features or a high degree of noise or missing data.
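A minimal sketch with scikit-learn's RandomForestClassifier follows; the synthetic data and hyperparameter values are illustrative assumptions rather than part of the original notes:

```python
# Random Forest sketch: many decision trees, each trained on a bootstrap sample
# and restricted to a random subset of features at every split.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=20, n_informative=8, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

rf = RandomForestClassifier(
    n_estimators=200,     # number of decision trees in the forest
    max_features="sqrt",  # random subset of features considered at each split
    random_state=0,
)
rf.fit(X_train, y_train)

# The final class is the majority vote over all trees in the forest.
print("Random Forest accuracy:", rf.score(X_test, y_test))
```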
What are the main differences between Random Forest and Extra Trees Classifiers?
The main difference between the Random Forest and Extra Trees bagging classifiers lies in the way the individual trees are constructed.
In Random Forest, each decision tree is built on a bootstrap sample, and at each split only a random subset of the features is considered; the best splitting threshold is then searched for among those candidate features. This adds randomness to the tree-building process, which helps to reduce overfitting and increases the diversity of the trees in the ensemble.
In contrast, Extra Trees classifiers use an even more random approach to the tree-building process.
Instead of searching for the optimal splitting point for each feature, Extra Trees classifiers select the
splitting point randomly for each feature, without any optimization. This results in a higher level of
randomness in the tree-building process, which can further reduce overfitting and increase the diversity of
the trees.
The increased randomness in Extra Trees classifiers can make them more effective in situations
where the data has many noisy features, and where the optimal splitting points for the features may
be hard to identify.
However, this increased randomness can also make each individual tree a weaker learner. On the other hand, because Extra Trees classifiers skip the search for an optimal splitting point at each node, they are typically faster to train than Random Forest classifiers.
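A short comparison sketch (scikit-learn assumed; the synthetic data and settings are illustrative) makes the difference concrete: the two classifiers are used identically, and only the split-selection strategy inside the trees differs.

```python
# Random Forest searches for the best threshold among candidate features;
# Extra Trees draws split thresholds at random for the candidate features.
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=1000, n_features=20, n_informative=6, random_state=1)

rf = RandomForestClassifier(n_estimators=200, random_state=1)
et = ExtraTreesClassifier(n_estimators=200, random_state=1)

print("Random Forest CV accuracy:", cross_val_score(rf, X, y, cv=5).mean())
print("Extra Trees CV accuracy:  ", cross_val_score(et, X, y, cv=5).mean())
```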
Boosting
Boosting is an ensemble learning technique that combines multiple weak learners to create a strong learner. In contrast to bagging methods like Random Forest and Extra Trees, which create multiple models independently and combine their results, boosting methods build the models sequentially, with each new model depending on the ones trained before it.
The basic idea of boosting is to assign higher weights to the samples that are misclassified by the previous model, so that the subsequent model focuses more on these samples. The process is repeated iteratively, with each subsequent model trying to correct the mistakes of the previous models. The final prediction is a weighted combination of the predictions of all the models in the sequence.
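The weighting scheme described above is the one AdaBoost uses, so a minimal sketch with scikit-learn's AdaBoostClassifier (the synthetic data and settings are illustrative assumptions) shows the idea in code:

```python
# Boosting sketch: AdaBoost reweights misclassified samples so each new weak
# learner (a depth-1 decision stump by default) focuses on them; the final
# prediction is a weighted vote of all the learners in the sequence.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

boost = AdaBoostClassifier(
    n_estimators=100,   # number of weak learners trained sequentially
    learning_rate=0.5,  # shrinks each learner's contribution to the weighted vote
    random_state=0,
)
boost.fit(X_train, y_train)
print("AdaBoost accuracy:", boost.score(X_test, y_test))
```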
Boosting Models
There are several types of boosting methods, including:
1. AdaBoost Classifier: AdaBoost (Adaptive Boosting) trains a sequence of weak learners on reweighted training data, so that each new learner focuses on the samples the previous learners misclassified.
2. Gradient Boosting Classifier: Gradient Boosting iteratively trains each new model on the errors of the current ensemble and adds it to the ensemble.
3. XGBoost Classifier: XGBoost (Extreme Gradient Boosting) is an optimized gradient-boosting library known for its speed, its performance on large datasets, and its support for parallel processing.
4. LGBM Classifier: LightGBM (LGBM) is a popular gradient-boosting framework that was developed by Microsoft. It is designed to be fast, scalable, and highly efficient for large-scale data and is known for its ability to handle large datasets.
Parallel Processing in XGBOOST: XGBoost can perform parallel processing in two ways: data
parallelism and task parallelism. Data parallelism involves partitioning the dataset into smaller
subsets and training multiple models on each subset in parallel. Task parallelism involves
parallelizing the computations within a single model, such as parallelizing the gradient computation.
To enable parallel processing in XGBoost, you can set the n_jobs parameter to specify the number of
CPU cores to use. For example, setting n_jobs=-1 will use all available CPU cores for parallel
processing.
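A minimal sketch of enabling parallel processing this way (assuming the xgboost package is installed; the synthetic data and settings are illustrative):

```python
# XGBoost parallel-processing sketch: n_jobs=-1 uses all available CPU cores.
from sklearn.datasets import make_classification
from xgboost import XGBClassifier

X, y = make_classification(n_samples=5000, n_features=50, random_state=0)

model = XGBClassifier(
    n_estimators=300,  # number of boosted trees
    n_jobs=-1,         # use all available CPU cores for training
)
model.fit(X, y)
```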
What are the main differences between XGBOOST, Gradient Boosting, LGBM, and Adaboost?
XGBoost, Gradient Boosting, LightGBM (LGBM), and AdaBoost are all ensemble methods that use boosting to improve the accuracy of machine learning models. However, there are some key differences between them.
1. XGBoost (Extreme Gradient Boosting): XGBoost is an optimized gradient-boosting library that is known for its speed and performance on large datasets, in part because it supports parallel processing across CPU cores.
2. Gradient Boosting: Gradient Boosting is a general technique for improving the accuracy of machine learning models by combining multiple weak models. It works by iteratively training new models on the errors of the previous models and adding them to the ensemble. Gradient Boosting is a powerful technique, but it can be slow and prone to overfitting.
3. LightGBM (LGBM): LightGBM is a gradient-boosting framework developed by Microsoft that is designed to be fast, scalable, and memory-efficient, making it well-suited for large datasets. LightGBM also includes features such as GPU acceleration and distributed training.
4. AdaBoost (Adaptive Boosting): AdaBoost is a boosting algorithm that works by weighting the
training data and focusing on the misclassified samples. It trains weak models on the weighted
data and combines them into an ensemble. AdaBoost is a simple and effective algorithm, but it
can be sensitive to noisy data.
In summary, while all these algorithms use boosting to improve the accuracy of machine learning models,
they differ in their implementation details and optimization techniques. XGBoost and LightGBM are
known for their speed and performance on large datasets, while Gradient Boosting and AdaBoost are
more widely used and provide a good balance between accuracy and simplicity.
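As a rough side-by-side sketch of the four models (assuming scikit-learn, xgboost, and lightgbm are installed; the synthetic data and settings are illustrative, and relative results will vary by dataset):

```python
# The four boosting models discussed above, trained and scored the same way.
from lightgbm import LGBMClassifier
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, GradientBoostingClassifier
from sklearn.model_selection import cross_val_score
from xgboost import XGBClassifier

X, y = make_classification(n_samples=2000, n_features=30, random_state=0)

models = {
    "XGBoost": XGBClassifier(n_estimators=200, n_jobs=-1),
    "Gradient Boosting": GradientBoostingClassifier(n_estimators=200),
    "LightGBM": LGBMClassifier(n_estimators=200, n_jobs=-1),
    "AdaBoost": AdaBoostClassifier(n_estimators=200),
}

for name, model in models.items():
    score = cross_val_score(model, X, y, cv=5).mean()
    print(f"{name}: mean CV accuracy = {score:.3f}")
```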
Stacking
Stacking is an ensemble learning technique that involves combining multiple machine learning models to
improve the accuracy of predictions. It is a meta-learning approach that works by training multiple
base models on the same data and then using a meta-model to combine their predictions.
The meta-model is typically trained on the predictions the base models produce for held-out (for example, cross-validated) data. Once the meta-model is trained, the ensemble can make predictions on the test set: the base models generate their predictions for the new data, and the meta-model combines them into the final output.
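A minimal stacking sketch with scikit-learn's StackingClassifier follows; the choice of base models and meta-model here is illustrative, not prescribed by the notes:

```python
# Stacking sketch: two base models plus a logistic-regression meta-model that
# learns how to combine their cross-validated predictions.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

stack = StackingClassifier(
    estimators=[
        ("rf", RandomForestClassifier(n_estimators=100, random_state=0)),
        ("svc", SVC(probability=True, random_state=0)),
    ],
    final_estimator=LogisticRegression(),  # the meta-model
    cv=5,  # base-model predictions for the meta-model come from cross-validation
)
stack.fit(X_train, y_train)
print("Stacking accuracy:", stack.score(X_test, y_test))
```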
The main advantage of stacking is that it can often produce more accurate predictions than any single model: by combining the predictions from multiple models, it can reduce the bias and variance of the overall model and improve overall performance.
However, stacking can also be more computationally expensive and require more data than
other ensemble methods like bagging and boosting. It also requires careful tuning of the hyperparameters
of both the base models and the meta-model.
Overall, stacking is a powerful technique for improving the accuracy of machine learning models, but
it requires careful implementation and tuning to achieve the best results.