You are on page 1of 9

Group members

1. ANMOL KAKKAR(008)

Random Forest 2.
3.
4.
AYUBI SHARMA(009)
HARGUN SINGH (014)
KHYATI TIWARI(020)

Classifier 5.
6.
7.
NIVEDITA PANDEY (027)
PRACHI GARG(028)
SAURABH SHARMA(041)
8. SIDDHANT(038)
9. SONIYA (040)
What is random forest classifier?
Random forest is a Supervised Machine
Learning Algorithm that is used widely in
Classification and Regression problems.
It builds decision trees on different
samples and takes their majority vote for
classification and average in case of
regression.
Why Random Forest Classifier is Better than Decision Tree?

Decision Tree are highly sensitive to training


data set which could result in high variance .
So, our model fail to generalize the prediction.

For eg :
Advantages
1. It reduces overfitting in decision trees and helps to improve the
accuracy.
2. It runs efficiently on large data bases.
3. It is flexible to both classification and regression problems.
4. It works well with both categorical and continuous values.
5. Normalising of data is not required as it uses a rule-based
approach.
6. It offers an experimental method for detecting variable interactions.
Disadvantages
1. It requires much computational power as well as resources as it
builds numerous trees to combine their outputs.
2. It also requires much time for training as it combines a lot of
decision trees to determine the class.
3. Due to the ensemble of decision trees, it also suffers interpretability
and fails to determine the significance of each variable.
Operation of Random Forest
The working of random forest algorithm is as follows:

1. 1. A random seed is chosen which pulls out at random a collection of samples


from the training dataset , this process is also called bootstrapping while
maintaining the class distribution.
2. With this selected data set, a random set of attributes from the original data set is
chosen.
3. After that we build the different decision tree based on features subset.
4. Combine all the predictions(called aggregation). In classification problem , we
take majority voting for final decision.
Real World Random Forests

Being a Machine Learning model that can be used for both classification and
Prediction, combined with good efficiency, this is a popular model in various
arenas.

Random Forest can be applied to any data set with multi-dimensions, so it is


a popular choice when it comes to identifying customer loyalty in Retail,
predicting stock prices in Finance, recommending products to customers
even identifying the right composition of chemicals in the Manufacturing
industry.

With its ability to do both prediction and classification, it produces better


efficiency than most of the classical models in most of the arenas.
Banking Sector
Banking analysis requires a lot of effort as it contains a high risk of profit and loss. Customer analysis is
one of the most used studies adopted in banking sectors. Problems such as loan default chance of a
customer or for detecting any fraud transaction, random forest can be a great choice.

Healthcare Sectors
In pharmaceutical industries, random forest can be used to identify the potential of a certain medicine or
the composition of chemicals required for medicines. It can also be used in hospitals to identify the
diseases suffered by a patient, risk of cancer in a patient, and many other diseases where early analysis
and research play a crucial role.

You might also like