Professional Documents
Culture Documents
Machine Learning
V I S WA N AT H U N I L E K H A J
M ASHRITHA
M R AT N A P R AVA L L I K A
L S H A R AT H K U M A R
G N S A I P R AV E E N C H O W D R AY
What is Machine Learning
Machine learning is an application of artificial
intelligence that involves algorithms and data
that automatically analyse and make decision
by itself without human intervention.
•Supervised learning is a process of providing input data as well as correct output data to the
machine learning model. The aim of a supervised learning algorithm is to find a mapping
function to map the input variable(x) with the output variable(y).
How Supervised Learning
Works?
The working of Supervised learning can be easily understood by the below example and
diagram:
Steps Involved in Supervised Learning:
1.CLASSIFICATION
2.REGRESSION
CLASSIFICATION
Classification algorithms are used when the output variable
is categorical, which means there are two classes such as
Yes-No, Male-Female, True-false, etc.
•ADVANTAGES OF SUPERVISED LEARNING
•Prior experiences
•classes of objects
•fraud detection,spam filtering
•DISADVANTAGES OF SUPERVISED LEARNING
•Complex tasks
•computation times
REGRESSION
Regression algorithms are used if there is a relationship between
the input variable and the output variable. It is used for the
prediction of continuous variables, such as Weather forecasting,
Market Trends, etc. Below are some popular Regression algorithms
which come under supervised learning:
Linear Regression
Regression Trees
Non-Linear Regression
Bayesian Linear Regression
Polynomial Regression
UNSUPERVISED LEARNING
The input object, or sample, has a corresponding label so that the algorithms learn to identify and classify
those input objects which match with the same label.
In other words, the algorithms create maps from given inputs to specific outcomes based on what they
learn from training data that has been labeled by machine learning engineers or data scientists.
Moreover, supervised learning uses both labeled training data and labeled validation data. This allows the
accuracy of supervised learning outputs to be checked for accuracy in a way that unsupervised learning
cannot be measured. Machine learning engineers or data scientists may opt to use a combination of labeled
and unlabeled data to train their algorithms. This in-between option is appropriately called semi-
supervised learning.
Clustering and other types of unsupervised learning
Clustering is the grouping of objects or data points that are similar to each other and dissimilar to objects in other
clusters.
Machine learning engineers and data scientists can use different algorithms for clustering, with the algorithms
themselves falling into different categories based on how they work. The categories include the following:
a. exclusive clustering
b. overlapping clustering
c. hierarchical clustering
d. probabilistic clustering
Disadvantages
Although organizations value those features of unsupervised learning, there are some disadvantages, including the
following:
2. difficulty checking the accuracy of the unsupervised learning outputs, as there are no labeled data sets to verify the
results;
3. the need for engineers and data scientists to spend more time interpreting and labeling results with unsupervised
learning than they would with supervised learning; and
4. the lack of full insight into how or why an unsupervised system reaches its results.
There is an additional disadvantage with clustering as well, in that cluster analysis could overestimate the similarities
in the input objects and thereby obscure individual data points that may be important for some use cases, such as
customer segmentation where the objective is to understand individual customers and their unique buying habits.
Examples and use cases
1. clustering anomaly detection, whereby algorithms can identify unusual data points in data
sets, a capability particularly useful to identity fraudulent activity or human errors or faulty
products; and
2. association mining, where algorithms find associations among data points, a capability that
retailers, for example, can use to identify what products are often bought together
Applications of Reinforcement Learning