Professional Documents
Culture Documents
ML By Poonam Dhamal
Content
• What is Machine Learning,
• Machine Learning applications
• Training versus Testing
• Positive and Negative Class
• Cross-validation
• Dimensionality Reduction
• Subset Selection
• Principal Component Analysis.
ML By Poonam Dhamal
Learn?
• Machine learning is a growing technology which enables
computers to learn automatically from past data.
• Machine learning uses various algorithms for building
mathematical models and making predictions using historical
data or information.
• Learning is used when:
– Human expertise does not exist (navigating on Mars),
– Humans are unable to explain their expertise (speech
recognition)
– Solution changes in time (routing on a computer network)
– Solution needs to be adapted to particular cases (user
biometrics)
ML By Poonam Dhamal
What Is Machine Learning?
• Machine Learning is said as a subset of artificial
intelligence that is mainly concerned with the development of
algorithms which allow a computer to learn from the data and
past experiences on their own.
• The term machine learning was first introduced by Arthur
Samuel in 1959.
ML By Poonam Dhamal
What Is Machine Learning?
• Machine learning enables a machine to automatically learn
from data, improve performance from experiences, and
predict things without being explicitly programmed.
• A machine has the ability to learn if it can improve its
performance by gaining more data.
• A Machine Learning system learns from historical data, builds
the prediction models, and whenever it receives new data,
predicts the output for it.
• The accuracy of predicted output depends upon the amount
of data, as the huge amount of data helps to build a better
model which predicts the output more accurately.
ML By Poonam Dhamal
What Is Machine Learning?
• Suppose we have a complex problem, where we need to
perform some predictions, so instead of writing a code for it,
we just need to feed the data to generic algorithms, and with
the help of these algorithms, machine builds the logic as per
the data and predict the output.
ML By Poonam Dhamal
Traditional Programming
Data
Computer Output
Program
Machine Learning
Data
Computer Program
Output
ML By Poonam Dhamal
What Is Machine Learning?
Aspect of AI: creates knowledge
Definition:
“changes in [a] system that ... enable [it] to do the same task or tasks
drawn from the same population more efficiently and more
effectively the next time.'' (Simon 1983)
There are two ways that a system can improve:
1. By acquiring new knowledge
– acquiring new facts
– acquiring new skills
2. By adapting its behavior
– solving problems more accurately
– solving problems more efficiently
ML By Poonam Dhamal
What Is Machine Learning?
• Herbert Simon: “Learning is any process by
which a system improves performance from
experience.”
• What is the task?
– Classification
– Categorization/clustering
– Problem solving / planning / control
– Prediction
– others
ML By Poonam Dhamal
What Is Machine Learning?
• Herbert Simon: “Learning is any process by
which a system improves performance from
experience.”
• What is the task?
– Classification
– Categorization/clustering
– Problem solving / planning / control
– Prediction
– others
ML By Poonam Dhamal
What Is Machine Learning?
• A branch of artificial intelligence, concerned
with the design and development of
algorithms that allow computers to evolve
behaviors based on empirical data.
ML By Poonam Dhamal
Features of Machine Learning
ML By Poonam Dhamal
Need for Machine Learning?
• As a human, we have some limitations as we cannot access
the huge amount of data manually, so for this, we need some
computer systems and here comes the machine learning to
make things easy for us.
• We can train machine learning algorithms by providing them
the huge amount of data and let them explore the data,
construct the models, and predict the required output
automatically
• The performance of the machine learning algorithm depends
on the amount of data, and it can be determined by the cost
function. With the help of machine learning, we can save both
time and money.
ML By Poonam Dhamal
Need for Machine Learning?
• It is very hard to write programs that solve problems like
recognizing a face.
– We don’t know what program to write because we don’t know
how our brain does it.
– Even if we had a good idea about how to do it, the program
might be horrendously complicated.
• Instead of writing a program by hand, we collect lots of examples
that specify the correct output for a given input.
• A machine learning algorithm then takes these examples and
produces a program that does the job.
– The program produced by the learning algorithm may look very
different from a typical hand-written program. It may contain
millions of numbers.
– If we do it right, the program works for new cases as well as the
ones we trained it on.
ML By Poonam Dhamal
Importance of Machine Learning
• Rapid increment in the production of data
• Solving complex problems, which are difficult for a human
• Decision making in various sector including finance
• Finding hidden patterns and extracting useful information
from data.
ML By Poonam Dhamal
Applications
• Web search
• Computational biology
• Finance
• E-commerce
• Space exploration
• Robotics
• Information extraction
• Social networks
• Debugging
ML By Poonam Dhamal
Applications
ML By Poonam Dhamal
ML in a Nutshell
• Tens of thousands of machine learning
algorithms
• Hundreds new every year
• Every machine learning algorithm has three
components:
– Representation
– Evaluation
– Optimization
ML By Poonam Dhamal
Representation
• Decision trees
• Sets of rules / Logic programs
• Instances
• Graphical models (Bayes/Markov nets)
• Neural networks
• Support vector machines
• Model ensembles
• Etc.
ML By Poonam Dhamal
Evaluation
• Accuracy
• Precision and recall
• Squared error
• Likelihood
• Posterior probability
• Cost / Utility
• Margin
• Entropy
• K-L divergence
• Etc.
ML By Poonam Dhamal
Optimization
• Combinatorial optimization
– E.g.: Greedy search
• Convex optimization
– E.g.: Gradient descent
• Constrained optimization
– E.g.: Linear programming
ML By Poonam Dhamal
History of Machine Learning
ML By Poonam Dhamal
History of Machine Learning
ML By Poonam Dhamal
History of Machine Learning
ML By Poonam Dhamal
History of Machine Learning
ML By Poonam Dhamal
History of Machine Learning
ML By Poonam Dhamal
Difference between Artificial intelligence and Machine learning
ML By Poonam Dhamal
Difference between Artificial intelligence and Machine learning
• Artificial Intelligence
• Machine learning
ML By Poonam Dhamal
Difference between Artificial intelligence and Machine learning
ML By Poonam Dhamal
Difference between Artificial intelligence and Machine learning
Machine learning and deep learning are Deep learning is a main subset of machine
the two main subsets of AI. learning.
AI has a very wide range of scope. Machine learning has a limited scope.
ML By Poonam Dhamal
Difference between Artificial intelligence and Machine learning
On the basis of capabilities, AI can be Machine learning can also be divided into
divided into three types, which are, Weak mainly three types that are Supervised
AI, General AI, and Strong AI. learning, Unsupervised learning,
and Reinforcement learning.
It includes learning, reasoning, and self- It includes learning and self-correction
correction. when introduced with new data.
AI completely deals with Structured, Machine learning deals with Structured
semi-structured, and unstructured data. and semi-structured data.
ML By Poonam Dhamal
Classification of Machine Learning
ML By Poonam Dhamal
Classification of Machine Learning
Supervised learning
• Supervised learning is a type of machine learning method
in which we provide sample labeled data to the machine
learning system in order to train it, and on that basis, it
predicts the output.
• The supervised learning is based on supervision e.g. spam
filtering.
• The goal of supervised learning is to map input data with
the output data.
• Supervised learning can be grouped further in two
categories of algorithms: Classification & Regression
ML By Poonam Dhamal
Classification of Machine Learning
Unsupervised learning
• Unsupervised learning is a learning method in which a
machine learns without any supervision.
• The training is provided to the machine with the set of
data that has not been labeled, classified, or categorized,
and the algorithm needs to act on that data without any
supervision.
• The goal of unsupervised learning is to restructure the
input data into new features or a group of objects with
similar patterns.
• In unsupervised learning, we don't have a predetermined
result.
• Unsupervised learning can be grouped further in two
categories of algorithms:Clustering & Association
ML By Poonam Dhamal
Classification of Machine Learning
Reinforcement learning
• Reinforcement learning is a feedback-based learning
method, in which a learning agent gets a reward for each
right action and gets a penalty for each wrong action.
• The agent learns automatically with these feedbacks and
improves its performance.
• In reinforcement learning, the agent interacts with the
environment and explores it.
• The goal of an agent is to get the most reward points, and
hence, it improves its performance.
• The robotic dog, which automatically learns the movement
of his arms, is an example of Reinforcement learning.
ML By Poonam Dhamal
Machine learning Life cycle
ML By Poonam Dhamal
Machine learning Life cycle
1. Gathering Data
• The goal of this step is to identify and obtain all data-
related problems.
• data can be collected from various sources such
as files, database, internet, or mobile devices.
• The quantity and quality of the collected data will
determine the efficiency of the output.
• The more will be the data, the more accurate will be the
prediction.
• This step includes the below tasks:
Identify various data sources, Collect data, Integrate the
data obtained from different sources
• A coherent set of data, also called as a dataset.
ML By Poonam Dhamal
Machine learning Life cycle
1. Gathering Data
• A dataset is a collection of data in which data is arranged
in some order. A dataset can contain any data from a series
of an array to a database table. Below table shows an
example of the dataset:
Country Age Salary Purchased
India 38 48000 No
France 43 45000 Yes
Germany 30 54000 No
France 48 65000 No
Germany 40 Yes
India 35 58000 Yes
ML By Poonam Dhamal
Machine learning Life cycle
1. Gathering Data
• A tabular dataset can be understood as a database table
or matrix, where each column corresponds to a particular
variable, and each row corresponds to the fields of the
dataset.
• The most supported file type for a tabular dataset
is "Comma Separated File," or CSV.
• Types of data in datasets
Numerical data: Such as house price, temperature, etc.
Categorical data: Such as Yes/No, True/False, Blue/green, etc.
Ordinal data: These data are similar to categorical data but
can be measured on the basis of comparison.
ML By Poonam Dhamal
Machine learning Life cycle
Need of Dataset
• To work with machine learning projects, we need a huge
amount of data, because, without the data, one cannot
train ML/AI models.
• Collecting and preparing the dataset is one of the most
crucial parts while creating an ML/AI project.
• The technology applied behind any ML projects cannot
work properly if the dataset is not well prepared and pre-
processed.
• During the development of the ML project, the developers
completely rely on the datasets.
• In building ML applications, datasets are divided into two
parts: Training dataset & Test Dataset
ML By Poonam Dhamal
Machine learning Life cycle
ML By Poonam Dhamal
Machine learning Life cycle
ML By Poonam Dhamal
Machine learning Life cycle
2. Data preparation
• Data preparation is a step where we put our data into a
suitable place and prepare it to use in our machine
learning training.
• In this step, first, we put all data together, and then
randomize the ordering of data.
• This step can be further divided into two processes:
1. Data exploration:
It is used to understand the nature of data that we have to
work with. We need to understand the characteristics,
format, and quality of data.
A better understanding of data leads to an effective outcome.
In this, we find Correlations, general trends, and outliers.
ML By Poonam Dhamal
Machine learning Life cycle
2. Data pre-processing:
the next step is preprocessing of data for its analysis.
It involves below steps:
• Getting the dataset
• Importing libraries
• Importing datasets
• Finding Missing Data
• Encoding Categorical Data
• Splitting dataset into training and test set
• Feature scaling
ML By Poonam Dhamal
Machine learning Life cycle
3. Data Wrangling
• Data wrangling is the process of cleaning and converting raw
data into a useable format.
• t is the process of cleaning the data, selecting the variable to
use, and transforming the data in a proper format to make it
more suitable for analysis in the next step.
• It is one of the most important steps of the complete process.
Cleaning of data is required to address the quality issues.
• In real-world applications, collected data may have various
issues, including: Missing Values, Duplicate data, Invalid data,
Noise
• So, we use various filtering techniques to clean the data.
• It is mandatory to detect and remove the above issues because
it can negatively affect the quality of the outcome.
ML By Poonam Dhamal
Machine learning Life cycle
4. Data Analysis
• Now the cleaned and prepared data is passed on to the analysis
step.
• This step involves:
Selection of analytical techniques, Building models, Review the
result
• The aim of this step is to build a machine learning model to
analyze the data using various analytical techniques and review
the outcome.
• It starts with the determination of the type of the problems,
where we select the machine learning techniques such
as Classification, Regression, Cluster analysis, Association, etc.
then build the model using prepared data, and evaluate the
model.
• Hence, in this step, we take the data and use machine learning
algorithms to build theMLmodel.
By Poonam Dhamal
Machine learning Life cycle
5. Train Model
• in this step we train our model to improve its performance
for better outcome of the problem.
• We use datasets to train the model using various machine
learning algorithms.
• Training a model is required so that it can understand the
various patterns, rules, and, features.
6. Test Model
• Once our machine learning model has been trained on a
given dataset, then we test the model.
• In this step, we check for the accuracy of our model by
providing a test dataset to it.
• Testing the model determines the percentage accuracy of
the model as per theMLrequirement
By Poonam Dhamal of project or problem.
Machine learning Life cycle
7. Deployment
• The last step of machine learning life cycle is deployment,
where we deploy the model in the real-world system.
• If the above-prepared model is producing an accurate
result as per our requirement with acceptable speed, then
we deploy the model in the real system.
• But before deploying the project, we will check whether it
is improving its performance using available data or not.
• But before deploying the project, we will check whether it
is improving its performance using available data or not.
ML By Poonam Dhamal
Supervised Machine Learning
ML By Poonam Dhamal
Supervised Machine Learning
ML By Poonam Dhamal
Example Supervised Machine Learning
ML By Poonam Dhamal
Supervised Machine Learning
1. Regression
Regression algorithms are used if there is a relationship between
the input variable and the output variable. It is used for the
prediction of continuous variables, such as Weather forecasting,
Market Trends, etc. Below are some popular Regression
algorithms which come under supervised learning:
Linear Regression, Regression Trees, Non-Linear Regression
Bayesian Linear Regression, Polynomial Regression
ML By Poonam Dhamal
Supervised Machine Learning
2. Classification
Classification algorithms are used when the output variable is
categorical, which means there are two classes such as Yes-No,
Male-Female, True-false, etc.
Spam Filtering,
Random Forest
Decision Trees
Logistic Regression
Support vector Machines
ML By Poonam Dhamal
Advantages of Supervised learning
ML By Poonam Dhamal
Unsupervised learning
ML By Poonam Dhamal
Why Unsupervised learning
ML By Poonam Dhamal
Working of Unsupervised learning
ML By Poonam Dhamal
Types of Unsupervised learning
ML By Poonam Dhamal
Types of Unsupervised learning
ML By Poonam Dhamal
Algorithms of Unsupervised learning
• K-means clustering
• KNN (k-nearest neighbors)
• Hierarchal clustering
• Anomaly detection
• Neural Networks
• Principle Component Analysis
• Independent Component Analysis
• Apriori algorithm
• Singular value decomposition
ML By Poonam Dhamal
Advantages of Unsupervised learning
ML By Poonam Dhamal
Example Supervised Machine Learning
ML By Poonam Dhamal
Example of unupervised Machine Learning
ML By Poonam Dhamal
Difference Between Supervised and Unsupervised learning
ML By Poonam Dhamal
Supervised Learning Unsupervised Learning
Supervised learning algorithms are trained using labeled Unsupervised learning algorithms are trained using
data. unlabeled data.
Supervised learning model takes direct feedback to check Unsupervised learning model does not take any feedback.
if it is predicting correct output or not.
Supervised learning model predicts the output. Unsupervised learning model finds the hidden patterns in
data.
In supervised learning, input data is provided to the In unsupervised learning, only input data is provided to
model along with the output. the model.
The goal of supervised learning is to train the model so The goal of unsupervised learning is to find the hidden
that it can predict the output when it is given new data. patterns and useful insights from the unknown dataset.
ML By Poonam Dhamal
Supervised Learning Unsupervised Learning
Supervised learning needs supervision to train Unsupervised learning does not need any
the model. supervision to train the model.
Supervised learning can be categorized Unsupervised Learning can be classified
in Classification and Regression problems. in Clustering and Associations problems.
Supervised learning can be used for those cases Unsupervised learning can be used for those
where we know the input as well as cases where we have only input data and no
corresponding outputs. corresponding output data.
Supervised learning model produces an Unsupervised learning model may give less
accurate result. accurate result as compared to supervised
learning.
Supervised learning is not close to true Unsupervised learning is more close to the true
Artificial intelligence as in this, we first train Artificial Intelligence as it learns similarly as a
the model for each data, and then only it can child learns daily routine things by his
predict the correct output. experiences.
It includes various algorithms such as Linear It includes various algorithms such as
Regression, Logistic Regression, Support Clustering, KNN, and Apriori algorithm.
Vector Machine, Multi-class Classification,
Decision tree, Bayesian Logic, etc.
ML By Poonam Dhamal