Professional Documents
Culture Documents
CourseNotes ML
CourseNotes ML
COURSE NOTES
1.- INFORMACIÓN
2.- ASISTENTES:
3.- AGENDA
1.
4.- NOTES
01 1. WELCOME W1
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
02 Learning Objectives
o Give examples of Machine Learning in various industries.
o Outline the steps machine learning uses to solve problems.
o Provide examples of various techniques used in machine learning.
o Describe the Python libraries for Machine Learning.
o Explain the differences between Supervised and Unsupervised
algorithms.
o Describe the capabilities of various algorithms.
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
o Anomaly detection
Is used to discover abnormal and unused cases, for example, it
is used for credit card fraud detection
o Sequence mining
Is used for predicting the next events, click-stream in websites
(Markov Model, HMM)
o Dimension reduction
Is used to reduce the size of data (PCA)
o Recommendation systems
This associates people’s preferences whit others who have
similar tastes and recommends new items to them such as books
or movies
Which Machine Learning technique is proper for grouping of similar cases in a dataset, for example to find similar patients, or for
05 customers’ segmentation in a bank? W1.2
o Clustering
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
Artificial Intelligence
o AI Is a general field with broad scope including:
Computer Vision
Language processing
Creativity
Summarization
Etc.
Machine Learning
o ML Is a branch of AI that cover the statistical part of artificial
07 intelligence. It teaches the computer to solve problems by looking at W1.2
hundreds or thousand examples, learning for them, and then using
that experience to solve the same problem in new situations
Deep Learning
o DP is a very special field of Machine Learning where computer can
actually learn and make intelligence decisions on their own. Deep
Learning involves a deeper level of automation in comparison with
most ML algorithms
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
09 Python W1.3
o Python is a popular and powerful general purpose programming
language that recently emerged as a preferred language among
Data Scientist
NumPy
o It´s a math library to work with N-Dimensional arrays in Python. It
enables you to do computation efficiently and effectively. It´s better
than regular Python because of its amazing capabilities working
with arrays, dictionaries, functions, datatypes, and to working with
images you need to know NumPy
SciPy
MatPlotLib
RECOMMENDATION
Pandas
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
SciKit Learn
SciKit-Learn Library
o It´s a free Machine Learning library for the Python programming
language. it has most of the classification, regression, and clustering
algorithms, and it´s designed to work with python numerical and
scientific libraries [NumPy & SciPy]
o Most of the task that need to be done in a Machine Learning
pipeline are implemented already in SciKit-Learn including:
Preprocessing of data
10 Feature selection W1.3
Feature extraction
Train test splitting
Defining the algorithms
Fitting models
Tuning parameters
Prediction
Evaluation
Exporting the model
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
Classification
18 W1.4
o It´s the process of predicting a discrete class labels or categories
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
Regression
19 o It’s the process of predicting continuous values as opposed to W1.4
predicting categorical value in classification
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
o Density estimation
It´s a very simple concept that is mostly used to explore the data
to find some structure within it
o Market basket analysis
It´s a modeling technique based upon the theory that if you buy
a certain group of items, you´re more likely to buy another group
of items
o Clustering
…
Clustering
o It is considered to be one of the most popular unsupervised
machine learning techniques, used for grouping data points or
objects that are somehow similar
22 o Cluster analysis has many applications in different domains, whether
it be a Bank´s desire to segment his customers based on certain
characteristics, or helping an individual to organize in-group his or
her favorite types of music
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
24 Quiz Week 1
25 1. REGRESSION W2
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
In this module, you will get a brief intro to regression. You learn about
Linear, Non-linear, Simple and Multiple regression, and their
applications. You apply all these methods on two different datasets, in
the lab part. Also, you learn how to evaluate your regression model, and
calculate its accuracy.
Learning Objectives
26 o Demonstrate understanding of the basics of regression. W2.1
o Demonstrate understanding of simple linear regression.
o Describe approaches for evaluating regression models.
o Describe evaluation metrics for determining accuracy of regression
models.
o Demonstrate understanding of multiple linear regression.
o Demonstrate understanding of non-linear regression.
o Apply Simple and Multiple, Linear Regression on a dataset for
estimation.
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
Regression
o It’s the process of predicting continuous values as opposed to
predicting categorical value in classification.
o Regarding to the attached image we can use regression methods to
predict a continuous value such as co2 emission using some other
variables
27 o There exist two types of variables
“Y” Dependent Variable: Can be seen as the state, target, or
final goal we study and try to predict, should be continuous and
cannot be a discrete value
“X” Independent Variable or Variables: Also known as
explanatory variables, can be seen as the causes of those states,
can be measured on either a categorical or continuous
measurement scale
Historical data of past cars using one or more of their features and from
28
that make a model.
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
Simple Regression
o An independent variable is used to estimate a dependent variable.
Multiple Regression
o When more than one independent variable is present to estimate a
29 dependent variable
30 Sales forecasting.
o Predict a salesperson’s total yearly sales from independent variables
such as age, education, and years of experience. It can also be used
in the
Field of psychology
o Determine individual satisfaction, based on demographic and
psychological factors.
Real state
o Predict the price of a house in an area, based on its size number of
bedrooms, and so on.
Personal Recruitment
o Predict employment income for independent variables such as hours
of work, education, occupation, sex, age, years of experience, and so
on
Other fields or domains,
Formato: 0101FR001
Vigencia: 30/Jun/2021
INGENIERÍA Y DESARROLLO TECNOLÓGICO
COURSE NOTES
Formato: 0101FR001
Vigencia: 30/Jun/2021