
Data mining

Francia Theresa L. Oliver


MACHINE LEARNING WEEK 1-2

Introduction to Machine Learning

➥used by search engines such as Google and Bing to rank web pages and to decide which advertisement to show to which user.
➥used by social networks such as Facebook and Instagram to generate a custom feed for every user and to tag users in uploaded pictures.
➥used by banks to detect whether an online transaction is genuine or fraudulent, and by e-commerce websites such as Amazon and Flipkart to recommend products that we are most likely to buy.
➥Gmail, Yahoo, and Hotmail use machine learning to decide which emails are spam and which are not.

Machine Learning
➥Branch of Artificial Intelligence
➥It is very difficult to write explicit rules for tasks such as photo tagging, classifying emails as spam or not spam, and web page ranking. The only solution to accomplish these tasks was to write a program that could generate its own rules by examining some examples (also called training data). This approach was named Machine Learning.

Machine Learning
➥Goal: build an Artificial Intelligence (AI) platform that is as intelligent as the human mind.
➥AI researchers believe that this goal can be achieved through machine learning algorithms that try to mimic the learning processes of the human brain.

Applications of Machine Learning

1. Virtual Personal Assistants
➥such as Siri, Alexa, or Google Assistant
⇢these assistants can be trained for certain tasks like ‘Set an alarm for 5 AM next morning’, ‘Remind me to visit doctor tomorrow at 6 PM’, and so on.
➥Smart speakers like Amazon Echo and Google Home are the outcomes of this innovation.
➥The above-mentioned assistants use machine learning to achieve these objectives.

2. Traffic Predictions
➥such as Google Maps
⇢it uses machine learning to predict the expected time of arrival at the destination and also to model traffic congestion in real time.

3. Online Transportation Networks
➥booking cabs through mobile apps like Grab and Uber.
⇢these apps estimate the price of the ride by using machine learning.
⇢they also use ML to determine price surge hours by predicting rider demand.

4. Video Surveillance
➥such as CCTV cameras
⇢ML-based video surveillance systems can even detect crime before it happens.
⇢They can track unusual behavior in people, such as standing motionless for an unnaturally long time, hesitancy, or napping on benches.
⇢the system can thus alert human attendants so that they can respond suitably.

5. Social Media Services
➥such as Facebook
⇢personalizing the news feed in order to improve advertisement targeting over social media
⇢showing the news feed to the user based on his or her interests by considering items clicked earlier by that user
⇢a list of Facebook users is suggested for you to become friends with.
⇢The Face Recognition feature of Facebook also uses ML to tag the friends in a picture. Facebook checks the poses and projections in the picture, notices the unique features, and then matches them with the people in your friends list.
➥Pinterest also uses ML for computer vision to identify the objects (or pins) in the images and recommend similar pins accordingly.

6. Email Spam and Malware Filtering
➥Email services have built-in machine learning to identify spam emails.
➥The system learns and identifies new mail as spam or not, automatically.

7. Online Customer Support
➥Many sites these days offer the surfer the option to talk with them.
➥However, very few sites have a live representative to answer your questions. In most cases, you converse with a chatbot.
➥These bots extract data from the site and present it to clients through machine learning.

8. Search Engine Result Refining
➥Google and other search engines use machine learning to improve search results for you.

9. Product Recommendations
➥Whenever we make an online purchase on websites such as Amazon or Flipkart, we usually keep receiving emails from them with shopping suggestions. They also recommend items that match our tastes.
10. Online Fraud Detection
➥Machine learning is also helping to make cyberspace more secure and to track monetary fraud online.
➥PayPal is using ML for protection against money laundering. The company uses a set of ML tools that helps it compare millions of transactions and distinguish between legitimate and illegitimate transactions taking place between buyers and sellers.

11. Medicine
➥With the advent of automation, medical records are now available in electronic form.
➥ML algorithms are helping doctors to understand diseases better by turning medical records into medical knowledge.

12. Handwriting Recognition
➥ML can not only recognize handwriting but also read many different styles of it. So, it is a versatile tool for many applications.
➥For instance, it can be used to route postal mail all over the country once it has been trained to read addresses written in anyone’s handwriting.
Site: https://quickdraw.withgoogle.com/#

13. Machine Translation
➥Google Translate translates text and websites almost instantly between 100 different human languages.

14. Driverless cars and autonomous helicopters
➥One such application is self-driving cars or autonomous helicopters.
➥It takes years of experience for a person to become a good driver and much of this is intuitive. The only possible solution is machine learning, i.e., having a computer program that can learn by itself how to drive a car or fly a helicopter.

Defining Machine Learning

➥Arthur Samuel (1959) coined the term machine learning and defined it as: ‘the field of study that gives computers the ability to learn without being explicitly programmed.’ This is an informal and old definition of machine learning.
➥In 1998, Tom Mitchell redefined the concept of machine learning as ‘[A] computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.’ For instance, let’s say there is an email program which tracks the emails that a person marks as spam or not spam and, based on that, learns how spam can be filtered in a better way. Thus, classifying emails as spam or not spam is the Task T. Watching the user mark emails as spam or not spam is the Experience E. The number of emails correctly classified as spam or not spam is the Performance measure P.

Classification of Machine Learning Algorithms

Machine Learning
➥Supervised (Regression/Classification)
➥Unsupervised (Clustering)
➥Reinforcement (Algorithm learns to react to environment)

Supervised Learning
➥Supervised learning can be used for two types of problems: Classification and Regression.
➥a machine learning method in which models are trained using labeled data.
➥models need to find the mapping function that maps the input variable (X) to the output variable (Y): Y = f(X)
➥needs supervision to train the model, similar to how a student learns things in the presence of a teacher.
📌 The example discussed below is an example of supervised machine learning, and the term ‘supervised’ signifies the fact that the dataset with the ‘right answers’ is given to the algorithm. The given example is also a case of a regression problem. In a regression problem, the system predicts a continuous-valued output (here, it is the price of the house).
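
As a rough illustration of the mapping Y = f(X) in supervised regression, the sketch below fits a straight line to a few labeled examples and uses it to predict a continuous value. The house sizes and prices are made-up numbers, and `fit_line` and `predict_price` are hypothetical helper names that do not come from these notes. Ordinary least squares is only one simple way of finding f.

```python
# Hypothetical labeled data: house size (square metres) -> price (thousands).
# The "right answers" (prices) are given, which is what makes this supervised.
sizes = [50.0, 70.0, 90.0, 110.0, 130.0]
prices = [150.0, 200.0, 250.0, 300.0, 350.0]


def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(sizes, prices)


def predict_price(size):
    """The learned mapping f(X); the output is a continuous value."""
    return slope * size + intercept


print(predict_price(100.0))
```

Because the output is a number on a continuous scale rather than a class label, this is a regression problem.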
📌 The example below is an example of Classification. The term ‘classification’ signifies that the system has to predict the output as a discrete value, i.e., one or zero (either benign or malignant in the above example). It should be noted that in a classification problem, the output can have more than two possible values. For example, there may be three types of breast cancers and one can try to predict the discrete value, i.e., zero, one, or two.
Here, zero may represent a benign tumor or not harmful cancer, one may represent type one cancer, and the discrete value two may indicate a type two cancer.
Hence, in a classification problem, we may have N classes in the output, where N is always a finite number.
From the given example, only one attribute or feature, namely the tumor size, has been used with the aim to predict whether the type of tumor is benign or malignant. In other machine-learning situations, there can be more than one attribute or feature. For example, the age of the patient can also be considered instead of just knowing the size of the tumor. In that case, our dataset would look like the one shown in Figure 2.3.
Let’s suppose a person has a tumor, of size and age as depicted by B in Figure 2.4. In this dataset, the ML algorithm is able to fit a straight line to separate out the benign tumors from the malignant tumors as shown in Figure 2.5.
Thus, according to the ML algorithm, a straight line can, hopefully, help us determine a person’s tumor type by separating out the tumors. And if a person’s tumor falls in the benign area, then the type of cancer is more likely to be benign than malignant.
In this example, two features, the patient’s age and tumor size, have been considered. However, we can increase the number of features to further increase the accuracy of prediction. It is important to note that as more and more relevant features are added to a model, the model will become more complex but the accuracy may increase. Commonly, machine learning algorithms have more than one feature.
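
To make the idea of separating two classes with a straight line more concrete, here is a minimal sketch using two features (age and tumor size). The data points are invented for illustration, and the names `centroid`, `fit_linear_boundary`, and `predict` are hypothetical. The line is placed halfway between the average benign point and the average malignant point (a nearest-centroid rule), which is a simple stand-in and not necessarily the algorithm behind the figures in these notes.

```python
# Hypothetical labeled data: (age in years, tumor size in mm).
# Label 0 = benign, label 1 = malignant.
samples = [(30, 10), (35, 15), (40, 12), (45, 20),
           (55, 40), (60, 35), (65, 45), (50, 50)]
labels = [0, 0, 0, 0, 1, 1, 1, 1]


def centroid(points):
    """Average point of a group of (age, size) pairs."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def fit_linear_boundary(samples, labels):
    """Return (w, b) so that w . x + b = 0 is the separating straight line."""
    benign = [s for s, y in zip(samples, labels) if y == 0]
    malignant = [s for s, y in zip(samples, labels) if y == 1]
    c0, c1 = centroid(benign), centroid(malignant)
    # The line is perpendicular to the segment joining the two centroids
    # and passes through its midpoint.
    w = (c1[0] - c0[0], c1[1] - c0[1])
    mid = ((c0[0] + c1[0]) / 2, (c0[1] + c1[1]) / 2)
    b = -(w[0] * mid[0] + w[1] * mid[1])
    return w, b


def predict(w, b, age, size):
    """Discrete output: 1 (malignant side of the line) or 0 (benign side)."""
    return 1 if w[0] * age + w[1] * size + b > 0 else 0


w, b = fit_linear_boundary(samples, labels)
print(predict(w, b, 42, 18))  # a new patient, to be classified
```

The output here is one of a finite set of classes, which is what distinguishes classification from regression.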

Problem 1:
For each customer account, you would like software that checks the account and decides whether it has been
compromised (hacked).

Problem 2:
You have a large stock of similar goods. You wish to
know how many of these goods will sell in the next three months.
Unsupervised Learning
➥Unsupervised learning, also known as unsupervised machine learning, uses machine learning algorithms to analyze and cluster unlabeled datasets. These algorithms discover hidden patterns or data groupings without the need for human intervention.
➥Its ability to discover similarities and differences in information makes it the ideal solution for exploratory data analysis, cross-selling strategies, customer segmentation, and image recognition.
➥allows us to approach problems with little or no idea about what the results will look like. We can derive structure from data where we don’t necessarily know the effect of the variables.
➥data is not labeled, which means that there is no output attribute. We only have input attributes, and on the basis of their values, grouping or clustering is performed on the input data to group them into similar classes.
➥We can only construct the clusters of data based on relationships among the variables in the data. With unsupervised learning, there is no feedback about the results predicted, i.e., there is no teacher to correct you.

📌 Examples of Unsupervised Learning:
➥Google News, as depicted in Figure 3.1, is an excellent example of clustering that uses unsupervised learning to group news items based on their contents. Google has a collection of millions of news items written on different topics, and its clustering algorithm automatically groups these news items into a small number of clusters that are somehow similar or related to each other by using different attributes, such as word frequency, sentence length, page count, and so on.

📌 Other Examples Of Unsupervised Learning Applications
➥Organize Computing Clusters
⇢The geographic location of servers is decided on the basis of clustering of web requests received from a particular region of the world. The local server will then contain only the data frequently accessed by people of that region.
➥Social Network Analysis
⇢Social network analysis is conducted to create clusters of friends based on the frequency of communication among them. Such analysis reveals the links among the users of any social networking website.
➥Market segmentation
⇢Sales organizations can cluster or group their customers into different segments on the basis of their previous billed items. For example, a big superstore may want to send an SMS about grocery items specifically to its grocery customers instead of sending that SMS to all its customers. Not only is it cheaper but also better; after all, it might be an irrelevant irritant to those who only purchase clothing from the store. The grouping of customers into different segments based on their purchase history will help the store to target the correct customers for increasing sales and improving its profits.
➥Astronomical data analysis
⇢Astronomers use large telescopes to study galaxies and stars. The patterns in light or grouping of lights received from different parts of the sky help to identify different galaxies, planets, and satellites.

Reinforcement Learning
➥used in applications like computer games (where the machine plays against a human), driverless cars, robot navigation, etc.
➥It works through trial and error, and the machine
selects those actions that yield the greatest rewards.

Machine Learning

➥AGENT: It is used for learning and decision making. The agent chooses actions that maximize some specified reward metric over a given amount of time.
➥ENVIRONMENT: It defines the outer world with which the agent interacts.
➥ACTIONS: They define the tasks to be performed by the agent.

➥Reinforcement learning allows machines and software agents to automatically determine the ideal behavior within a specific context, in order to maximize their performance. Simple reward feedback is required for the agent to learn its behavior, and this is known as the reinforcement signal.
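
The trial-and-error loop between agent, environment, and actions can be sketched roughly as follows. This is a minimal illustration under simplifying assumptions: the environment is hypothetical and pays a fixed reward per action, the agent uses an epsilon-greedy rule (one common choice, not something these notes prescribe), and the names `REWARDS`, `environment_step`, and `run_agent` are invented for the example.

```python
import random

# Hypothetical environment: each action pays a fixed reward.
# The reward is the reinforcement signal the agent learns from.
REWARDS = [1.0, 5.0, 2.0]


def environment_step(action):
    """Return the reward the environment gives for the chosen action."""
    return REWARDS[action]


def run_agent(steps=200, epsilon=0.1, seed=0):
    """Learn the value of each action by trial and error (epsilon-greedy)."""
    rng = random.Random(seed)
    n_actions = len(REWARDS)
    values = [0.0] * n_actions  # agent's estimated reward for each action
    counts = [0] * n_actions    # how many times each action has been tried
    total_reward = 0.0
    for step in range(steps):
        if step < n_actions:
            action = step                      # try every action once
        elif rng.random() < epsilon:
            action = rng.randrange(n_actions)  # explore: random action
        else:
            # exploit: pick the action with the highest estimated reward
            action = max(range(n_actions), key=lambda a: values[a])
        reward = environment_step(action)
        counts[action] += 1
        # Running average of the rewards seen for this action.
        values[action] += (reward - values[action]) / counts[action]
        total_reward += reward
    return values, total_reward


values, total_reward = run_agent()
best_action = max(range(len(values)), key=lambda a: values[a])
print(best_action, values)
```

After enough trials the agent mostly selects the action with the greatest reward, while the occasional random action keeps it exploring alternatives.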
