0% found this document useful (0 votes)

738 views22 pages

Introduction to Machine Learning Concepts

Machine learning is becoming ubiquitous, with applications like Google searches, photo tagging, self-driving cars, and more. The document outlines the basic machine learning process as: 1) defining objectives, 2) gathering and cleaning data, 3) choosing a model, 4) training the model, 5) evaluating the model, 6) tuning hyperparameters, 7) interpreting and communicating results, and 8) deploying the model. It also describes the main types of machine learning as supervised learning, unsupervised learning, and reinforcement learning.

Uploaded by

IgorJales

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

738 views22 pages

Introduction to Machine Learning Concepts

Uploaded by

IgorJales

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

An Enlightenment to Machine Learning

Preamble
The concepts of Artificial Intelligence and Machine Learning always evoke the ancient
Greek myth of Pandora’s box. In the fairytale version of the story, Pandora is
portrayed as a curious woman who opened a sealed urn and inadvertently released
eternal misery on humankind.
In the original telling, Pandora was not an innocent girl who succumbed to the
temptation to open a forbidden jar. Instead, as the poet Hesiod tells us, Pandora was
made, not born.

Like the genie that escaped the lamp, the horse that fled the barn, the myth has become
a cliche. Now, let us explore the Machine Learning to get more fascinated!

Data Everywhere!
We are drowning in information and starving for knowledge.

Google
24 petabytes of data are processed per day.
Facebook
10 million photos are uploaded every hour.
Youtube
1 hour of video is uploaded every second.
Twitter
400 million tweets are posted per day.

With data increasing every day, we can believe that smart data analysis will become
more prevalent as a fundamental ingredient for technological progress.

Why Machine Learning?

We interact with Machine Learning models every single day without our knowledge.
Every time we perform a Google search, listen to a song or even take a photo, Machine
Learning is becoming a backbone process behind it by invariably learning and
improving from every interaction.

Machines can drive your car for you, detect eye diseases, unlock your phone with face
recognition, and the list never ends.
Let us get started with Machine Learning!

What is Machine Learning?

Definition

Machine Learning is the field of study that gives computers the ability to learn
without being explicitly programmed.

Machine learning is a tool for turning information into knowledge.

We are DATAFIED! Wherever we go, we leave a data trail. Data becomes fruitless unless
we discover the hidden patterns. Wondering how? Yes! Machine Learning is a magic
wand that turns information into knowledge, which will do wonders for humankind.

Deep dive into the concepts to know more.

Traditional Learning vs. Machine Learning

Traditional Learning

 Blends human-created rules with data to create answers to a problem.

Machine Learning

 Uses data and answers to uncover the rules that build a problem.

What Machine Learning does?

Do you want to predict a category?

 Machine Learning has Classification

Example
Predict if the stock price will increase or decrease.

Do you want to predict a quantity?

 Machine Learning has Regression

Example
Predict the age of a person based on their height, weight, and health factors.

What Machine Learning does?

Do you want to detect an anomaly?

 Machine Learning has Anomaly Detection

Example
Money withdrawal anomalies can be discovered.

Do you want to discover structure in unexplored data?

 Machine Learning has Clustering

Example
Finding a group of customers with similar behavior based on their buying data history.

Machine Learning Adventures

Explore the adventures of Machine Learning in this video.

Prelude
While a great deal of engrossment has been towards model building, model tuning, and
model evaluation, many individuals still find themselves asking basic inquisitive
questions like
What is the life cycle of Machine Learning?
This section of the course will aid in answering this question. Keep reading to know
more!

Big Picture
The big picture of the Machine Learning process lies in the following 9 steps, namely:

1. Defining Project Objectives

2. Gathering Data
3. Exploratory Data Analysis (EDA) and Data Cleaning
4. Choosing a Model
5. Training
6. Evaluation
7. Hyperparameter Tuning
8. Interpret and Communicate
9. Deployment and Documentation

Defining Project Objectives

 The first step of the life cycle is to recognize the opportunity for tangible
improvement of activities, enhance customer satisfaction, or create value
otherwise.
 It is critical that you understand the problem you are trying to solve. In this stage,
you should also be identifying the central objectives of your project by
identifying the variables that need to be predicted.

Gathering Data
 This is considered to be the primary step of the Machine Learning process.
 The quality and quantity of data you gather in this step will determine how
efficient your model will be.

Some important things to remember while gathering data are:

 Data can be collected from anywhere in any format.

 More training examples will aid the model to be more efficient.
 Make sure the number of samples for every class or topic is not
overly imbalanced.
 Ensure that your samples adequately cover the space of possible inputs, not
only the common cases.
Exploratory Data Analysis (EDA) and Data
Cleaning
Exploratory Data Analysis (EDA)

 Analyzing datasets to summarize their notable characteristics is called

Exploratory Data Analysis.
 It helps in performing investigations on data to discover hidden patterns,
anomalies, and so on.
 It aids in checking assumptions and hypothesis with the help of summary
statistics.

Data Cleaning
 Data can have several shortcomings. A few are:
1. Missing values
2. Duplicate data
3. Invalid data
 The process of detecting, correcting, and ensuring that the given dataset is error-
free, consistent enough to use, is called Data Cleaning.

Choosing a Model
 There are numerous models that researchers and Data scientists have created
over the years.
 Some are very well-suited for image data, while others are suited
for sequences, text-based data, and many more.
 Choosing the right model for the problem will impact the efficiency of the model.

Explore this video to know the different constraints for choosing different models.

Training
 The next step of the Machine Learning process, often known as the bulk of ML,
is Training the model.
 This step is very similar to a person who is learning to drive for the first time.
Though they do not know any of the basics initially, a licensed driver emerges
eventually, after a lot of practice and feedback.
 The data is split into Training Data and Testing Data.
 The model is trained with the training data using different ML algorithms by
adjusting the parameters in multiple iterations.
 The testing data is put aside as unseen data to evaluate your models.

Evaluation
 Once the training is complete, it is time to see if the model is any good,
using Evaluation.
 This is where that dataset that we set aside earlier comes into play, that
is, Testing Data.
 Evaluation allows us to test our model against the data that has never been used
for training.
 This metric will enable us to see how the model might perform against data that it
has not yet seen.
 This is meant to be representative of how the model might perform in the real
world.

Hyperparameter Tuning
 After the evaluation step, it is time to see if we can further improve our training by
tuning different parameters that were implicitly assumed in the training process.
This process is called Hyperparameter Tuning.
 The tuned model is once again evaluated for model performance, and this cycle
continues until the final best performing model is chosen.

Interpret and Communicate

 The most challenging task of the ML project is explaining the model's output.
 During the earlier days, Machine Learning was considered a BlackBox because it
was hard to interpret their insights and values.
 The more interpretable your model is, the easier it is to communicate your
model's importance to the stakeholders.

Deployment and Documentation

 Model deployment often poses a problem because of the coding and data
science experience it requires and because the time-to-implementation of
traditional data science methods from the start of the cycle is prohibitively long.
 The trained model has to be deployed in a real-world system to be efficient to
humans.
 It can be deployed using any framework like Flask, Cloud, Azure, and so on.
 Document your project well for your successors to handle it.

Prelude
Machine Learning is an umbrella term that covers 3 learning techniques. In this
section, let us unveil them to understand more about Machine Learning.

Types of Machine Learning

The types of Machine Learning are as follows:

 Supervised Learning
 Unsupervised Learning
 Reinforcement Learning

Supervised Learning
 Supervised learning is the Machine Learning task of learning a function that
maps an input to an output based on example input-output pairs.
 It infers a function from labeled training data.
 Each training example is a pair consisting of an input object and the desired
output value.
 A supervised learning algorithm analyzes the training data and produces
an inferred function, which can be used for mapping new examples.

Applications

1. Spam Detection
2. Pattern Recognition
3. Speech Recognition

Unsupervised Learning
Unsupervised Learning helps in uncovering hidden patterns from unlabeled data.

Applications

1. Recommender Systems
2. Targetted Marketing
3. Customer Segmentation
4. Structure Discovery
Reinforcement Learning
Reinforcement Learning is a type of Machine Learning in which software agents ought
to take actions in an environment to maximize the notion of cumulative reward.

Applications

1. Genetics
2. Economics
3. Robot Navigation

Know the Differences

Know the differences between the various learning techniques through this video.

Machine Learning in SDLC

The image depicted above illustrates how to integrate the process of Machine Learning
into the traditional Software Development Life Cycle (SDLC).

The three phases include:

1. Planning
2. Data Engineering
3. Modeling

Prelude
Are you confused about the jargons and terms in Machine Learning? This section is
here to help you.

Few key terminologies to be known while using the Machine Learning model are
discussed in this section.

Machine Learning Terminologies

Accuracy
Accuracy is the percentage of correct predictions made by the model.

Algorithm

 Machine learning algorithms are programs (math and logic) that adjust
themselves to perform better as they are exposed to more data.
 The learning part of Machine Learning implies that programs change how they
process data over time, much as humans change how they process data by
learning.
 So, a Machine Learning algorithm is a program with a specific way to adjust
its parameters, given the feedback on its previous performance , making
predictions about a dataset.

Examples

 Linear regression
 Decision trees
 Support vector machines
 Neural networks

Machine Learning Terminologies

Categorical Variables

 Categorical variables are variables with a discrete set of possible values.

 They can be ordinal or nominal.

Classification
Classification aids in predicting the categorical output.

Clustering
Clustering is the unsupervised grouping of data into buckets.

Machine Learning Terminologies

Dimension
The dimension of data denotes the number of features in a dataset.

Feature
For a dataset, a feature represents the combination of attribute and value.

Feature Selection
Feature selection is the process of selecting relevant features from a dataset for
creating a Machine Learning model.
Machine Learning Terminologies
Hyperparameters
Hyperparameters are higher-level properties of a model, such as how fast it can
learn or the complexity of a model.

Instance
An instance is a data point, row, or sample in a dataset.

Label
The label is the answer part of the observation in supervised learning.

Machine Learning Terminologies

Outlier
An outlier is an observation that deviates significantly from other observations in the
dataset.

Regression
Regression predicts the continuous form of output (For example, price, sales, and so
on).

Validation Set
The validation set is a set of observations used during model training to provide
feedback on how well the current parameters generalize beyond the training set.

Prelude
Let us now explore the following popular Machine Learning techniques:

 Classification
 Clustering
 Association Rule Mining
 Outlier Detection
 Regression
Classification
Definition

Classification is the process of identifying a category to which a new observation

belongs, based on a training set of data containing observations whose categories are
already known.

 It follows a two-step process, namely:

o Learning Step - Training phase where a model is constructed.
o Classification Step - Predicting the class labels and testing the same for
accuracy.
 Classification predicts the value of the categorical variables.

Classification Concept
This video unveils the concept of classification with an example.

Clustering
Clustering is the task of grouping a set of objects, such that objects in the same
cluster are similar to each other when compared to the objects in the other clusters.
 Distance measure plays a significant role in clustering.
 Clustering is an unsupervised learning method.
 The common distance measures used in various datasets are as follows.
Numeric Dataset

- Manhattan distance
- Minkowski distance
- Hamming distance

Non-Numeric Dataset

- Jaccard index
- Cosine Similarity
- Dice Coefficient

More on Clustering
Explore the types of clustering algorithms through this video.

Association Rule Mining

Association Rule Mining aids in identifying the associations, correlations, and frequent
patterns in data.

The derived relationships are represented in the form of Association Rules.

Association Rule Mining with Apriori

Watch this video to know the process of rule mining with Apriori.

Outlier Detection
Jiawei Han defines Outlier as
A data object that deviates significantly from the normal objects as if it were
generated by a different mechanism.

The types of Outlier are as follows:

 Global Outlier
Global Outlier significantly deviates from the entire dataset.
 Contextual Outlier
Contextual Outlier significantly deviates based on the context selected.
 Collective Outlier
Collective Outlier is a subset of data objects that collectively deviates from the
entire dataset.

Regression
 Regression analysis is a statistical method that aids in examining the relationship
between two or more variables of interest.
 It examines the influence of one or more independent variables on
a dependent variable.

Prelude
There are a variety of algorithms available in the Machine Learning world.
This section will guide you through the commonly used Machine Learning
Algorithms.

Decision Tree
 A Decision Tree (DT) is a tree-like model of decisions and possible
consequences, chance event outcomes, resource costs, and utility.
 Decision Trees are a non-parametric supervised learning method used for
classification and regression.

Watch this video to know more.

Naive Bayes
A Naive Bayes classifier is a probabilistic Machine Learning model that is used for
classification tasks. The crux of the classifier is based on the following Bayes theorem
formula.

P(A|B)= P(B∣A)P(A) / P(B)

Support Vector Machine

Support Vector Machine (SVM) is a supervised machine learning algorithm. It is used
for classification or regression type of problems.

Watch the following video to know more about SVM with an example.

K-means Clustering
Delve into this video to know about a type of clustering algorithm called K-means
Clustering.

Random Forest
Know more about the Random Forest algorithm through this video.

Linear Regression
Explore this video to know about Linear Regression Analysis.

Logistic Regression
Deep dive into this video to know about Logistic Regression Analysis.

Course Summary
Machine Learning is a modern innovation that has helped humans enhance industrial
and professional processes and everyday living.
Explore and delve deeper to increase your skills!
Qustion (13 CORRECTS)
The derived relationships from Association Rule Mining are represented in the
form of ___________.

Decision Trees

Association Rules

Data Rules

Which Machine Learning technique would you suggest to develop a machine

that detects the sudden increase or decrease in the heartbeat?

Outlier Detection

Classification

Regression Analysis

Which of the following Machine Learning models would you suggest to predict
a quantity?
Classification

Regression

Clustering

______________ is given a system of rewards and punishments.

Supervised Learning

Reinforcement Learning

Unsupervised Learning

__ Learning uses data and answers to uncover the rules that build a problem.

Traditional

Machine

Linear Regression helps in predicting the ____________ output.

Discrete

Continuous
Clustering is a/an ____________ learning method.

Supervised

Unsupervised

Support Vector Machine is used for ____________ type(s) of problems.

Classification and Regression

Regression

Classification

_____________ Learning draws inspiration from psychological behavior.

Supervised

Reinforcement

Unsupervised

_____________ is a tool for turning information into knowledge.

Data Transformation
Data Analysis

Machine Learning

______________ outlier deviates significantly from the entire dataset.

Contextual

Global

An observation that deviates significantly from other observations in the

dataset is known as ____________.

Outlier

A marketing company wants to group its customers into various groups to

advertise accordingly. Which Machine Learning technique would you suggest
for the company?

Clustering
Classification

Regression

A credit card company receives thousands of applications for new credit card
issues with attributes like salary, debts, and so on. Which Machine Learning
technique would you suggest to categorize applications into good credit and
bad credit?

Classification

Outlier Detection

____________ learning blends rules created by humans with data to develop

answers to a problem.

Machine

Traditional

Key Data Structures for Engineers
0% (1)
Key Data Structures for Engineers
22 pages
Understanding Sharding and API Whitelisting
No ratings yet
Understanding Sharding and API Whitelisting
164 pages
Spring Frameworks for Enterprise Applications
No ratings yet
Spring Frameworks for Enterprise Applications
330 pages
SpringBoot Interview Qns and Ans
No ratings yet
SpringBoot Interview Qns and Ans
468 pages
Java Study Material
No ratings yet
Java Study Material
29 pages
Handbook 1
100% (1)
Handbook 1
17 pages
Java Programming Notes
No ratings yet
Java Programming Notes
89 pages
Modern Java A Guide To Java 8
No ratings yet
Modern Java A Guide To Java 8
142 pages
Git Notes ?-1
No ratings yet
Git Notes ?-1
71 pages
Multithreading Completable Furure
No ratings yet
Multithreading Completable Furure
9 pages
Java/J2EE Design Patterns Interview Questions You'll Most Likely Be Asked: Second Edition
No ratings yet
Java/J2EE Design Patterns Interview Questions You'll Most Likely Be Asked: Second Edition
22 pages
Angular Framework Overview and Concepts
No ratings yet
Angular Framework Overview and Concepts
33 pages
Capgemini Java Interview Questions
100% (1)
Capgemini Java Interview Questions
10 pages
Spring Boot
No ratings yet
Spring Boot
18 pages
Hibernate Reverse Engineering Guide
No ratings yet
Hibernate Reverse Engineering Guide
50 pages
Java Class and Object Fundamentals
No ratings yet
Java Class and Object Fundamentals
455 pages
Java Coding Interview Questions + Answers (With Code Examples) - Zero To Mastery
No ratings yet
Java Coding Interview Questions + Answers (With Code Examples) - Zero To Mastery
71 pages
Java Programming Basics and Features
No ratings yet
Java Programming Basics and Features
86 pages
Java Material 1
No ratings yet
Java Material 1
255 pages
Java Microservices Interview Questions
No ratings yet
Java Microservices Interview Questions
24 pages
Java Spring Boot Learning Plan - From Beginner To Industry-Ready
No ratings yet
Java Spring Boot Learning Plan - From Beginner To Industry-Ready
15 pages
Introduction to Angular Framework
No ratings yet
Introduction to Angular Framework
63 pages
Lead Nodejs Developer Resume
No ratings yet
Lead Nodejs Developer Resume
7 pages
Design Patterns Natraj
No ratings yet
Design Patterns Natraj
46 pages
Blockchain Technology Overview and Concepts
No ratings yet
Blockchain Technology Overview and Concepts
60 pages
50 Days of DSA Problems and Solutions
No ratings yet
50 Days of DSA Problems and Solutions
15 pages
Java Tutorial for Beginners PDF
No ratings yet
Java Tutorial for Beginners PDF
161 pages
Angular Data Binding Techniques
No ratings yet
Angular Data Binding Techniques
9 pages
JavaScript & CSS Exam Guide
No ratings yet
JavaScript & CSS Exam Guide
44 pages
Dumps
No ratings yet
Dumps
131 pages
JAVA - Coding
No ratings yet
JAVA - Coding
22 pages
MS 201
No ratings yet
MS 201
41 pages
BCA 428 Oracle
No ratings yet
BCA 428 Oracle
142 pages
Introduction to Hadoop Software
No ratings yet
Introduction to Hadoop Software
47 pages
Java Interface Methods: Default & Private
No ratings yet
Java Interface Methods: Default & Private
7 pages
Java Programming Q&A: Packages & Threads
No ratings yet
Java Programming Q&A: Packages & Threads
16 pages
SQL Scenario-Based Interview Questions & Answers: Nitya Cloudtech PVT LTD
No ratings yet
SQL Scenario-Based Interview Questions & Answers: Nitya Cloudtech PVT LTD
8 pages
Java 8 Consumer Interface Explained
No ratings yet
Java 8 Consumer Interface Explained
7 pages
Answers To List of Java Unanswered Interview Questions
No ratings yet
Answers To List of Java Unanswered Interview Questions
35 pages
Spring Cloud Stream Overview
No ratings yet
Spring Cloud Stream Overview
9 pages
Java Coding Standards Checklist
No ratings yet
Java Coding Standards Checklist
12 pages
Azure SDK for Java: Maven Guide
No ratings yet
Azure SDK for Java: Maven Guide
142 pages
Top 50 Microservices Interview Questions
No ratings yet
Top 50 Microservices Interview Questions
16 pages
Java Interview Prep Guide
100% (15)
Java Interview Prep Guide
32 pages
Java Fundamentals Part-1
No ratings yet
Java Fundamentals Part-1
6 pages
Spring Boot Rest API
No ratings yet
Spring Boot Rest API
91 pages
Spring Annotations Guide
No ratings yet
Spring Annotations Guide
3 pages
Dev Sharma: Senior .Net Developer Profile
No ratings yet
Dev Sharma: Senior .Net Developer Profile
8 pages
Containerizing Spring Boot with Docker
No ratings yet
Containerizing Spring Boot with Docker
9 pages
An Enlightenment To Machine Learning
100% (1)
An Enlightenment To Machine Learning
16 pages
Air Quality Prediction Using Machine Learning
No ratings yet
Air Quality Prediction Using Machine Learning
29 pages
Machine Learning 1
No ratings yet
Machine Learning 1
34 pages
Machine Learning 3
No ratings yet
Machine Learning 3
30 pages
Jntuk r20 ML Unit-I (Chapter-I)
No ratings yet
Jntuk r20 ML Unit-I (Chapter-I)
18 pages
Introduction to Machine Learning Basics
No ratings yet
Introduction to Machine Learning Basics
22 pages
10 Machine Learning
No ratings yet
10 Machine Learning
9 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
57 pages
Introduction to Machine Learning Basics
No ratings yet
Introduction to Machine Learning Basics
34 pages
A 6 Step Field Guide For Building Machine Learning Projects
No ratings yet
A 6 Step Field Guide For Building Machine Learning Projects
17 pages
ML Unit-I
No ratings yet
ML Unit-I
28 pages
Understanding NoSQL Architecture
No ratings yet
Understanding NoSQL Architecture
22 pages
Data Mining Techniques Overview
No ratings yet
Data Mining Techniques Overview
33 pages
Design Thinking Course Overview
No ratings yet
Design Thinking Course Overview
18 pages
R3 Corda Resp
No ratings yet
R3 Corda Resp
37 pages
The Art of Cryptography - Resp
No ratings yet
The Art of Cryptography - Resp
24 pages
For A Given Vector in 2D Space, Stretching It by A Value of 2 Is Called
100% (1)
For A Given Vector in 2D Space, Stretching It by A Value of 2 Is Called
23 pages
Data Visualization Aurora - Resp
100% (1)
Data Visualization Aurora - Resp
33 pages
Threat Modeling
100% (1)
Threat Modeling
39 pages
Amazon Management and Monitoring Services - Resp
No ratings yet
Amazon Management and Monitoring Services - Resp
30 pages
Understanding Data Mining Techniques
No ratings yet
Understanding Data Mining Techniques
39 pages
An Ingression Into Deep Learning - Resp
No ratings yet
An Ingression Into Deep Learning - Resp
25 pages
Intro to ML: House Price Prediction
No ratings yet
Intro to ML: House Price Prediction
18 pages
NoSQL - Database Revolution - Resp
50% (2)
NoSQL - Database Revolution - Resp
54 pages
MFDM™ AI - The Renaissance - QUIZ - Atualizado - Resp
67% (49)
MFDM™ AI - The Renaissance - QUIZ - Atualizado - Resp
10 pages
Implementing Design Thinking Resp
84% (19)
Implementing Design Thinking Resp
4 pages
Gradle & Jenkins Build Insights
No ratings yet
Gradle & Jenkins Build Insights
4 pages
Cloud Computing Resp
No ratings yet
Cloud Computing Resp
2 pages
HTML5 - Semantic Elements - Resp
No ratings yet
HTML5 - Semantic Elements - Resp
11 pages
Realm of Logo Design RESP
No ratings yet
Realm of Logo Design RESP
4 pages
Interaction Design - Greenhorns Guide - FA Resp
73% (11)
Interaction Design - Greenhorns Guide - FA Resp
6 pages
RPA - With - UiPath Resp More
No ratings yet
RPA - With - UiPath Resp More
24 pages
Pega Automation Questions & Answers
100% (1)
Pega Automation Questions & Answers
6 pages
Docker Resp
No ratings yet
Docker Resp
2 pages
RPA With UiPath Resp
No ratings yet
RPA With UiPath Resp
7 pages
Module 5: Machine Learning Overview
No ratings yet
Module 5: Machine Learning Overview
33 pages
E-Recommendation Techniques Overview
No ratings yet
E-Recommendation Techniques Overview
5 pages
SOLUTION ONLY CODE DWDM - Lab - All
No ratings yet
SOLUTION ONLY CODE DWDM - Lab - All
8 pages
A Survey of Utility-Oriented Pattern Mining
No ratings yet
A Survey of Utility-Oriented Pattern Mining
22 pages
Mining Association Rules Guide
No ratings yet
Mining Association Rules Guide
41 pages
IT446 Test Bank
No ratings yet
IT446 Test Bank
57 pages
Marketing Analytics - Week 11 - LAQ
No ratings yet
Marketing Analytics - Week 11 - LAQ
5 pages
BSC Data Science 1st Semester Syllabus
No ratings yet
BSC Data Science 1st Semester Syllabus
34 pages
Bloom Filter & Algorithms Guide
No ratings yet
Bloom Filter & Algorithms Guide
9 pages
Association Rules in Cosmetic Purchases
No ratings yet
Association Rules in Cosmetic Purchases
2 pages
21CSE355T DMA-8-15 Marks Question Bank
No ratings yet
21CSE355T DMA-8-15 Marks Question Bank
2 pages
Clustering Techniques in Data Mining
No ratings yet
Clustering Techniques in Data Mining
5 pages
Introduction to Data Mining Techniques
100% (1)
Introduction to Data Mining Techniques
31 pages
Data Mining Exam Questions and Tasks
No ratings yet
Data Mining Exam Questions and Tasks
2 pages
Data Mining Course Overview - B. Tech CSE
No ratings yet
Data Mining Course Overview - B. Tech CSE
13 pages
Realtime Application Projects 2021-22
No ratings yet
Realtime Application Projects 2021-22
4 pages
DMWH Syllabus
No ratings yet
DMWH Syllabus
3 pages
Data Warehousing and Data Mining Dr.P.rizwan Ahmed
0% (1)
Data Warehousing and Data Mining Dr.P.rizwan Ahmed
20 pages
Student Performance Analysis with Apriori
No ratings yet
Student Performance Analysis with Apriori
5 pages
DM Unit Wise Important Questions
No ratings yet
DM Unit Wise Important Questions
6 pages
Data Mining Series 2 Important Topics
No ratings yet
Data Mining Series 2 Important Topics
22 pages
Unit 3 DW&DM Notes Mr. Rohit Pratap Singh
No ratings yet
Unit 3 DW&DM Notes Mr. Rohit Pratap Singh
22 pages
Concepts and Techniques: Data Mining
No ratings yet
Concepts and Techniques: Data Mining
81 pages
Business Intelligence Syllabus 2020
No ratings yet
Business Intelligence Syllabus 2020
3 pages
FP Tree
No ratings yet
FP Tree
54 pages
Association Rule Mining Activity
No ratings yet
Association Rule Mining Activity
4 pages
Data Mining: Machine Learning Algorithms
No ratings yet
Data Mining: Machine Learning Algorithms
111 pages
Overview of Spatial Data Mining Techniques
No ratings yet
Overview of Spatial Data Mining Techniques
6 pages
Computers
No ratings yet
Computers
167 pages
Predicting Undergraduate Admission: A Case Study in Bangabandhu Sheikh Mujibur Rahman Science and Technology University, Bangladesh
No ratings yet
Predicting Undergraduate Admission: A Case Study in Bangabandhu Sheikh Mujibur Rahman Science and Technology University, Bangladesh
8 pages

Introduction to Machine Learning Concepts

Uploaded by

Introduction to Machine Learning Concepts

Uploaded by

An Enlightenment to Machine Learning

Why Machine Learning?

What is Machine Learning?

Machine learning is a tool for turning information into knowledge.

Deep dive into the concepts to know more.

Traditional Learning vs. Machine Learning

 Blends human-created rules with data to create answers to a problem.

 Uses data and answers to uncover the rules that build a problem.

What Machine Learning does?

 Machine Learning has Classification

Do you want to predict a quantity?

 Machine Learning has Regression

What Machine Learning does?

 Machine Learning has Anomaly Detection

Do you want to discover structure in unexplored data?

 Machine Learning has Clustering

Machine Learning Adventures

1. Defining Project Objectives

Defining Project Objectives

Some important things to remember while gathering data are:

 Data can be collected from anywhere in any format.

 Analyzing datasets to summarize their notable characteristics is called

Interpret and Communicate

Deployment and Documentation

Types of Machine Learning

Know the Differences

Machine Learning in SDLC

The three phases include:

Machine Learning Terminologies

Machine Learning Terminologies

 Categorical variables are variables with a discrete set of possible values.

Machine Learning Terminologies

Machine Learning Terminologies

Classification is the process of identifying a category to which a new observation

 It follows a two-step process, namely:

Association Rule Mining

The derived relationships are represented in the form of Association Rules.

Association Rule Mining with Apriori

The types of Outlier are as follows:

Watch this video to know more.

P(A|B)= P(B∣A)P(A) / P(B)

Support Vector Machine

Which Machine Learning technique would you suggest to develop a machine

______________ is given a system of rewards and punishments.

Linear Regression helps in predicting the ____________ output.

Support Vector Machine is used for ____________ type(s) of problems.

Classification and Regression

_____________ Learning draws inspiration from psychological behavior.

_____________ is a tool for turning information into knowledge.

______________ outlier deviates significantly from the entire dataset.

An observation that deviates significantly from other observations in the

A marketing company wants to group its customers into various groups to

____________ learning blends rules created by humans with data to develop

You might also like