Welcome to Scribd!

DMT Session-3 by Kushal Anjaria: Goal: Previously Unseen Records Should Be Assigned A Class As Accurately As Possible

Uploaded by

0% found this document useful (0 votes)

10 views2 pages

The document discusses classification algorithms in data mining. Specifically, it describes building classification models to predict an object's class based on its attributes. As an example, a patient's attributes like heart rate and blood rate could be used to predict if they have a disease or not. It then introduces decision trees as a classification method. Decision trees accept a training data set and build a tree to help make predictions. Each internal node tests an attribute, branches correspond to attribute values, and leaf nodes assign a classification. The goal is to design the best decision tree that accurately fits the training set by correctly classifying all records.

Original Description:

Class notes on Data Science

Original Title

Lecture Notes session 2

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

10 views2 pages

DMT Session-3 by Kushal Anjaria: Goal: Previously Unseen Records Should Be Assigned A Class As Accurately As Possible

Uploaded by

Mighty Singh

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

DMT Session-3

By Kushal Anjaria

The second pattern that we will study is the classification algorithm. In Data Mining, one of the most common tasks is
to build models to predict the class of an object based on its attributes. Here the object can be seen as a customer, patient,
transaction, email message, or even a single character. Characteristics of such objects can be, for example, for the patient
object, heart rate, blood pressure, weight, and gender. In contrast, the patient object's class would most commonly be
positive/negative for a specific disease. This section considers learning a classification tree model using the data we
have about such objects. Figure-1 illustrates the classification task in detail.

Given a collection of records (training set), each record contains a set of attributes; one of the attributes is class. Find a
model of a class attribute as a function of the values of the other variable

Goal: previously unseen records should be assigned a class as accurately as possible.

The test set is used to determine the accuracy of the model. Usually, the give data set is divided into training, and test
sets, with the training set, used to build the model and the test set used to validate it.
E.g., Email is spam or not, story or tweet falls under which category, doctor diagnosis
Multiple data mining techniques can perform the classification task. From the set of techniques, we will initiate with the
decision tree technique. The advantage of the decision tree technique is it is highly human interpretable. As a result, the
decision tree approach hardly requires any transformation.
Decision Tree
It is a classification method where we accept the training data set and design a decision tree. The decision tree helps us
in making the practical decision
The toy training set is as follows:
From the training dataset, the decision tree can be drawn as follows:
Outlook

If you observe the decision tree, then you will realize that

• Each internal node tests the attribute

• Each branch corresponds to the attribute value
• Each leaf-node assigns a classification

In data mining, we aim to design the best decision tree which best fits our training set. Best fit means for each row of
the training set; our decision tree should give us the correct result.

ML Unit-1
Document12 pages
ML Unit-1
20-6616 Abhinay
No ratings yet
Introduction To Data Mining Using Orange
Document72 pages
Introduction To Data Mining Using Orange
Mighty Singh
No ratings yet
Kay Sunderland: Making The Grade at Attain Learning: Name: Dheeraj Sarda Roll No.: P40065 Section: B
Document1 page
Kay Sunderland: Making The Grade at Attain Learning: Name: Dheeraj Sarda Roll No.: P40065 Section: B
Mighty Singh
No ratings yet
Unit-3 DWDM
Document11 pages
Unit-3 DWDM
PRANITHA REDDY
No ratings yet
Dwdm-Unit-3 R16
Document14 pages
Dwdm-Unit-3 R16
Manaswini Bhaskaruni
No ratings yet
Unit III Data Mining Techniques
Document17 pages
Unit III Data Mining Techniques
Ajit Raut
No ratings yet
Data Mining UNIT-2 Notes
Document91 pages
Data Mining UNIT-2 Notes
padma
No ratings yet
ITP4-Lesson 4-Week 7-8
Document18 pages
ITP4-Lesson 4-Week 7-8
Jamaica Mercolita
No ratings yet
Data Mining Unit-1 Notes
Document18 pages
Data Mining Unit-1 Notes
bkharthik1
No ratings yet
Unit-4 AML (1. Basics and K-NN)
Document25 pages
Unit-4 AML (1. Basics and K-NN)
hirenprajapati722
No ratings yet
Unit 4 DWDM
Document8 pages
Unit 4 DWDM
mrdeepuu000
No ratings yet
Classification & Prediction
Document19 pages
Classification & Prediction
AKANKSHA GARG
No ratings yet
18mca52c U3
Document8 pages
18mca52c U3
Sivarajan
No ratings yet
11 W11NSE6220 - Fall 2023 - Zeng
Document43 pages
11 W11NSE6220 - Fall 2023 - Zeng
rahul101056
No ratings yet
CLASSIFICATION
Document21 pages
CLASSIFICATION
Oviya.R
No ratings yet
Overview of Clustering:: UNIT-5
Document27 pages
Overview of Clustering:: UNIT-5
Kalyan Varma
No ratings yet
ML Unit 2
Document31 pages
ML Unit 2
Krishnaveni Yata
No ratings yet
DMDW - B Tech - Unit - 3
Document5 pages
DMDW - B Tech - Unit - 3
Pagoti Jyothirmaye
No ratings yet
O4GzewSLTumFLEulyGHINQ Data-Types Scales Degree
Document4 pages
O4GzewSLTumFLEulyGHINQ Data-Types Scales Degree
Yamini Lokhande
No ratings yet
About Classificatio1
Document5 pages
About Classificatio1
ariful
No ratings yet
Konsep Ensemble
Document52 pages
Konsep Ensemble
hary170893
No ratings yet
AI Assignment 2
Document5 pages
AI Assignment 2
Abraham Onyedikachi Ogudu
No ratings yet
Decision Tree Algorithm, Explained-1-22
Document22 pages
Decision Tree Algorithm, Explained-1-22
shyla
No ratings yet
Chapter-V CLASSIFICATION & CLUSTERING
Document153 pages
Chapter-V CLASSIFICATION & CLUSTERING
21053259
No ratings yet
Decision Trees
Document14 pages
Decision Trees
Justin Russo Harry
50% (2)
Unit-Iv DWDM
Document28 pages
Unit-Iv DWDM
varsha.j2177
No ratings yet
DMWH M3
Document21 pages
DMWH M3
BINESH
No ratings yet
Statistics For Data Science - 1
Document38 pages
Statistics For Data Science - 1
Akash Srivastava
100% (1)
Chapter 3: Data Mining
Document20 pages
Chapter 3: Data Mining
shreya
No ratings yet
Bike Buyer Prediction Using Classification Algorithm
Document19 pages
Bike Buyer Prediction Using Classification Algorithm
chaitra pujar
No ratings yet
Data Mining
Document68 pages
Data Mining
Ipsita
No ratings yet
Classification and Prediction
Document41 pages
Classification and Prediction
kolluriniteesh111
No ratings yet
Data Analytics - Unit-IV
Document21 pages
Data Analytics - Unit-IV
bhavya.shivani1473
No ratings yet
05 Logistic Regression
Document12 pages
05 Logistic Regression
hayero5557
No ratings yet
Classification and Clustering
Document8 pages
Classification and Clustering
Divya G
No ratings yet
Basic Notes
Document26 pages
Basic Notes
salaandeska2015
No ratings yet
Unit 1
Document52 pages
Unit 1
Rishabh Soni
No ratings yet
Data Mining and Visualization Question Bank
Document11 pages
Data Mining and Visualization Question Bank
ghost
100% (1)
DM Notes - UNIT 3
Document24 pages
DM Notes - UNIT 3
Raparthi Jaychandra
No ratings yet
JETIR1809788
Document4 pages
JETIR1809788
Agusti Frananda Alfonsus Naibaho
No ratings yet
Unit 8 Classification and Prediction: Structure
Document16 pages
Unit 8 Classification and Prediction: Structure
Kamal Kant
No ratings yet
Machine Learning & Data Mining: Understanding
Document7 pages
Machine Learning & Data Mining: Understanding
Rassellas Rassell
No ratings yet
Machine Learning Supervised
Document42 pages
Machine Learning Supervised
niyati1120
No ratings yet
For More Visit WWW - Ktunotes.in
Document21 pages
For More Visit WWW - Ktunotes.in
Archa Rajan
No ratings yet
MZU-MBA-DATA ANALYTICS - Data Science and Business Analysis - Unit 3
Document39 pages
MZU-MBA-DATA ANALYTICS - Data Science and Business Analysis - Unit 3
Aamir Reza
No ratings yet
ML Unit-2
Document51 pages
ML Unit-2
diroja5648
No ratings yet
Data Mining Ch-3
Document51 pages
Data Mining Ch-3
Hasset Tiss Abay Genji
No ratings yet
Decision Tree R
Document5 pages
Decision Tree R
Divya B
No ratings yet
Lecture Note 5
Document7 pages
Lecture Note 5
vivek gupta
No ratings yet
DM Mod 3
Document14 pages
DM Mod 3
brandon paxton
No ratings yet
Survey of Classification Techniques in Data Mining: Open Access
Document10 pages
Survey of Classification Techniques in Data Mining: Open Access
Fahri Alfiandi Stsetia
No ratings yet
Classification Algorithm in Data Mining: An
Document6 pages
Classification Algorithm in Data Mining: An
Mosaddek Hossain
No ratings yet
Decision Tree
Document57 pages
Decision Tree
Prabhjit Singh
100% (1)
Data Mining Functionalities
Document4 pages
Data Mining Functionalities
Im' Possible
100% (1)
Machine Learning: BY:Vatsal J. Gajera (09BCE010)
Document25 pages
Machine Learning: BY:Vatsal J. Gajera (09BCE010)
Riya Yadav
No ratings yet
Fundamentals of Machine Learning II
Document13 pages
Fundamentals of Machine Learning II
ssakhare2001
No ratings yet
How Decision Tree Algorithm Works
Document16 pages
How Decision Tree Algorithm Works
hnoor6
No ratings yet
Data Science Crash Course
Document32 pages
Data Science Crash Course
Abhinandan Chatterjee
No ratings yet
Machine Learning Section4 Ebook v03
Document20 pages
Machine Learning Section4 Ebook v03
camgova
No ratings yet
(IJCST-V9I3P23) :aditi Linge, Bhavya Malviya, Digvijay Raut, Payal Ekre
Document3 pages
(IJCST-V9I3P23) :aditi Linge, Bhavya Malviya, Digvijay Raut, Payal Ekre
EighthSenseGroup
No ratings yet
Unit-Iii: Classification and Prediction
Document21 pages
Unit-Iii: Classification and Prediction
Amrusha Naalla
No ratings yet
Alternating Decision Tree: Fundamentals and Applications
From Everand
Alternating Decision Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
Performance Trusted Supplier
Document3 pages
Performance Trusted Supplier
Mighty Singh
No ratings yet
5 Sources of Power and Influence
Document2 pages
5 Sources of Power and Influence
Mighty Singh
No ratings yet
Data Mining Techniques (DMT) by Kushal Anjaria Session-2: Tid Items
Document4 pages
Data Mining Techniques (DMT) by Kushal Anjaria Session-2: Tid Items
Mighty Singh
No ratings yet
8 - GST-8-EXAMPLES - Email BEFORE The Session
Document2 pages
8 - GST-8-EXAMPLES - Email BEFORE The Session
Mighty Singh
No ratings yet
Value Lies in The Belief of The Investor Prof. R. Ramaseshan
Document5 pages
Value Lies in The Belief of The Investor Prof. R. Ramaseshan
Mighty Singh
No ratings yet
6 - COMPUTATION OF TAXABLE VALUE - Q - As - AFTER SESSION - 9
Document21 pages
6 - COMPUTATION OF TAXABLE VALUE - Q - As - AFTER SESSION - 9
Mighty Singh
No ratings yet
Data Mining Techniques (DMT) by Kushal Anjaria Session-1 (Lecture Note)
Document2 pages
Data Mining Techniques (DMT) by Kushal Anjaria Session-1 (Lecture Note)
Mighty Singh
No ratings yet
DMT Session-6 by Kushal Anjaria Next, We Will See How To Compare Multiple Classifier?
Document3 pages
DMT Session-6 by Kushal Anjaria Next, We Will See How To Compare Multiple Classifier?
Mighty Singh
No ratings yet
Lecture Note Session5
Document1 page
Lecture Note Session5
Mighty Singh
No ratings yet
Orange3 Data Mining Library Using Python
Document102 pages
Orange3 Data Mining Library Using Python
Mighty Singh
0% (1)
DRW Technology Situational Analysis
Document2 pages
DRW Technology Situational Analysis
Mighty Singh
No ratings yet