
DECISION TREE

PROF. ANUPAM MONDAL, PROF. BAVRABI GHOSH


Department of CSE
INSTITUTE OF ENGINEERING AND MANAGEMENT, KOLKATA
Topics to be covered
● What is a Decision Tree?
● Structure details of a Decision Tree
● How to build a Decision Tree?
● Decision Tree based on training dataset
● Entropy and Information Gain of Decision Tree
● Calculation of Entropy
● Calculation of Information Gain
● Conclusion
DECISION TREE

● A decision tree is a machine learning algorithm for classification tasks.
● The model is built in the form of a tree-like structure.
● It is used for multi-dimensional analysis with multiple classes.
● Decision trees are a favourite among users for the following two reasons:
○ Fast execution
○ Ease of interpreting the rules
DECISION TREE contd...

● Goal -> To create a model from past data (past feature vectors) in which the value of the
output variable is predicted from the input variables of the feature vector
● Each node -> tests one feature of the feature vector
● Each leaf node -> one output value
● The feature chosen for branching at a node is the one that gives the
highest information gain.
STRUCTURE DETAILS OF DECISION
TREE
[Figure: an example decision tree built over the feature vector A, B]

Feature vector - A, B
Root node - A (the attribute tested first; its branches are labelled T and F)
Branch nodes - B1, B2 (the attributes tested next; each also branches on T and F)
Leaf nodes - L1, L2, L3 and L4 (classification / assignment of a class)
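As a minimal sketch of how such a structure is used (the node names A, B1, B2 and the leaf labels L1 to L4 follow the figure above; the nested-dict representation and the predict helper are illustrative assumptions, not the slides' code):

```python
# The example tree from the figure, as nested dicts: each internal node tests
# one attribute and branches on True/False; each leaf assigns a class label.
tree = {
    "feature": "A",
    True:  {"feature": "B1", True: "L1", False: "L2"},
    False: {"feature": "B2", True: "L3", False: "L4"},
}

def predict(node, sample):
    """Walk from the root node down to a leaf, testing one attribute per node."""
    while isinstance(node, dict):            # internal node: test its attribute
        node = node[sample[node["feature"]]]
    return node                              # leaf node: the assigned class

print(predict(tree, {"A": True, "B1": False, "B2": True}))   # -> L2
```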


DATASET

(The training dataset, shown as a table on this slide, has 18 instances with features CGPA, Communication, Aptitude and Programming Skill, and a yes/no class label indicating whether each candidate gets a job.)

Decision Tree based on training dataset
ENTROPY AND INFORMATION GAIN OF
DECISION TREE
● Entropy is an expression of the disorder, or randomness, of a
system, or of the lack of information about it.
● Information gain is the measure that decides which feature gives
us the maximum amount of information, and is therefore used to
decide which feature is chosen as the node from where the split
happens at the next stage.
CALCULATION OF ENTROPY AND
INFORMATION GAIN

Entropy (Sbs) -> Entropy before the split = -Σ pi * log2(pi), where pi is the proportion of instances belonging to class i

Entropy (Sas) -> Entropy after the split = the weighted average of the entropies of the partitions produced by the split

Information Gain = Entropy (Sbs) - Entropy (Sas)


CALCULATION OF ENTROPY AND
INFORMATION GAIN contd..
CALCULATIONS (entropy before the split)

p(yes) = 8/18 = 0.44, so -p(yes) * log2(p(yes)) = 0.52

p(no) = 10/18 = 0.56, so -p(no) * log2(p(no)) = 0.47

Total entropy (before the split) = 0.52 + 0.47 = 0.99
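These numbers can be checked with a few lines of Python (a sketch, not the slides' code; the counts of 8 "yes" and 10 "no" instances come from the slide):

```python
import math

def entropy(counts):
    """Entropy of a class distribution: -sum(p_i * log2(p_i))."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

# Before the split: 8 "yes" and 10 "no" instances out of 18 (from the slide).
print(round(entropy([8, 10]), 2))   # -> 0.99
```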


CALCULATION OF ENTROPY AND
INFORMATION GAIN contd..

Total number of instances (rows in the dataset) = 18

Total entropy (after the split) = ((6*0.92) + (7*0.99) + (5*0.00)) / 18 = 0.69
(this is the weighted entropy)

Information Gain = Entropy (before split) - Entropy (after split) = 0.99 - 0.69 = 0.30
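A similar sketch verifies the weighted entropy and the information gain (the partition sizes 6, 7 and 5 and their entropies 0.92, 0.99 and 0.00 are the slide's figures):

```python
# Partitions produced by the split: 6, 7 and 5 instances with entropies
# 0.92, 0.99 and 0.00 respectively (values taken from the slide).
partitions = [(6, 0.92), (7, 0.99), (5, 0.00)]
total = sum(n for n, _ in partitions)                       # 18 instances
entropy_after = sum(n * e for n, e in partitions) / total   # weighted entropy
info_gain = 0.99 - entropy_after                            # before - after

print(round(entropy_after, 2))   # -> 0.69
print(round(info_gain, 2))       # -> 0.30
```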
CALCULATION OF ENTROPY AND INFORMATION GAIN contd..

Comparing the information gain of "CGPA", "Communication", "Aptitude" and
"Programming Skill", it is found that "Aptitude" has the highest information gain.
Therefore "Aptitude" is chosen as the first feature on which the split takes place.
CALCULATION OF ENTROPY AND
INFORMATION GAIN contd..
To choose the feature for the next node we carry on with the entropy and
information gain calculations. But this time the size of the dataset decreases,
as we eliminate the instances for which "Aptitude" is low, because a low
aptitude does not fetch a job. So that branch of the tree freezes.
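The whole procedure described so far can be summarised as a short recursive sketch (an ID3-style outline using the slides' entropy and information gain criterion; the function names and the dict-of-rows data layout are assumptions, not the slides' code):

```python
import math
from collections import Counter

def entropy(labels):
    """Entropy of the class labels: -sum(p_i * log2(p_i))."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())

def info_gain(rows, labels, feature):
    """Entropy before the split minus the weighted entropy after it."""
    after = 0.0
    for value in set(row[feature] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[feature] == value]
        after += len(subset) / len(labels) * entropy(subset)
    return entropy(labels) - after

def build_tree(rows, labels, features):
    """Grow the tree; a branch freezes (becomes a leaf) once it is pure."""
    if len(set(labels)) == 1 or not features:
        return Counter(labels).most_common(1)[0][0]   # leaf: majority class
    best = max(features, key=lambda f: info_gain(rows, labels, f))
    tree = {"feature": best}
    for value in set(row[best] for row in rows):      # split; dataset shrinks
        idx = [i for i, row in enumerate(rows) if row[best] == value]
        tree[value] = build_tree([rows[i] for i in idx],
                                 [labels[i] for i in idx],
                                 [f for f in features if f != best])
    return tree
```

On the slides' dataset, such a sketch would be expected to choose "Aptitude" at the root and immediately turn the low-Aptitude branch into a leaf, since every instance in that branch has the same outcome.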
(The next few slides repeated this entropy and information gain calculation on the reduced dataset and showed the intermediate trees as figures.)
CALCULATION OF ENTROPY AND
INFORMATION GAIN contd..
This is the final decision tree (shown as a figure on this slide).
CODE SNIPPET
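The slide's own code snippet is not reproduced in this text; as a stand-in, here is a minimal sketch of how such a decision tree could be trained with scikit-learn's DecisionTreeClassifier (the example rows are hypothetical, and the column names simply follow the features mentioned earlier; this is not the slides' actual code):

```python
import pandas as pd
from sklearn.tree import DecisionTreeClassifier, export_text

# Hypothetical rows in the spirit of the slides' dataset (not the actual data).
data = pd.DataFrame({
    "CGPA":              ["high", "medium", "low", "high"],
    "Communication":     ["good", "good", "bad", "bad"],
    "Aptitude":          ["high", "low", "low", "high"],
    "Programming Skill": ["good", "bad", "good", "good"],
    "Job":               ["yes", "no", "no", "yes"],
})

X = pd.get_dummies(data.drop(columns="Job"))        # one-hot encode categoricals
y = data["Job"]

clf = DecisionTreeClassifier(criterion="entropy")   # split on information gain
clf.fit(X, y)
print(export_text(clf, feature_names=list(X.columns)))
```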
THANK YOU
