
DECISION TREES

A decision tree represents a function that takes a vector of attribute values as inputs and returns a
‘decision’ – a single output value.

The input and output values can be discrete or continuous.

A decision tree reaches its decision by performing a sequence of tests.

Example: consider the decision of whether to go out to play, depending on the weather conditions. The corresponding decision tree is given below:

Several algorithms are used to build decision trees (e.g. ID3, C4.5, and CART).

Which attribute should be taken as the root node? This question is answered using two measures: 1) entropy and 2) the gini index.

Both entropy and the gini index measure the level of impurity in the dataset.

ENTROPY

Entropy is a measure of the randomness of the data. The more random the data, the more impure it is. When entropy is low, the data is more homogeneous, and the 'information gain' from each split is higher.

Entropy is useful for deciding which attribute should be selected as the root node of the decision tree.

The formula for calculating entropy is:

E(S) = -Σ P(i) log2 P(i)   (summed over all classes i)

where P(i) is the proportion of instances in the dataset that belong to class i.

Example: calculate the entropy. Which attribute should be selected as the root node for the decision to go out to play?

Suppose we take outlook as the root node; then the following decision tree arises.

First, calculate the total entropy E(S) for the dataset.

There are 14 instances (rows): 9 Yes and 5 No.

The formula for entropy:

E(S) = -P(Yes) log2 P(Yes) - P(No) log2 P(No)

E(S) = -(9/14) log2(9/14) - (5/14) log2(5/14)


E(S) = 0.41 + 0.53 = 0.94
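The calculation above can be sketched in Python (the function name `entropy` is just illustrative):

```python
import math

def entropy(counts):
    """Entropy of a class distribution, given the count of each class."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total)
                for c in counts if c > 0)  # convention: 0 * log2(0) = 0

# 9 Yes and 5 No out of 14 instances
print(round(entropy([9, 5]), 2))  # 0.94
```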

Next, calculate the entropy for outlook.


Outlook has 3 different values: Sunny, Overcast, Rainy.

In case of Sunny, no. of ‘Yes’ = 2, no. of ‘No’ = 3

In case of Overcast, no. of ‘Yes’ = 4, no. of ‘No’ = 0

In case of Rainy, no. of ‘Yes’ = 3, no. of ‘No’ = 2

So, Entropy(outlook=Sunny) = -2/5 log2 2/5 - 3/5 log2 3/5 = 0.971

Entropy(outlook=Overcast) = -4/4 log2 4/4 - 0/4 log2 0/4 = 0  (using the convention 0 log2 0 = 0)

Entropy(outlook=Rainy) = -3/5 log2 3/5 - 2/5 log2 2/5 = 0.971

Information from outlook (the weighted average entropy of its branches):

I(outlook) = 5/14 × 0.971 + 4/14 × 0 + 5/14 × 0.971 = 0.693

Information gained from outlook:

Gain(outlook) = E(S) – I(outlook) = 0.94 – 0.693 = 0.247
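The gain computation can be sketched as follows (the helper names `entropy` and `info_gain` are illustrative, not from a library):

```python
import math

def entropy(counts):
    """Entropy of a class distribution, given the count of each class."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total)
                for c in counts if c > 0)

def info_gain(total_counts, splits):
    """Information gain = E(S) minus the weighted entropy of the splits."""
    n = sum(total_counts)
    weighted = sum(sum(s) / n * entropy(s) for s in splits)
    return entropy(total_counts) - weighted

# Outlook branches: Sunny (2 Yes, 3 No), Overcast (4, 0), Rainy (3, 2)
gain = info_gain([9, 5], [[2, 3], [4, 0], [3, 2]])
print(round(gain, 3))  # 0.247
```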

The attribute with the highest gain is selected as the root node. Repeat the same calculation for the other attributes (Temperature, Humidity, and Windy).

The following are the results:

Gain(Outlook) = 0.247
Gain(Temperature) = 0.029
Gain(Humidity) = 0.152
Gain(Windy) = 0.048

Since outlook has the highest information gain, it is selected as the root node.

GINI INDEX

The formula for the gini index is:

Gini = 1 - Σ (P(i))²   (summed over all classes i)

Let us calculate the gini index for outlook. Note down how many 'Yes' and 'No' there are for each value:

Sunny: 2 Yes, 3 No

Overcast: 4 Yes, 0 No

Rainy: 3 Yes, 2 No

Gini(outlook=Sunny) = 1 - (2/5)² - (3/5)² = 0.48

Gini(outlook=Overcast) = 1 - (4/4)² - (0/4)² = 0

Gini(outlook=Rainy) = 1 - (3/5)² - (2/5)² = 0.48

Therefore, the (weighted) gini index for outlook is:

5/14 × 0.48 + 4/14 × 0 + 5/14 × 0.48 = 0.3429
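The weighted gini computation can be sketched as (the function name `gini` is illustrative):

```python
def gini(counts):
    """Gini impurity of a class distribution, given the count of each class."""
    total = sum(counts)
    return 1 - sum((c / total) ** 2 for c in counts)

# Outlook branches: Sunny (2 Yes, 3 No), Overcast (4, 0), Rainy (3, 2)
splits = [[2, 3], [4, 0], [3, 2]]
n = sum(sum(s) for s in splits)
weighted = sum(sum(s) / n * gini(s) for s in splits)
print(round(weighted, 4))  # 0.3429
```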

In the same manner, calculate the gini index for the other attributes as well.

Since the gini index of outlook is the lowest, it produces the least impurity. Hence we select outlook as our root node.
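In practice, this selection is done automatically by a library. A minimal sketch using scikit-learn (assumed available), whose `criterion` parameter switches between the two impurity measures above; the outlook column is encoded as 0=Sunny, 1=Overcast, 2=Rainy, with labels 0=No, 1=Yes, matching the counts in this example:

```python
from sklearn.tree import DecisionTreeClassifier

# Single-feature toy data: outlook (0=Sunny, 1=Overcast, 2=Rainy)
X = [[0], [0], [1], [2], [2], [2], [1],
     [0], [0], [2], [0], [1], [1], [2]]
y = [0, 0, 1, 1, 1, 0, 1, 0, 1, 1, 1, 1, 1, 0]  # 0=No, 1=Yes

clf = DecisionTreeClassifier(criterion="entropy")  # or criterion="gini"
clf.fit(X, y)
print(clf.predict([[1]]))  # Overcast is pure Yes, so the tree predicts 1
```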
