This document discusses decision trees and how to build them. It explains that decision trees are built using recursive partitioning to classify data by determining the attribute that best splits the data based on its predictive power. It discusses how predictiveness is based on decreasing the impurity of nodes, and that impurity is calculated using entropy, which measures the homogeneity of samples in a node, with 0 being completely homogeneous and 1 being equally divided. Selecting the attribute that most reduces entropy results in the purest nodes.
Instructor: Abinta Mehmood

Outline
• Decision Tree
• How to build a decision tree?
• Selecting the attributes
• Entropy

Decision trees are built using recursive partitioning to classify the data.
What is important in making a decision tree is determining which attribute is the most predictive one to split the data on.

If the patient has high cholesterol, we cannot say with high confidence that drug B is suitable. Likewise, if the patient's cholesterol is normal, we still don't have sufficient evidence to determine whether drug A or drug B is suitable.

If the patient is female, we can say with high certainty that drug B might be suitable for her. But if the patient is male, we don't have sufficient evidence to determine whether drug A or drug B is suitable. Even so, sex is still a better choice than the cholesterol attribute, because the resulting nodes are more pure.

Predictiveness is based on the decrease in impurity of nodes. We are looking for the feature that best decreases the impurity of the patients in the leaves, after splitting them up based on that feature.

Impurity and Entropy
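The comparison above can be sketched in code. This is a minimal illustration with a hypothetical patient dataset (the records and drug labels are invented for this sketch, not taken from the lecture): we split the patients on each candidate attribute and count the drug labels in each branch to see which split produces purer nodes.

```python
from collections import Counter

# Each patient record: (sex, cholesterol, drug) -- illustrative values only.
patients = [
    ("F", "HIGH",   "B"), ("F", "NORMAL", "B"), ("F", "HIGH",   "B"),
    ("M", "HIGH",   "A"), ("M", "NORMAL", "B"), ("M", "HIGH",   "A"),
]

def split_counts(data, attr_index):
    """Group records by one attribute and count drug labels in each branch."""
    branches = {}
    for record in data:
        branches.setdefault(record[attr_index], Counter())[record[2]] += 1
    return branches

print("split by sex:        ", split_counts(patients, 0))
print("split by cholesterol:", split_counts(patients, 1))
```

With this toy data, splitting by sex gives a completely pure female branch (all drug B), while splitting by cholesterol leaves the high-cholesterol branch evenly mixed between the two drugs, which is why the sex attribute would be preferred.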
• A node in the tree is considered pure if 100 percent of the cases in that node fall into a specific category of the target field.
• In fact, the method uses recursive partitioning to split the training records into segments by minimizing the impurity at each step.
• The impurity of a node is calculated from the entropy of the data in the node.
• So, what is entropy? Entropy measures the homogeneity of the samples in a node. If the samples are completely homogeneous, the entropy is zero, and if the samples are equally divided between the classes, the entropy is one.
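The entropy measure described above, and the resulting information gain of a split, can be sketched as follows. This is a minimal Python sketch; the drug labels and the two candidate splits are hypothetical examples chosen to mirror the sex-versus-cholesterol discussion, not data from the lecture.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (log base 2) of a list of class labels:
    0 for a completely homogeneous node, 1 for a 50/50 two-class split."""
    total = len(labels)
    return sum(-(c / total) * math.log2(c / total)
               for c in Counter(labels).values())

def information_gain(labels, groups):
    """Entropy reduction from splitting `labels` into the given subgroups."""
    total = len(labels)
    weighted = sum(len(g) / total * entropy(g) for g in groups)
    return entropy(labels) - weighted

print(entropy(["A", "A", "A", "A"]))  # homogeneous node -> 0.0
print(entropy(["A", "A", "B", "B"]))  # equally divided   -> 1.0

# Hypothetical drug labels for six patients, and two candidate splits.
drugs   = ["B", "B", "B", "A", "B", "A"]
by_sex  = [["B", "B", "B"], ["A", "B", "A"]]   # female branch is pure
by_chol = [["B", "B", "A", "A"], ["B", "B"]]   # high-cholesterol branch is mixed

print("gain (sex):        ", information_gain(drugs, by_sex))
print("gain (cholesterol):", information_gain(drugs, by_chol))
```

Selecting the attribute with the largest information gain, i.e. the largest drop in entropy, yields the purest child nodes; in this toy example the sex split wins because one of its branches is already pure.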