
Walchand Institute of Technology, Solapur

INFORMATION TECHNOLOGY
COURSE: MACHINE LEARNING
CLASS: B.E. [I.T.] 2020-21 (SEM-II)
Name: Archana Vivekanand Soma
Roll No: 08
Assignment 4

Problem Statement: Implement the Decision Tree algorithm using the following dataset. Calculate the posterior probability for each class. What is the outcome of the prediction?

Decision Tree - Classification


A decision tree builds classification or regression models in the form of a tree structure. It breaks down a dataset into smaller and smaller subsets while an associated decision tree is incrementally developed. The final result is a tree with decision nodes and leaf nodes.
A decision node (e.g., Outlook) has two or more branches (e.g., Sunny, Overcast, and Rainy). A leaf node (e.g., Play) represents a classification or decision. The topmost decision node in a tree, which corresponds to the best predictor, is called the root node.
Decision trees can handle both categorical and numerical data.
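For concreteness, a fitted tree of this kind can be represented as nested dictionaries. The sketch below assumes the classic Play Golf tree suggested by the Outlook/Play examples; the Humidity and Windy sub-splits are the well-known result for that dataset, not taken from this document:

# A decision tree as nested dicts: a decision node maps an attribute
# name to its branches; a leaf is a plain class label.
tree = {
    "Outlook": {
        "Sunny": {"Humidity": {"High": "No", "Normal": "Yes"}},
        "Overcast": "Yes",  # pure branch -> leaf
        "Rainy": {"Windy": {"True": "No", "False": "Yes"}},
    }
}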

  

Algorithm


The core algorithm for building decision trees, called ID3, was developed by J. R. Quinlan; it employs a top-down, greedy search through the space of possible branches with no backtracking. ID3 uses entropy and information gain to construct a decision tree. In the ZeroR model there is no predictor; in the OneR model we try to find the single best predictor; naive Bayes includes all predictors using Bayes' rule and the independence assumption between predictors; a decision tree includes all predictors with dependence assumptions between predictors.

Entropy
A decision tree is built top-down from a root node and involves partitioning the data into subsets that contain instances with similar values (homogeneous). The ID3 algorithm uses entropy to calculate the homogeneity of a sample. If the sample is completely homogeneous, the entropy is zero; if the sample is equally divided, the entropy is one.
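A minimal Python sketch of this calculation (all code examples below use Python), showing the zero and one endpoints described above:

import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

print(entropy(["Yes"] * 8))               # 0.0 -> completely homogeneous
print(entropy(["Yes"] * 4 + ["No"] * 4))  # 1.0 -> equally divided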


To build a decision tree, we need to calculate two types of entropy using frequency tables, as follows:

a) Entropy using the frequency table of one attribute:
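The figure for this case is not reproduced here; in standard notation, the entropy of a target S with c classes and class proportions p_i is:

E(S) = \sum_{i=1}^{c} -p_i \log_2 p_i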


b) Entropy using the frequency table of two attributes:
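In standard notation, the entropy of the target T after splitting on an attribute X weights each branch c of X by its probability:

E(T, X) = \sum_{c \in X} P(c) \, E(c)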


Information Gain
Information gain is based on the decrease in entropy after a dataset is split on an attribute. Constructing a decision tree is all about finding the attribute that returns the highest information gain (i.e., the most homogeneous branches).
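In the notation above, the gain from splitting T on attribute X is:

Gain(T, X) = E(T) - E(T, X)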

Step 1: Calculate the entropy of the target.
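As a worked example (assuming the classic 14-instance Play Golf dataset that the Outlook/Play examples suggest, with 9 Yes and 5 No instances):

E(Play) = -\frac{9}{14}\log_2\frac{9}{14} - \frac{5}{14}\log_2\frac{5}{14} \approx 0.940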


Step 2: The dataset is then split on the different attributes. The entropy for each branch is calculated and added proportionally to get the total entropy for the split. The resulting entropy is subtracted from the entropy before the split; the result is the information gain, or decrease in entropy.
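A minimal sketch of this step, reusing the entropy helper above; rows is assumed to be a list of dicts with a 'Play' target column (a hypothetical representation, chosen for illustration):

def split_entropy(rows, attribute, target="Play"):
    """Weighted sum of branch entropies after splitting on one attribute."""
    branches = {}
    for row in rows:
        branches.setdefault(row[attribute], []).append(row[target])
    return sum(len(labels) / len(rows) * entropy(labels)
               for labels in branches.values())

def information_gain(rows, attribute, target="Play"):
    """Decrease in entropy obtained by splitting on the attribute."""
    before = entropy([row[target] for row in rows])
    return before - split_entropy(rows, attribute, target)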


Step 3: Choose the attribute with the largest information gain as the decision node, divide the dataset by its branches, and repeat the same process on every branch.
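With the helpers above, this step reduces to an argmax over the candidate attributes (the column names here are assumptions for illustration):

candidates = ["Outlook", "Temperature", "Humidity", "Windy"]  # assumed columns
best = max(candidates, key=lambda a: information_gain(rows, a))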


Step 4a: A branch with entropy of 0 is a leaf node.


Step 4b: A branch with entropy more than 0 needs further splitting.

Step 5: The ID3 algorithm is run recursively on the non-leaf branches until all data is classified.
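Putting Steps 3-5 together, a compact recursive ID3 sketch built on the helpers above (falling back to a majority vote when no attributes remain is a common convention, not specified here):

def id3(rows, attributes, target="Play"):
    labels = [row[target] for row in rows]
    # Step 4a: a branch with entropy 0 becomes a leaf node.
    if entropy(labels) == 0:
        return labels[0]
    # No attributes left to split on: fall back to the majority class.
    if not attributes:
        return Counter(labels).most_common(1)[0][0]
    # Step 3: split on the attribute with the largest information gain.
    best = max(attributes, key=lambda a: information_gain(rows, a, target))
    branches = {}
    for row in rows:
        branches.setdefault(row[best], []).append(row)
    remaining = [a for a in attributes if a != best]
    # Steps 4b and 5: recurse on every branch that is not yet pure.
    return {best: {value: id3(subset, remaining, target)
                   for value, subset in branches.items()}}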
 
Decision Tree to Decision Rules
A decision tree can easily be transformed into a set of rules by mapping the paths from the root node to the leaf nodes one by one.
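A short sketch of this mapping for the nested-dict representation used above: each root-to-leaf path becomes one IF-THEN rule.

def to_rules(tree, conditions=()):
    """Yield (conditions, label) pairs, one per root-to-leaf path."""
    if not isinstance(tree, dict):  # reached a leaf: emit the rule
        yield conditions, tree
        return
    (attribute, branches), = tree.items()
    for value, subtree in branches.items():
        yield from to_rules(subtree, conditions + ((attribute, value),))

for conds, label in to_rules(tree):
    clause = " AND ".join(f"{a} = {v}" for a, v in conds)
    print(f"IF {clause} THEN Play = {label}")

For the example tree sketched earlier, this prints rules such as "IF Outlook = Overcast THEN Play = Yes".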
