Decision Tree in Machine Learning
A decision tree is a supervised learning algorithm used for both classification
and regression tasks. It has a hierarchical tree structure which consists of a
root node, branches, internal nodes and leaf nodes. It works like a
flowchart that helps to make decisions step by step, where:
Internal nodes represent attribute tests
Branches represent attribute values
Leaf nodes represent final decisions or predictions.
Decision trees are widely used due to their interpretability, flexibility and low
preprocessing needs.
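Because decision trees need little preprocessing, a basic classifier can be trained in just a few lines. Below is a minimal sketch using scikit-learn's DecisionTreeClassifier; the tiny toy dataset and feature encoding are assumptions made purely for illustration, not part of the original example:

```python
# Minimal sketch: training a decision tree classifier with scikit-learn.
# The toy data (income in dollars, age, previous purchases -> buy or not)
# is invented only to show the API.
from sklearn.tree import DecisionTreeClassifier

# Features: [income, age, previous_purchases]
X = [
    [60_000, 35, 2],
    [45_000, 28, 0],
    [80_000, 40, 1],
    [30_000, 22, 0],
]
y = ["Purchase", "No Purchase", "Purchase", "No Purchase"]

# criterion="entropy" makes the splits Information-Gain based
# (the default criterion is "gini")
model = DecisionTreeClassifier(criterion="entropy", random_state=0)
model.fit(X, y)

# Predict for a new customer
print(model.predict([[55_000, 33, 1]]))
```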
How Does a Decision Tree Work?
A decision tree splits the dataset based on feature values to create pure
subsets ideally all items in a group belong to the same class. Each leaf node
of the tree corresponds to a class label and the internal nodes are feature-
based decision points. Let’s understand this with an example.
[Figure: Decision Tree]
Let’s consider a decision tree for predicting whether a customer will buy a
product based on age, income and previous purchases. Here's how the
decision tree works:
1. Root Node (Income)
First Question: "Is the person’s income greater than $50,000?"
If Yes, proceed to the next question.
If No, predict "No Purchase" (leaf node).
2. Internal Node (Age):
If the person’s income is greater than $50,000, ask: "Is the person’s age
above 30?"
If Yes, proceed to the next question.
If No, predict "No Purchase" (leaf node).
3. Internal Node (Previous Purchases):
If the person is above 30 and has made previous purchases, predict
"Purchase" (leaf node).
If the person is above 30 and has not made previous purchases, predict
"No Purchase" (leaf node).
Decision Making with 2 Decision Trees
Example: Predicting Whether a Customer Will Buy a Product Using Two
Decision Trees
Tree 1: Customer Demographics
The first tree asks two questions:
1. "Income > $50,000?"
If Yes, proceed to the next question.
If No, "No Purchase"
2. "Age > 30?"
Yes: "Purchase"
No: "No Purchase"
Tree 2: Previous Purchases
"Previous Purchases > 0?"
Yes: "Purchase"
No: "No Purchase"
Once we have predictions from both trees, we can combine the results to
make a final prediction. If Tree 1 predicts "Purchase" and Tree 2 predicts "No
Purchase", the final prediction might be "Purchase" or "No Purchase"
depending on the weight or confidence assigned to each tree. This can be
decided based on the problem context.
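As one concrete (hypothetical) way to combine the two trees, each tree can be given a weight and the weighted votes summed. The sketch below assumes weights of 0.6 and 0.4, chosen only to illustrate the idea:

```python
def tree1_demographics(income, age):
    """Tree 1: income, then age."""
    if income <= 50_000:
        return "No Purchase"
    return "Purchase" if age > 30 else "No Purchase"

def tree2_history(previous_purchases):
    """Tree 2: previous purchases only."""
    return "Purchase" if previous_purchases > 0 else "No Purchase"

def combined_prediction(income, age, previous_purchases, w1=0.6, w2=0.4):
    """Weighted vote of the two trees; the weights are illustrative."""
    votes = {"Purchase": 0.0, "No Purchase": 0.0}
    votes[tree1_demographics(income, age)] += w1
    votes[tree2_history(previous_purchases)] += w2
    return max(votes, key=votes.get)

# Tree 1 says "Purchase", Tree 2 says "No Purchase"; Tree 1's weight wins.
print(combined_prediction(income=70_000, age=35, previous_purchases=0))  # Purchase
```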
Information Gain and Gini Index in Decision Tree
So far we have covered the basic intuition behind how a decision tree
works, so let's move on to the attribute selection measures of a decision
tree. Two popular attribute selection measures are used:
1. Information Gain
Information Gain tells us how useful a question (or feature) is for splitting
data into groups. It measures how much the uncertainty decreases after the
split. A good question will create clearer groups and the feature with the
highest Information Gain is chosen to make the decision.
For example, if we split a dataset of people into "Young" and "Old" based on
age, and all young people bought the product while all old people did not, the
Information Gain would be high because the split perfectly separates the two
groups with no uncertainty left.
Suppose S is a set of instances, A is an attribute, S_v is the subset of S for
which A = v, v represents an individual value that the attribute A can take
and Values(A) is the set of all possible values of A, then
$Gain(S, A) = Entropy(S) - \sum_{v \in Values(A)} \frac{|S_v|}{|S|} \cdot Entropy(S_v)$
Entropy: the measure of uncertainty of a random variable; it
characterizes the impurity of an arbitrary collection of examples. The
higher the entropy, the more the information content.
For example, if a dataset has an equal number of "Yes" and "No" outcomes
(like 3 people who bought a product and 3 who didn’t), the entropy is high
because it’s uncertain which outcome to predict. But if all the outcomes are
the same (all "Yes" or all "No"), the entropy is 0, meaning there is no
uncertainty left in predicting the outcome.
Formally, for a set S whose instances belong to classes c with proportions p_c, entropy is defined as
$Entropy(S) = -\sum_{c} p_c \log_2 p_c$
Example:
For the set X = {a,a,a,b,b,b,b,b}
Total instances: 8
Instances of b: 5
Instances of a: 3
Entropy $H(X) = -\left[\frac{3}{8}\log_2\frac{3}{8} + \frac{5}{8}\log_2\frac{5}{8}\right] = -[0.375(-1.415) + 0.625(-0.678)] = -(-0.53 - 0.424) = 0.954$
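A few lines of Python can reproduce this calculation; this is only a sketch of the entropy formula applied to the counts above (the helper name entropy is ours, not from any particular library):

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy of a list of class labels, in bits."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

X = list("aaabbbbb")         # 3 a's and 5 b's
print(round(entropy(X), 3))  # 0.954
```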
Building a Decision Tree using Information Gain: The Essentials
Start with all training instances associated with the root node
Use info gain to choose which attribute to label each node with
Recursively construct each subtree on the subset of training instances
that would be classified down that path in the tree.
If all positive or all negative training instances remain, label that node
"yes" or "no" accordingly
If no attributes remain, label with a majority vote of the training instances
left at that node
If no instances remain, label with a majority vote of the parent's training
instances.
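These steps can be turned into a short recursive procedure. The sketch below is a simplified ID3-style implementation for discrete features, written only to illustrate the recursion; the function and variable names (and the tiny made-up dataset at the end) are our own, not from any library:

```python
import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())

def info_gain(rows, labels, feature):
    """Information gain of splitting (rows, labels) on one feature index."""
    total = len(labels)
    gain = entropy(labels)
    for value in set(row[feature] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[feature] == value]
        gain -= (len(subset) / total) * entropy(subset)
    return gain

def build_tree(rows, labels, features, parent_majority=None):
    """Recursively build an ID3-style tree; returns nested dicts or a class label."""
    if not labels:                          # no instances: use parent's majority vote
        return parent_majority
    if len(set(labels)) == 1:               # pure node: label with that class
        return labels[0]
    majority = Counter(labels).most_common(1)[0][0]
    if not features:                        # no attributes left: majority vote
        return majority
    # choose the attribute with the highest information gain
    best = max(features, key=lambda f: info_gain(rows, labels, f))
    tree = {"feature": best, "children": {}}
    remaining = [f for f in features if f != best]
    for value in set(row[best] for row in rows):
        idx = [i for i, row in enumerate(rows) if row[best] == value]
        tree["children"][value] = build_tree(
            [rows[i] for i in idx], [labels[i] for i in idx],
            remaining, parent_majority=majority)
    return tree

# Tiny made-up run: feature 0 perfectly predicts the label, so it becomes the root.
rows = [(1, 0), (1, 1), (0, 0), (0, 1)]
labels = ["yes", "yes", "no", "no"]
print(build_tree(rows, labels, features=[0, 1]))
# prints a nested dict such as {'feature': 0, 'children': {0: 'no', 1: 'yes'}}
```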
Example: Now let us draw a Decision Tree for the following data using
Information gain. Training set: 3 features and 2 classes
X Y Z C
1 1 1 I
1 1 0 I
0 0 1 II
1 0 0 II
Here, we have 3 features and 2 output classes. To build a decision tree
using Information Gain, we will take each of the features and calculate the
information gain for each feature.
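As a quick check of which feature would be chosen as the root, the gains for X, Y and Z can be computed directly from the table above. The snippet below is a sketch using a hand-rolled information-gain helper (not a library function):

```python
import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())

def info_gain(values, labels):
    """Gain(S, A) = Entropy(S) - sum_v |S_v|/|S| * Entropy(S_v)."""
    total = len(labels)
    gain = entropy(labels)
    for v in set(values):
        subset = [lab for val, lab in zip(values, labels) if val == v]
        gain -= (len(subset) / total) * entropy(subset)
    return gain

# The training set from the table above, one column per feature
X = [1, 1, 0, 1]
Y = [1, 1, 0, 0]
Z = [1, 0, 1, 0]
C = ["I", "I", "II", "II"]

for name, column in [("X", X), ("Y", Y), ("Z", Z)]:
    print(name, round(info_gain(column, C), 3))
# X 0.311, Y 1.0, Z 0.0  ->  Y gives the highest gain, so it would become the root.
```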