Decision Tree: How To Create A Perfect Decision Tree?
Last updated on Nov 25, 2020 | 89.1K Views
Upasana
Research Analyst, Tech Enthusiast, currently working on Azure IoT & Data Science...
A Decision Tree has many analogies in real life and, as it turns out, it has influenced a wide area of Machine Learning, covering both Classification and Regression. In decision analysis, a decision tree can be used to visually and explicitly represent decisions and decision making. So, the outline of what I'll be covering in this blog is as follows:
What is a Decision Tree?
Advantages and Disadvantages of a Decision Tree
Creating a Decision Tree
What is a Decision Tree?
A decision tree is a map of the possible outcomes of a series of related choices. It allows an
individual or organization to weigh possible actions against one another based on their costs,
probabilities, and benefits.
As the name goes, it uses a tree-like model of decisions. They can be used either to drive
informal discussion or to map out an algorithm that predicts the best choice mathematically.
A decision tree typically starts with a single node, which branches into possible outcomes. Each
of those outcomes leads to additional nodes, which branch off into other possibilities. This
gives it a tree-like shape.
There are three different types of nodes: chance nodes, decision nodes, and end nodes. A
chance node, represented by a circle, shows the probabilities of certain results. A decision
node, represented by a square, shows a decision to be made, and an end node shows the final
outcome of a decision path.
Advantages & Disadvantages of Decision Trees
Advantages
Decision trees generate understandable rules.
Decision trees perform classification without requiring much computation.
Decision trees are capable of handling both continuous and categorical variables.
Decision trees provide a clear indication of which fields are most important for prediction or
classification.
Disadvantages
Decision trees are less appropriate for estimation tasks where the goal is to predict the value of a continuous attribute.
Decision trees are prone to errors in classification problems with many classes and a relatively small number of training examples.
Decision trees can be computationally expensive to train. At each node, each candidate splitting field must be sorted before its best split can be found. In some algorithms, combinations of fields are used and a search must be made for optimal combining weights. Pruning algorithms can also be expensive since many candidate sub-trees must be formed and compared.
Creating a Decision Tree
Let us consider a scenario where a new planet is discovered by a group of astronomers. Now the question is whether it could be 'the next Earth'. The answer to this question will revolutionize the way people live. Well, literally!
There are n deciding factors that need to be thoroughly researched to make an intelligent decision. These factors can be whether water is present on the planet, what the temperature is, whether the surface is prone to continuous storms, whether flora and fauna survive the climate, and so on.
Let us create a decision tree to find out whether we have discovered a new habitat.
The habitable temperature falls into the range 0 to 100 degrees Celsius.
Whether water is present or not?
Whether flora and fauna flourish?
Whether the planet has a stormy surface?
Thus, we have a decision tree.
Classification Rules:
Classification rules are the cases in which all the scenarios are taken into consideration and a class variable is assigned to each.
Class Variable:
Each leaf node is assigned a class variable. A class variable is the final output which leads to our decision.
Let us derive the classification rules from the decision tree created:
1. If Temperature is not between 273 and 373 K -> Survival Difficult
2. If Temperature is between 273 and 373 K, and water is not present -> Survival Difficult
3. If Temperature is between 273 and 373 K, water is present, and flora and fauna are not present -> Survival Difficult
4. If Temperature is between 273 and 373 K, water is present, flora and fauna are present, and a stormy surface is not present -> Survival Probable
5. If Temperature is between 273 and 373 K, water is present, flora and fauna are present, and a stormy surface is present -> Survival Difficult
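These five rules translate directly into nested conditionals. Here is a minimal Python sketch; the function name and the label strings are illustrative, not from any library:

```python
# A minimal sketch of the five habitability rules above as nested conditionals.
def survival(temp_kelvin, water, flora_fauna, stormy):
    if not (273 <= temp_kelvin <= 373):
        return "Survival Difficult"          # rule 1
    if not water:
        return "Survival Difficult"          # rule 2
    if not flora_fauna:
        return "Survival Difficult"          # rule 3
    if stormy:
        return "Survival Difficult"          # rule 5
    return "Survival Probable"               # rule 4

print(survival(300, water=True, flora_fauna=True, stormy=False))  # Survival Probable
```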
Decision Tree
A decision tree has the following constituents:
Root Node: The factor of ‘temperature’ is considered as the root in this case.
Internal Node: The nodes with one incoming edge and 2 or more outgoing edges.
Leaf Node: This is the terminal node with no out-going edge.
Now that the decision tree is constructed, we start at the root node, check the test condition, and follow the matching outgoing edge; at the next node the condition is tested again, and so on. The decision tree is complete when every test condition leads to a leaf node. The leaf nodes contain the class labels, which vote in favor of or against the decision.
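To make these constituents concrete, here is a hypothetical node structure in Python (the Node class and classify function are our own illustration, not from any library): internal nodes carry a test attribute and outgoing edges, leaves carry a class label, and classification walks from the root to a leaf.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

@dataclass
class Node:
    attribute: Optional[str] = None                  # test condition (root/internal nodes)
    children: Dict[str, "Node"] = field(default_factory=dict)  # edge value -> child node
    label: Optional[str] = None                      # class variable (leaf nodes only)

def classify(node: Node, sample: Dict[str, str]) -> str:
    # Start at the root, follow the edge matching the test outcome, repeat until a leaf.
    while node.label is None:
        node = node.children[sample[node.attribute]]
    return node.label

# A one-level example: the root tests 'water', each branch ends in a leaf.
tree = Node(attribute="water", children={
    "yes": Node(label="Survival Probable"),
    "no": Node(label="Survival Difficult"),
})
print(classify(tree, {"water": "yes"}))  # Survival Probable
```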
Now, you might wonder why we started with the 'temperature' attribute at the root. If you choose any other attribute, the decision tree constructed will be different.
Correct. For a particular set of attributes, numerous different trees can be created. We need to choose the optimal tree, which is done by following an algorithmic approach. We will now see 'the greedy approach' to creating a perfect decision tree.
The Greedy Approach
“Greedy Approach is based on the concept of Heuristic Problem Solving by making an optimal
local choice at each node. By making these local optimal choices, we reach the approximate
optimal solution globally.”
The algorithm can be summarized as:
1. At each stage (node), pick out the best feature as the test condition.
2. Now split the node into the possible outcomes (internal nodes).
3. Repeat the above steps until every branch terminates in a leaf node.
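As a rough sketch of these three steps in Python (assuming rows are (feature-dict, label) pairs and pick_best is any scoring rule you plug in; the sections below supply one based on information gain):

```python
from collections import Counter

def build_tree(rows, attributes, pick_best):
    """Greedy recursion: choose a test, split on its outcomes, repeat until leaves."""
    labels = [label for _, label in rows]
    if len(set(labels)) == 1 or not attributes:       # stop: pure node, or no tests left
        return Counter(labels).most_common(1)[0][0]   # leaf carries the (majority) class
    best = pick_best(rows, attributes)                # step 1: best local test condition
    branches = {}
    for features, label in rows:                      # step 2: split into outcomes
        branches.setdefault(features[best], []).append((features, label))
    remaining = [a for a in attributes if a != best]
    return {best: {value: build_tree(subset, remaining, pick_best)  # step 3: repeat
                   for value, subset in branches.items()}}
```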
When you start to implement the algorithm, the first question is: ‘How to pick the starting test
condition?’
The answer to this question lies in the values of 'Entropy' and 'Information Gain'. Let us see what they are and how they impact the creation of our decision tree.
Entropy: Entropy in a decision tree stands for homogeneity. If the data is completely homogeneous, the entropy is 0; if the data is divided 50-50 between the classes, the entropy is 1.
Information Gain: Information Gain is the decrease in entropy when the node is split on an attribute.
An attribute should have the highest information gain to be selected for splitting. Based on the
computed values of Entropy and Information Gain, we choose the best attribute at any
particular step.
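To make these two definitions concrete, here is a minimal Python sketch (the row format, a list of (feature-dict, label) pairs, is our own convention from the sketch above):

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels: 0 = pure, 1 = 50-50."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(rows, attr):
    """Decrease in entropy after splitting (feature-dict, label) rows on attr."""
    labels = [label for _, label in rows]
    branches = {}
    for features, label in rows:
        branches.setdefault(features[attr], []).append(label)
    remainder = sum(len(subset) / len(rows) * entropy(subset)
                    for subset in branches.values())
    return entropy(labels) - remainder

print(entropy(["yes"] * 7 + ["no"] * 7))  # 1.0  -> a 50-50 split is maximally impure
print(entropy(["yes"] * 14))              # -0.0 -> a pure set has zero entropy
```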
Let us consider the following data:
age    | income | student | credit_rating | buys_computer
<=30   | high   | no      | fair          | no
<=30   | high   | no      | excellent     | no
31…40  | high   | no      | fair          | yes
>40    | medium | no      | fair          | yes
>40    | low    | yes     | fair          | yes
>40    | low    | yes     | excellent     | no
31…40  | low    | yes     | excellent     | yes
<=30   | medium | no      | fair          | no
<=30   | low    | yes     | fair          | yes
>40    | medium | yes     | fair          | yes
<=30   | medium | yes     | excellent     | yes
31…40  | medium | no      | excellent     | yes
31…40  | high   | yes     | fair          | yes
>40    | medium | no      | excellent     | no
There can be n decision trees formulated from this set of attributes.
Tree Creation Trial-1:
Here we take up the attribute ‘Student’ as the initial test condition.
Tree Creation Trial-2:
Similarly, why choose 'Student'? We can instead choose 'Income' as the initial test condition.
Creating the Perfect Decision Tree With Greedy Approach
Let us follow the ‘Greedy Approach’ and construct the optimal decision tree.
There are two classes involved: 'Yes', i.e. the person buys a computer, and 'No', i.e. he does not. To calculate Entropy and Information Gain, we compute the probability value for each of these two classes.
For 'buys_computer = yes', the probability comes out to be: P(yes) = 9/14
For 'buys_computer = no', the probability comes out to be: P(no) = 5/14
Entropy in D: We now calculate the entropy by putting these probability values into the entropy formula:
Entropy(D) = -(9/14) log2(9/14) - (5/14) log2(5/14) = 0.940
We have already classified the values of entropy, which are:
Entropy = 0: the data is completely homogeneous (pure)
Entropy = 1: the data is divided 50-50% (impure)
Our value of entropy is 0.940, which means our set is highly impure.
Let's delve deeper to find the most suitable attribute and calculate the Information Gain.
What is information gain if we split on “Age”?
This data represents how many people in each age bracket buy (Yes) and do not buy (No) the product. For example, among people aged 30 or less, 2 people buy and 3 people do not buy the product. Info(D) is calculated for each of these 3 categories of people and is represented in the last column:

age    | yes | no | Info(D)
<=30   | 2   | 3  | 0.971
31…40  | 4   | 0  | 0.000
>40    | 3   | 2  | 0.971
The Info(D) for the Age attribute is computed as the weighted sum over these 3 ranges of age values, which gives 0.694. Now, the question is: what is the 'information gain' if we split on the 'Age' attribute?
The difference between the total information value (0.940) and the information computed for the Age attribute (0.694) gives the 'information gain':
Gain(Age) = 0.940 - 0.694 = 0.246
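As a quick sanity check, here is that arithmetic in a few lines of Python (the helper name info is ours; the per-bracket counts come from the table above):

```python
import math

# info(p, n) is the entropy of a subset with p 'yes' and n 'no' examples.
def info(p, n):
    total = p + n
    return -sum(x / total * math.log2(x / total) for x in (p, n) if x)

# Weighted sum over the three Age brackets: <=30 (2/3), 31…40 (4/0), >40 (3/2).
info_age = (5/14) * info(2, 3) + (4/14) * info(4, 0) + (5/14) * info(3, 2)
print(round(info_age, 3))          # 0.694
print(round(0.940 - info_age, 3))  # 0.246, the information gain for Age
```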
This is the deciding factor for whether we should split at ‘Age’ or any other attribute. Similarly, we calculate
the ‘information gain’ for the rest of the attributes:
Information Gain (Age) = 0.246
Information Gain (Income) = 0.029
Information Gain (Student) = 0.151
Information Gain (credit_rating) = 0.048
On comparing these values of gain for all the attributes, we find out that the ‘information gain’ for ‘Age’ is
the highest. Thus, splitting at ‘age’ is a good decision.
Similarly, at each split, we compare the information gain of each remaining attribute to find out which attribute should be chosen for the split.
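Putting it all together, here is a rough end-to-end sketch, assuming the table above, that computes the gains and grows the tree greedily (an illustrative ID3-style implementation, not the exact code behind any particular library; third-decimal differences from the values above come from rounding intermediate results):

```python
import math
from collections import Counter

# The buys_computer table from above: (features, class label) per person.
DATA = [
    ({"age": "<=30",  "income": "high",   "student": "no",  "credit_rating": "fair"},      "no"),
    ({"age": "<=30",  "income": "high",   "student": "no",  "credit_rating": "excellent"}, "no"),
    ({"age": "31…40", "income": "high",   "student": "no",  "credit_rating": "fair"},      "yes"),
    ({"age": ">40",   "income": "medium", "student": "no",  "credit_rating": "fair"},      "yes"),
    ({"age": ">40",   "income": "low",    "student": "yes", "credit_rating": "fair"},      "yes"),
    ({"age": ">40",   "income": "low",    "student": "yes", "credit_rating": "excellent"}, "no"),
    ({"age": "31…40", "income": "low",    "student": "yes", "credit_rating": "excellent"}, "yes"),
    ({"age": "<=30",  "income": "medium", "student": "no",  "credit_rating": "fair"},      "no"),
    ({"age": "<=30",  "income": "low",    "student": "yes", "credit_rating": "fair"},      "yes"),
    ({"age": ">40",   "income": "medium", "student": "yes", "credit_rating": "fair"},      "yes"),
    ({"age": "<=30",  "income": "medium", "student": "yes", "credit_rating": "excellent"}, "yes"),
    ({"age": "31…40", "income": "medium", "student": "no",  "credit_rating": "excellent"}, "yes"),
    ({"age": "31…40", "income": "high",   "student": "yes", "credit_rating": "fair"},      "yes"),
    ({"age": ">40",   "income": "medium", "student": "no",  "credit_rating": "excellent"}, "no"),
]

def entropy(labels):
    n = len(labels)
    return -sum(c / n * math.log2(c / n) for c in Counter(labels).values())

def gain(rows, attr):
    branches = {}
    for features, label in rows:
        branches.setdefault(features[attr], []).append(label)
    remainder = sum(len(s) / len(rows) * entropy(s) for s in branches.values())
    return entropy([label for _, label in rows]) - remainder

def build(rows, attrs):
    labels = [label for _, label in rows]
    if len(set(labels)) == 1 or not attrs:            # pure node or no attributes left
        return Counter(labels).most_common(1)[0][0]
    best = max(attrs, key=lambda a: gain(rows, a))    # greedy: highest information gain
    branches = {}
    for features, label in rows:
        branches.setdefault(features[best], []).append((features, label))
    rest = [a for a in attrs if a != best]
    return {best: {v: build(sub, rest) for v, sub in branches.items()}}

for a in ["age", "income", "student", "credit_rating"]:
    print(a, round(gain(DATA, a), 3))   # age 0.247, income 0.029, student 0.152,
                                        # credit_rating 0.048 (Age is still highest)
print(build(DATA, ["age", "income", "student", "credit_rating"]))
# {'age': {'<=30': {'student': {'no': 'no', 'yes': 'yes'}}, '31…40': 'yes',
#          '>40': {'credit_rating': {'fair': 'yes', 'excellent': 'no'}}}}
```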
Thus, the optimal tree created looks like this: the root splits on 'Age'; the '<=30' branch is then split on 'Student', the '31…40' branch is a pure 'Yes' leaf, and the '>40' branch is split on 'credit_rating'.
The classification rules for this tree can be jotted down as:
If a person’s age is less than 30 and he is not a student, he will not buy the product.
Age(<30) ^ student(no) = NO
If a person’s age is less than 30 and he is a student, he will buy the product.
Age(<30) ^ student(yes) = YES
If a person’s age is between 31 and 40, he is most likely to buy.
Age(31…40) = YES
If a person’s age is greater than 40 and has an excellent credit rating, he will not buy.
Age(>40) ^ credit_rating(excellent) = NO
If a person’s age is greater than 40, with a fair credit rating, he will probably buy.
Age(>40) ^ credit_rating(fair) = YES
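These five rules can also be encoded directly as a function; a small illustrative sketch (the function name is ours):

```python
# Direct encoding of the five classification rules above.
def buys_computer(age, student, credit_rating):
    if age == "<=30":
        return "yes" if student == "yes" else "no"
    if age == "31…40":
        return "yes"
    return "yes" if credit_rating == "fair" else "no"   # age > 40

print(buys_computer("<=30", "yes", "fair"))  # yes
```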
Thus, we achieve the perfect Decision Tree!!
Now that you have gone through our Decision Tree blog, you can check out Edureka’s Data Science
Certification Training. Got a question for us? Please mention it in the comments section and we will
get back to you.