Decision Tree
It is a tree-structured learning technique that is used for classification as well as regression.
Function type
Non-linear
Two types of nodes
Decision nodes
o A test is performed to choose between two or more paths
o The test is done based on the values of a feature or attribute of the instance
Leaf nodes
o Indicates the class of an example
o May also carry a value for the example, e.g., a probability or a score
Fig. 1 An example decision tree – Approval of loan
Generating a decision tree
Issues
• Many different decision trees can fit the given data
• They fit it to different extents (different amounts of error)
• With noisy examples, no decision tree fits the data exactly
Given some training examples, what decision tree should be generated?
One proposal:
Prefer the smallest tree that is consistent with the data (Bias)
Possible method:
Search the space of decision trees for the smallest decision tree that fits the data
Features of a smaller tree
• Low depth
• Fewer nodes
Searching algorithm for the smallest decision tree
Searching the decision tree space for the smallest decision tree that fits the data is a
computationally hard problem that would require extremely high computational resources
One proposal
Greedy algorithm
Fig. 2 A snapshot of training data
Building a decision tree
Searching for a good tree
The decision tree space is too large for a systematic search
At a particular point, two choices to make (a sketch of this recursion is given below):
• STOP and
o return a value for the target feature, or
o return a distribution over target feature values
• CONTINUE and
o choose an input feature to split on
o for each value of that feature, build a subtree from the examples that have this value
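To make the STOP/CONTINUE choice concrete, here is a minimal Python sketch of the greedy, recursive procedure. The representation of examples as a list of dicts, the names build_tree and choose_attribute, and the majority-label stopping rule are assumptions made for illustration, not part of these notes.

from collections import Counter

def choose_attribute(examples, attributes, target):
    # Placeholder choice: take the first available attribute.
    # A real learner would pick the attribute with the highest information gain,
    # as developed later in these notes.
    return attributes[0]

def build_tree(examples, attributes, target):
    """Greedy, recursive decision-tree construction (illustrative sketch).

    examples   : list of dicts mapping feature names to values
    attributes : feature names still available for splitting
    target     : name of the target feature
    """
    labels = [e[target] for e in examples]

    # STOP: all examples agree on the target, or nothing is left to split on;
    # return the majority value of the target feature (a leaf).
    if len(set(labels)) == 1 or not attributes:
        return Counter(labels).most_common(1)[0][0]

    # CONTINUE: choose an input feature and build one subtree per value.
    a = choose_attribute(examples, attributes, target)
    tree = {a: {}}
    for v in {e[a] for e in examples}:
        subset = [e for e in examples if e[a] == v]
        remaining = [x for x in attributes if x != a]
        tree[a][v] = build_tree(subset, remaining, target)
    return tree

The placeholder choose_attribute keeps the sketch self-contained; the remainder of these notes develops information gain as the proper selection criterion.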
Choosing attribute tests
→ Greedy search to minimize the depth of the final tree
→ Choosing the attribute
→ A “perfect” attribute
→ divides the examples into subsets, each of which is entirely positive or entirely negative
Entropy
→ Measure of the uncertainty of a random variable
→ Acquisition of information corresponds to a reduction in entropy
→ Entropy of a random variable V with values v_k, each with probability P(v_k):
   H(V) = Σ_k P(v_k) log2 (1 / P(v_k)) = −Σ_k P(v_k) log2 P(v_k)
→ A flip of a fair coin is equally likely to come up heads or tails, 0 or 1
→ This counts as “1 bit” entropy
→ Fair coin flip:
   H(Fair) = −(0.5 log2 0.5 + 0.5 log2 0.5)
   = 1 bit
→ If the coin is loaded to give 99% heads:
   H(Loaded) = −(0.99 log2 0.99 + 0.01 log2 0.01)
   ≈ 0.08 bits
B(q): Entropy of a Boolean random variable
→ True with probability q
→ B(q) = −(q log2 q + (1 − q) log2 (1 − q))
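As a quick numeric check of the coin-flip values above, a short Python snippet (the name boolean_entropy is just a convenient label for B(q)):

import math

def boolean_entropy(q):
    # B(q): entropy in bits of a Boolean variable that is true with probability q
    if q in (0.0, 1.0):              # treat 0 * log2(0) as 0
        return 0.0
    return -(q * math.log2(q) + (1 - q) * math.log2(1 - q))

print(boolean_entropy(0.5))          # fair coin   -> 1.0 bit
print(boolean_entropy(0.99))         # loaded coin -> about 0.0808 bits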
Decision Tree Learning
→ A training set (E) has p positive and n negative examples
→ The entropy of the goal attribute on the whole set:
   H(Goal) = B(p / (p + n))
Choosing the attribute tests
→ The greedy search used to approximately minimize the depth of the final tree.
An attribute A
→ has d distinct values
→ The values divide the training set E into subsets E_1, ..., E_d
Each subset E_k has
→ p_k positive examples and n_k negative examples
→ In that branch
→ We need an additional B(p_k / (p_k + n_k)) bits of information to answer the
question
→ A randomly chosen example has the k-th value of the attribute A
with probability (p_k + n_k) / (p + n)
For attribute A
→ The expected entropy remaining after testing attribute A:
→ Remainder(A) = Σ_{k=1..d} (p_k + n_k) / (p + n) · B(p_k / (p_k + n_k))
Information gain from the attribute test
→ Expected reduction in entropy
→ Gain(A) = B(p / (p + n)) − Remainder(A)
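Both quantities can be computed directly from the per-branch counts (p_k, n_k). The sketch below assumes the counts are supplied as a list of pairs; the sample call uses the Wind split from the table below, so the printed value should match the worked example at the end of this section.

import math

def B(q):
    # Entropy (bits) of a Boolean variable that is true with probability q
    return 0.0 if q in (0.0, 1.0) else -(q * math.log2(q) + (1 - q) * math.log2(1 - q))

def remainder(branches):
    # branches = [(p_k, n_k), ...]; expected entropy left after testing the attribute
    total = sum(p + n for p, n in branches)
    return sum((p + n) / total * B(p / (p + n)) for p, n in branches)

def gain(branches):
    # Gain(A) = B(p / (p + n)) - Remainder(A)
    p = sum(pk for pk, _ in branches)
    n = sum(nk for _, nk in branches)
    return B(p / (p + n)) - remainder(branches)

# Wind from the Play Tennis table below: Weak branch (6+, 2-), Strong branch (3+, 3-)
print(round(gain([(6, 2), (3, 3)]), 3))   # -> 0.048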
Example:
Day Outlook Temperature Humidity Wind Play Tennis
X1 Sunny Hot High Weak No
X2 Sunny Hot High Strong No
X3 Overcast Hot High Weak Yes
X4 Rain Mild High Weak Yes
X5 Rain Cool Normal Weak Yes
X6 Rain Cool Normal Strong No
X7 Overcast Cool Normal Strong Yes
X8 Sunny Mild High Weak No
X9 Sunny Cool Normal Weak Yes
X10 Rain Mild Normal Weak Yes
X11 Sunny Mild Normal Strong Yes
X12 Overcast Mild High Strong Yes
X13 Overcast Hot Normal Weak Yes
X14 Rain Mild High Strong No
A decision tree
Entropy measures homogeneity of examples
→ 9 positive and 5 negative examples: p = 9, n = 5
→ Entropy: Entropy(S) = −(9/14) log2 (9/14) − (5/14) log2 (5/14)
   = 0.940
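A one-line check of this value in Python:

import math

p, n = 9, 5
entropy_S = -(p / (p + n)) * math.log2(p / (p + n)) - (n / (p + n)) * math.log2(n / (p + n))
print(f"{entropy_S:.3f}")   # -> 0.940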
Boolean Classification
[Plot: entropy of a Boolean classification as a function of the proportion of positive examples]
→ Entropy is 0 if all examples belong to the same class
→ Entropy is 1 if half of the examples belong to each class
→ Otherwise it is between 0 and 1
Entropy
→ If the target attribute can take on c different values, then the entropy of S
(the set of examples) relative to this c-wise classification is defined as
   Entropy(S) = Σ_{i=1..c} −p_i log2 p_i
   where p_i is the proportion of S belonging to class i
Information Gain Measures
→ The expected reduction in Entropy
→ Caused by partitioning the examples according to this attribute
→ S = set of examples and A = attribute
→ Gain(S, A) = Entropy(S) − Σ_{v ∈ Values(A)} (|S_v| / |S|) Entropy(S_v)
→ Values(A)
→ Set of all possible values for attribute A
→ S_v
→ Subset of S for which attribute A has value v
Entropy(S)
→ Entropy of the original collection S
→ The second term, Σ_{v ∈ Values(A)} (|S_v| / |S|) Entropy(S_v)
→ Expected value of the entropy after S is partitioned using attribute A
→ Sum of the entropies of each subset S_v, weighted by the fraction |S_v| / |S| of examples
that belong to S_v
Gain(S, A)
→ Expected reduction in entropy caused by knowing the value of attribute A
→ The information provided about the target function value, given the value of some
other attribute A
→ The value of Gain(S, A) is the number of bits saved when encoding the target value of
an arbitrary member of S, by knowing the value of attribute A
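A compact sketch of Gain(S, A) computed from a list of labelled examples; the entropy function is the multi-class form given above, and the dict-based representation of examples (attribute names as keys) is an assumption made for illustration.

import math
from collections import Counter

def entropy(labels):
    # Entropy(S) = sum over classes i of -p_i * log2(p_i)
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def information_gain(examples, attribute, target):
    # Gain(S, A) = Entropy(S) - sum over v in Values(A) of (|S_v| / |S|) * Entropy(S_v)
    labels = [e[target] for e in examples]
    expected_after = 0.0
    for v in {e[attribute] for e in examples}:
        subset = [e[target] for e in examples if e[attribute] == v]
        expected_after += len(subset) / len(examples) * entropy(subset)
    return entropy(labels) - expected_after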
→ Application
→ Values(Wind) = {Weak, Strong}
→ S: (p = 9, n = 5)
→ S_weak: (p = 6, n = 2)
→ S_strong: (p = 3, n = 3)
→ Gain(S, Wind)
   = Entropy(S) − Σ_{v ∈ {Weak, Strong}} (|S_v| / |S|) Entropy(S_v)
   = Entropy(S) − (8/14) Entropy(S_weak) − (6/14) Entropy(S_strong)
   = 0.940 − (8/14)(0.811) − (6/14)(1.00)
   = 0.048
Values(Humidity) = {High, Normal}
→ S: (p = 9, n = 5)
→ S_high: (p = 3, n = 4)
→ S_normal: (p = 6, n = 1)
Gain(S, Humidity)
   = Entropy(S) − Σ_{v ∈ {High, Normal}} (|S_v| / |S|) Entropy(S_v)
   = Entropy(S) − (7/14) Entropy(S_high) − (7/14) Entropy(S_normal)
   = 0.940 − (7/14)(0.985) − (7/14)(0.592)
   = 0.151
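A self-contained check of both gains against the counts read off the Play Tennis table. Note that full-precision arithmetic gives 0.152 for Humidity; the 0.151 above comes from rounding the intermediate entropies to three decimals.

import math

def B(q):
    # Entropy (bits) of a Boolean variable that is true with probability q
    return 0.0 if q in (0.0, 1.0) else -(q * math.log2(q) + (1 - q) * math.log2(1 - q))

def gain(branches):
    # branches = [(p_k, n_k), ...] for each value of the attribute
    p = sum(pk for pk, _ in branches)
    n = sum(nk for _, nk in branches)
    rem = sum((pk + nk) / (p + n) * B(pk / (pk + nk)) for pk, nk in branches)
    return B(p / (p + n)) - rem

print(f"Gain(S, Wind)     = {gain([(6, 2), (3, 3)]):.3f}")   # -> 0.048
print(f"Gain(S, Humidity) = {gain([(3, 4), (6, 1)]):.3f}")   # -> 0.152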