DECISION TREE
Attribute selection measures
Attribute selection measures in decision trees include entropy, information gain, Gini index, gain
ratio, reduction in variance, and chi-square. These measures are also known as splitting rules.
Information gain
Measures how much information a feature provides about the class label. It is the decrease in entropy obtained by
splitting the dataset on that feature; the feature with the highest information gain is chosen for the split.
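As a rough illustration (the helper names and the toy labels below are made up for this sketch), information gain is the parent node's entropy minus the weighted entropy of the child nodes:

import numpy as np

def entropy(labels):
    # Shannon entropy of a list of class labels
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def information_gain(parent, children):
    # decrease in entropy after splitting `parent` into `children`
    n = len(parent)
    weighted = sum(len(c) / n * entropy(c) for c in children)
    return entropy(parent) - weighted

# toy example: 10 samples split into two child nodes
parent = ["yes"] * 6 + ["no"] * 4
left = ["yes"] * 5 + ["no"] * 1
right = ["yes"] * 1 + ["no"] * 3
print(information_gain(parent, [left, right]))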
Gini index
Also known as Gini impurity, it measures the probability that a randomly chosen sample from a node would be
misclassified if it were labelled at random according to the class distribution at that node.
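A minimal sketch of the same idea in code (the toy label lists are made up):

import numpy as np

def gini_impurity(labels):
    # probability of misclassifying a randomly drawn, randomly labelled sample
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

print(gini_impurity(["yes"] * 9 + ["no"] * 1))  # low impurity: nearly pure node
print(gini_impurity(["yes"] * 5 + ["no"] * 5))  # maximum impurity for two classes (0.5)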
Entropy
Measures the impurity (disorder) of a dataset; a pure node has entropy 0. Decision trees use it to decide where to
split the data.
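In formula form, for class proportions p_i at a node: Entropy = -sum(p_i * log2(p_i)). A 50/50 two-class node has
entropy 1 bit, while a pure node has entropy 0.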
Chi-square
Measures the statistical significance of the difference between a node and its child nodes; it is mainly used with
categorical features (as in the CHAID algorithm).
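A minimal sketch of the idea using scipy (the contingency table below is invented): a large chi-square statistic and a
small p-value indicate that the categorical feature and the class are strongly related, making it a good split candidate.

import numpy as np
from scipy.stats import chi2_contingency

# rows: values of a categorical feature, columns: class counts ("yes", "no")
observed = np.array([[30, 10],
                     [ 5, 35]])
chi2, p_value, dof, expected = chi2_contingency(observed)
print(chi2, p_value)  # large chi2 / small p-value -> feature is a useful splitter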
Tree-Pruning
When a decision tree is built to its full depth, it often overfits the training data. To combat overfitting, two
techniques are used: post-pruning and pre-pruning.
1. Post-pruning (Cost Complexity Pruning)
Post-pruning involves first allowing the decision tree to grow fully, and then removing parts of the tree that do
not improve its performance.
How it works:
1. Grow the Tree Fully:
The decision tree is initially constructed without any constraints, allowing it to overfit on the training data.
2. Evaluate Node Importance:
The tree is then evaluated to identify nodes and subtrees that do not contribute significantly to the accuracy of
the model.
3. Prune Subtrees:
Nodes that do not add significant value are converted into leaf nodes. For instance, if a node has 90% “Yes”
and 10% “No” outcomes, further splitting may not be beneficial, so the subtree is pruned.
4. Simplify the Tree:
Reducing tree complexity lowers overfitting while maintaining accuracy, which is particularly useful for
small datasets.
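In scikit-learn, post-pruning is exposed as cost complexity pruning via the ccp_alpha parameter. The sketch below
(using the iris toy dataset; the choice of alpha is illustrative, in practice it is usually tuned by cross-validation)
grows a full tree, inspects the candidate alpha values, and refits a pruned tree with one of them:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Grow the tree fully (no constraints), then compute the pruning path
full_tree = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
path = full_tree.cost_complexity_pruning_path(X_train, y_train)

# Refit with a non-zero ccp_alpha: larger alphas prune more aggressively
pruned_tree = DecisionTreeClassifier(random_state=0, ccp_alpha=path.ccp_alphas[-2])
pruned_tree.fit(X_train, y_train)

print(full_tree.tree_.node_count, pruned_tree.tree_.node_count)   # pruned tree is smaller
print(full_tree.score(X_test, y_test), pruned_tree.score(X_test, y_test))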
Pre-Pruning
• In pre-pruning, hyperparameters such as max_depth and max_features are set before the tree is
fully constructed to limit its growth.
• max_depth: Limits the maximum depth of the tree.
• max_features: Restricts the number of features considered for splitting at each node.
• This technique reduces the risk of overfitting by preventing the tree from growing too deep and
capturing noise in the data.
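A minimal pre-pruning sketch in scikit-learn (the specific hyperparameter values are illustrative, not
recommendations): the constraints are set before fitting, so the tree never grows past them.

from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

pre_pruned = DecisionTreeClassifier(
    max_depth=3,         # no root-to-leaf path longer than 3 splits
    max_features=2,      # consider at most 2 features when searching each split
    min_samples_leaf=5,  # another common pre-pruning knob: minimum samples per leaf
    random_state=0,
).fit(X, y)

print(pre_pruned.get_depth())  # never exceeds max_depth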