Exercise Classification

The document discusses computing Gini indices for attributes in a training data set to determine the best attribute for a decision tree classifier, with Customer ID having the lowest Gini but not being a good attribute choice due to overfitting.

Uploaded by

Lens New

Exercise

Consider the training examples shown in the following table for a binary classification problem.

(a) Compute the Gini index for the overall collection of training examples.
(b) Compute the Gini index for the Customer ID attribute.
(c) Compute the Gini index for the Gender attribute.
(d) Compute the Gini index for the Car Type attribute using a multiway split.
(e) Compute the Gini index for the Shirt Size attribute using a multiway split.
(f) Which attribute is better: Gender, Car Type, or Shirt Size?
(g) Explain why Customer ID should not be used as the attribute test condition even though it
has the lowest Gini index.
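
The calculations asked for in (a) through (e) can be sketched in Python as follows. This is a minimal illustration, not a solution to the exercise: the function names `gini` and `gini_split` are hypothetical, and the four-record data set is an invented toy example, not the exercise's table. The Gini index of a node is taken as one minus the sum of squared class fractions, and the Gini index of a multiway split as the size-weighted average over the child nodes.

```python
from collections import Counter


def gini(labels):
    """Gini index of one node: 1 minus the sum of squared class fractions."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def gini_split(values, labels):
    """Weighted Gini index of a multiway split on a single attribute."""
    # Group the class labels by attribute value (one child node per value).
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(value, []).append(label)
    n = len(labels)
    # Weight each child's Gini index by its share of the records.
    return sum(len(child) / n * gini(child) for child in groups.values())


# Hypothetical toy data (NOT the table from the exercise).
labels = ["C0", "C0", "C1", "C1"]
customer_id = [1, 2, 3, 4]          # unique per record
gender = ["M", "F", "M", "F"]

overall_gini = gini(labels)                   # part (a) style
id_gini = gini_split(customer_id, labels)     # part (b) style
gender_gini = gini_split(gender, labels)      # part (c) style

print(overall_gini, id_gini, gender_gini)
```

On this toy data the unique identifier yields a split Gini index of zero because every child node holds a single record, which is the effect part (g) asks about: the value is low only because each record is isolated, so the split says nothing about unseen customers.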
