0% found this document useful (0 votes)

17 views1 page

Assignment3 Practice Classification

Uploaded by

nawaljaveria07

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views1 page

Assignment3 Practice Classification

Uploaded by

nawaljaveria07

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Assignment 3 Data Mining

Submission not required. Practice these questions, it will help in you the exam.
1. Suppose the proportion of class 1, 2 and 3 in dataset D1 is 0.4, 0.5, and 0.1, respectively and
that in dataset D2 is 0.7, 0.2, 0.1, respectively. Compute the entropy of these sets. Which dataset
is less impure?
2. Suppose we have a dataset D having 32 instances and 4 classes. What are the minimum and
maximum possible values for Info(D), i.e. uncertainty/entropy? Explain. Can you generalize
the range of entropy using this example? i.e., what will be the range of entropy if we have “n”
classes in the data set.
3. Given the data in Table 1, induce a decision tree classifier using ID3 algorithm. Performance
is the class attribute. Show the computation steps.

Table 1
Hostel Regularity Punctuality Smoker Performance
Yes Low Low Yes poor
No Low Medium No poor
Yes Medium High No medium
Yes Medium Low Yes poor
No High Medium Yes poor
No High High No excellent
No Medium High No excellent
No Medium High No medium

4. What is the accuracy of your decision tree classifier on the training data? Make a confusion
matrix.
5. Use your decision tree to predict the labels of following test examples.

yes,low,low,yes,?
yes,high,low,no,?
no,medium,high,no,?
no,medium,medium,no,?

6. Load your data in Table 1 into Weka and train a decision tree (ID3 or J48 classifier). (Prepare
separate arff files for train and test sets).
a. Report the accuracy on the training data including the output predictions by the
classifier.
b. Supply the test set in question 5 to your classifier, and generate output labels. Show
obtained predicted values.

7. Load the Iris dataset in Weka and compare performance of different classifiers on this dataset.
Try to find out the best classifier for this dataset. Report your results in a tabular form. Show
results for at least 3 classifiers and 4 test settings (training data as test, 5-fold cross validation,
10-fold cross validation, 70:30 train test split)
8. Reduce dimensionality of the Iris dataset and compare classification results on original and
reduced dataset. Do you observe any difference in classification accuracy?
9. What is the relationship between decision tree classifier and random forest classifier?
10. Explore SMOTE, a data imbalance handling method.

P02 DecisionTrees SolutionNotes
No ratings yet
P02 DecisionTrees SolutionNotes
3 pages
Machine Learning CA 2
No ratings yet
Machine Learning CA 2
19 pages
Data Mining Assignment No. 1
No ratings yet
Data Mining Assignment No. 1
7 pages
ML Assignment
No ratings yet
ML Assignment
7 pages
P02 DecisionTrees
No ratings yet
P02 DecisionTrees
2 pages
Lecture 6 - Decision Trees
No ratings yet
Lecture 6 - Decision Trees
43 pages
Soft Computing Lab Practical Assignment 2
No ratings yet
Soft Computing Lab Practical Assignment 2
10 pages
M01 Tree-Based Methods
No ratings yet
M01 Tree-Based Methods
38 pages
Data Science
No ratings yet
Data Science
8 pages
Decision Tree
No ratings yet
Decision Tree
5 pages
ML Lab Record2
No ratings yet
ML Lab Record2
42 pages
Draft Xai
No ratings yet
Draft Xai
16 pages
ML Questions
No ratings yet
ML Questions
9 pages
ML Exam Solutions
No ratings yet
ML Exam Solutions
6 pages
ES335
No ratings yet
ES335
22 pages
Department of Electronics & Telecommunications Engineering: ETEL71A-Machine Learning and AI
No ratings yet
Department of Electronics & Telecommunications Engineering: ETEL71A-Machine Learning and AI
4 pages
Data Mining Journal 4 Kashan
No ratings yet
Data Mining Journal 4 Kashan
8 pages
UCS622
No ratings yet
UCS622
1 page
CE880 Lecture7 Slides
No ratings yet
CE880 Lecture7 Slides
78 pages
CAP3770 Lab#4 DecsionTree Sp2017
No ratings yet
CAP3770 Lab#4 DecsionTree Sp2017
4 pages
25 Questions To Test Your Skills On Decision Trees
No ratings yet
25 Questions To Test Your Skills On Decision Trees
10 pages
08 Class Basic
No ratings yet
08 Class Basic
103 pages
Data Mining Classification Models
No ratings yet
Data Mining Classification Models
5 pages
Types of Pruning Techniques
No ratings yet
Types of Pruning Techniques
10 pages
DTC Algorithm Implementation Guide
No ratings yet
DTC Algorithm Implementation Guide
7 pages
ID3 Decision Tree Algorithm Example
No ratings yet
ID3 Decision Tree Algorithm Example
8 pages
Understanding Decision Trees in Classification
100% (1)
Understanding Decision Trees in Classification
58 pages
5b Python Implementation of Decision Tree
No ratings yet
5b Python Implementation of Decision Tree
7 pages
Class 2a-Decision Trees
No ratings yet
Class 2a-Decision Trees
28 pages
ML Assignment 1 - Nageswar
No ratings yet
ML Assignment 1 - Nageswar
7 pages
Aih Exp 2
No ratings yet
Aih Exp 2
8 pages
MLT Experiment 3
No ratings yet
MLT Experiment 3
3 pages
3 1 Overfitting
No ratings yet
3 1 Overfitting
25 pages
DWM 06
No ratings yet
DWM 06
4 pages
BigData Week13
No ratings yet
BigData Week13
62 pages
ML Unit3 Qna
No ratings yet
ML Unit3 Qna
3 pages
MODULE 3 Classification
No ratings yet
MODULE 3 Classification
5 pages
Classification
No ratings yet
Classification
148 pages
Decision Tree Final
No ratings yet
Decision Tree Final
2 pages
Decision Tree Algorithm in Healthcare AI
No ratings yet
Decision Tree Algorithm in Healthcare AI
10 pages
Bayes and Decision Tree
No ratings yet
Bayes and Decision Tree
36 pages
Decision Tree Based Id3 Algorithm
No ratings yet
Decision Tree Based Id3 Algorithm
2 pages
Final Data Mining 2023
No ratings yet
Final Data Mining 2023
5 pages
Structured Data Classification
No ratings yet
Structured Data Classification
3 pages
ML Mid Question Solve
No ratings yet
ML Mid Question Solve
19 pages
DWM - Module 3
No ratings yet
DWM - Module 3
22 pages
Final Exam, Data Mining (CEN 871) : Name Surname: Student's ID
No ratings yet
Final Exam, Data Mining (CEN 871) : Name Surname: Student's ID
2 pages
Diabetes Case Study - Jupyter Notebook
100% (1)
Diabetes Case Study - Jupyter Notebook
10 pages
Unit II Part 1
No ratings yet
Unit II Part 1
62 pages
UNIT II 2.1 ML Decision Tree Learning
No ratings yet
UNIT II 2.1 ML Decision Tree Learning
55 pages
DM Lect 9 - Classification - Decision Trees
No ratings yet
DM Lect 9 - Classification - Decision Trees
39 pages
Machine Learning Assignment Questions
No ratings yet
Machine Learning Assignment Questions
4 pages
Big Data Lesson 5 Lucrezia Noli
No ratings yet
Big Data Lesson 5 Lucrezia Noli
30 pages
Decision Trees
No ratings yet
Decision Trees
53 pages
L8 1 Decisiontrees Random Forest
No ratings yet
L8 1 Decisiontrees Random Forest
118 pages
21CS54 QB Test3
No ratings yet
21CS54 QB Test3
2 pages
Dataset Guidelines for Text Classification
No ratings yet
Dataset Guidelines for Text Classification
3 pages
Midterm
No ratings yet
Midterm
4 pages
Design of An Embedded Machine Learning Based System For An Environmental-Friendly Crop Prediction Using A Sustainable Soil Fertility Management
No ratings yet
Design of An Embedded Machine Learning Based System For An Environmental-Friendly Crop Prediction Using A Sustainable Soil Fertility Management
6 pages
INT354 Lecture 0
No ratings yet
INT354 Lecture 0
33 pages
new-Guidelines-Datamining-I-UGCF-DSE-CS Hons-Sem 4-Jan 25
No ratings yet
new-Guidelines-Datamining-I-UGCF-DSE-CS Hons-Sem 4-Jan 25
3 pages
The Role of Artificial Intelligence in Cyber Security
No ratings yet
The Role of Artificial Intelligence in Cyber Security
24 pages
Repost Master Python? in 15 Days??
No ratings yet
Repost Master Python? in 15 Days??
17 pages
Btad 617
No ratings yet
Btad 617
10 pages
AI TOP Utility 4.0 - User Manual
No ratings yet
AI TOP Utility 4.0 - User Manual
59 pages
Improved Skin Cancer Detection With 3D Total Body
No ratings yet
Improved Skin Cancer Detection With 3D Total Body
21 pages
Deep Learning
No ratings yet
Deep Learning
38 pages
5 - Comparative Study of Crop Yield Prediction Using Explainable AI and Interpretable Machine Learning Techniques
No ratings yet
5 - Comparative Study of Crop Yield Prediction Using Explainable AI and Interpretable Machine Learning Techniques
7 pages
Predictive Maintenance with RNNs
No ratings yet
Predictive Maintenance with RNNs
9 pages
Murat Durmus - A Primer To The 42 Most Commonly Used Machine Learning Algorithms (With Code Samples) - Leanpub (2023)
No ratings yet
Murat Durmus - A Primer To The 42 Most Commonly Used Machine Learning Algorithms (With Code Samples) - Leanpub (2023)
192 pages
Understanding SHAP Values in ML Models
No ratings yet
Understanding SHAP Values in ML Models
12 pages
Artificial Intelligence and Acute Appendicitis A Systematic Review of Diagnostic and Prognostic ModelsWorld Journal of Emergency Surgery
No ratings yet
Artificial Intelligence and Acute Appendicitis A Systematic Review of Diagnostic and Prognostic ModelsWorld Journal of Emergency Surgery
31 pages
Machine Learning for Spam Detection
No ratings yet
Machine Learning for Spam Detection
8 pages
DWM Lab Workbook Sample
No ratings yet
DWM Lab Workbook Sample
10 pages
Credit Card Fraud Detection Techniques Survey
No ratings yet
Credit Card Fraud Detection Techniques Survey
8 pages
Writing An Impressive Letter of Intent For Turkiye Scholarship
No ratings yet
Writing An Impressive Letter of Intent For Turkiye Scholarship
5 pages
Intechposter Format New
No ratings yet
Intechposter Format New
1 page
Clustering Techniques To Identify Low-Engagement Student Levels
No ratings yet
Clustering Techniques To Identify Low-Engagement Student Levels
10 pages
Voluntary Ai Safety Standard
No ratings yet
Voluntary Ai Safety Standard
69 pages
Prediction of Diseases Using Random Forest
No ratings yet
Prediction of Diseases Using Random Forest
8 pages
Machine Learning Algorithm Demos
No ratings yet
Machine Learning Algorithm Demos
31 pages
Foundation Models For Generalist Medical Artificial Intelligence
No ratings yet
Foundation Models For Generalist Medical Artificial Intelligence
7 pages
Efficient and Accurate Explanation Estimation With Distribution Compression
No ratings yet
Efficient and Accurate Explanation Estimation With Distribution Compression
30 pages
Loan Approval PDF
No ratings yet
Loan Approval PDF
60 pages
Smart Crop Recommendation System: Internship Report
No ratings yet
Smart Crop Recommendation System: Internship Report
22 pages
Adversarial Machine Learning Attacks and Defense Methods in The Cyber Security Domain
No ratings yet
Adversarial Machine Learning Attacks and Defense Methods in The Cyber Security Domain
57 pages

Assignment3 Practice Classification

Uploaded by

Assignment3 Practice Classification

Uploaded by

Assignment 3 Data Mining

You might also like