DMML Lab Syllabus

Uploaded by

nhce.chempa2023
Copyright
© All Rights Reserved

DATA MINING AND MACHINE LEARNING LAB

Course Code   22CSL61    CIE Marks     50
L:T:P:S       0:0:1:0    SEE Marks     50
Hrs / Week    02         Total Marks   100
Credits       01         Exam Hours    03
Course outcomes:
At the end of the course, the student will be able to:
22CSL61.1 Understand the existing data preprocessing methods to analyze the datasets.
22CSL61.2 Apply association rule mining to transaction databases.
22CSL61.3 Analyze datasets for constructing classifiers using machine learning algorithms.
22CSL61.4 Evaluate machine learning models for predicting continuous data.
Mapping of Course Outcomes to Program Outcomes and Program Specific Outcomes:
CO         PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2
22CSL61.1  3   3   3   3   2   -   -   -   1   1    -    2    3    3
22CSL61.2  3   3   3   3   2   -   -   -   1   1    -    2    3    3
22CSL61.3  3   3   3   3   2   -   -   -   1   1    -    2    3    3
22CSL61.4  3   3   3   3   2   -   -   -   1   1    -    2    3    3

Program No.   List of Programs   Hours   COs

Prerequisite Experiments / Programs / Demo (Hours: 2, COs: NA)
• Demo on installing Jupyter Notebook, including uploading datasets.
• Demo on using Google Colab to upload datasets and mount datasets from Google Drive.
• Introduction to the Python packages NumPy, Pandas, scikit-learn, Matplotlib, Seaborn, etc.
PART-A
1. Given a dataset, analyze whether there is missing data in the dataset and handle it with different data preprocessing methods. (Hours: 2, COs: 22CSL61.1)
2. Given a dataset, perform the required data standardization and normalization on the data. (Hours: 2, COs: 22CSL61.1)
3. Explore label encoding and other encoding methods on various attributes of the data. (Hours: 2, COs: 22CSL61.1)
4. Apply oversampling, undersampling, and the SMOTE algorithm to handle an imbalanced dataset. (Hours: 2, COs: 22CSL61.1)
5. Implement the Apriori algorithm to identify the frequent itemsets and association rules from suitable transaction data. (Hours: 2, COs: 22CSL61.2)
6. Implement the FP-Growth (FP-tree) algorithm to identify the frequent itemsets and association rules from suitable transaction data. (Hours: 2, COs: 22CSL61.2)
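As an illustration of Program 5, the following is a minimal sketch of Apriori in plain Python, not a prescribed solution. The toy transactions, the thresholds, and the function names `apriori` and `association_rules` are assumptions made for this example; a lab submission would normally read real transaction data and might use a library instead.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {frozenset(itemset): support} for every frequent itemset."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        # Fraction of transactions that contain every item of the itemset.
        return sum(1 for t in transactions if itemset <= t) / n

    # Level 1: frequent single items.
    items = {item for t in transactions for item in t}
    current = {frozenset([i]) for i in items}
    current = {c for c in current if support(c) >= min_support}

    frequent = {}
    k = 1
    while current:
        for c in current:
            frequent[c] = support(c)
        k += 1
        # Join step: combine frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in frequent for s in combinations(c, k - 1))
        }
        current = {c for c in candidates if support(c) >= min_support}
    return frequent


def association_rules(frequent, min_confidence):
    """Return (antecedent, consequent, confidence) triples."""
    rules = []
    for itemset, sup in frequent.items():
        for r in range(1, len(itemset)):
            for antecedent in combinations(sorted(itemset), r):
                antecedent = frozenset(antecedent)
                confidence = sup / frequent[antecedent]
                if confidence >= min_confidence:
                    rules.append((antecedent, itemset - antecedent, confidence))
    return rules


# Hypothetical toy transaction data, used only for illustration.
toy_transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "butter"},
    {"bread", "milk", "butter"},
]

frequent_itemsets = apriori(toy_transactions, min_support=0.5)
rules = association_rules(frequent_itemsets, min_confidence=0.7)
for antecedent, consequent, confidence in rules:
    print(sorted(antecedent), "->", sorted(consequent), round(confidence, 2))
```

The sketch rescans all transactions for each candidate, which is acceptable for small lab datasets but is exactly the cost that FP-Growth (Program 6) is designed to avoid.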
PART-B
7. Write a program to demonstrate the working of the decision tree based ID3 algorithm. Use an appropriate data set for building the decision tree and apply this knowledge to classify a new sample. (Hours: 2, COs: 22CSL61.2)
8. Write a program to implement the naïve Bayesian classifier for a sample training data set stored as a .CSV file. Compute the accuracy of the classifier, considering a few test data sets. (Hours: 2, COs: 22CSL61.2)
9. Write a program to implement the support vector machine classifier for a sample training data set stored as a .CSV file. Compute the accuracy of the classifier, considering a few test data sets. (Hours: 2, COs: 22CSL61.3)
10. Write a program to implement the k-Nearest Neighbour algorithm to classify the iris data set. Print both correct and wrong predictions. Java/Python ML library classes can be used for this problem. (Hours: 2, COs: 22CSL61.3)
11. Build an Artificial Neural Network by implementing the Backpropagation algorithm and test the same using appropriate data sets. (Hours: 2, COs: 22CSL61.4)
12. Build a classifier using any ensemble learning method and compare the results against the classic learning models. (Hours: 2, COs: 22CSL61.4)
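As an illustration of Program 10, the following is a minimal sketch of k-Nearest Neighbour classification in plain Python that prints correct and wrong predictions. It is an assumption-laden example: the two-dimensional toy points stand in for the iris data set, and the function names `knn_predict` and `evaluate` are hypothetical. The syllabus also permits ML library classes for this program.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest points."""
    # Pair each training point's Euclidean distance to the query with its label.
    distances = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]


def evaluate(train_X, train_y, test_X, test_y, k=3):
    """Return per-sample results and the overall accuracy."""
    results = []
    for query, actual in zip(test_X, test_y):
        predicted = knn_predict(train_X, train_y, query, k)
        results.append((query, actual, predicted, predicted == actual))
    accuracy = sum(1 for r in results if r[3]) / len(results)
    return results, accuracy


# Hypothetical toy data: two well-separated clusters labelled "A" and "B".
train_X = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
train_y = ["A", "A", "A", "B", "B", "B"]
test_X = [(1.1, 1.0), (5.1, 5.0), (3.2, 3.2)]
test_y = ["A", "B", "A"]  # the last label is deliberately hard to get right

results, accuracy = evaluate(train_X, train_y, test_X, test_y, k=3)
for query, actual, predicted, correct in results:
    status = "Correct" if correct else "Wrong"
    print(f"{status}: sample={query} actual={actual} predicted={predicted}")
print(f"Accuracy: {accuracy:.2f}")
```

For the actual lab, the same loop can be run over a train/test split of the iris data set; only the data loading changes.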
PART-C
Beyond Syllabus Virtual Lab Content
(To be done during Lab but not to be included for CIE or SEE)
Implement and demonstrate the FIND-S algorithm for finding the most specific hypothesis based on a given set of
training data samples. Read the training data from a .CSV file.
For a given set of training data examples stored in a .CSV file, implement and demonstrate the Candidate-Elimination
algorithm to output a description of the set of all hypotheses consistent with the training examples.
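As an illustration of the FIND-S exercise, the following is a minimal sketch in plain Python. It assumes that the last CSV column holds the class label, that "yes" marks a positive example, and that "?" denotes "any value"; the inline CSV text stands in for a .CSV file on disk, and the function name `find_s` is hypothetical.

```python
import csv
import io


def find_s(examples, positive_label="yes"):
    """Return the most specific hypothesis consistent with the positive examples.

    Each example is a sequence of attribute values followed by a class label.
    Returns None if there are no positive examples.
    """
    hypothesis = None
    for *attributes, label in examples:
        if label != positive_label:
            continue  # FIND-S ignores negative examples
        if hypothesis is None:
            # Start from the first positive example (most specific choice).
            hypothesis = list(attributes)
        else:
            # Generalize only the attributes that disagree with this example.
            hypothesis = [
                h if h == a else "?" for h, a in zip(hypothesis, attributes)
            ]
    return hypothesis


# Assumed stand-in for a .CSV file: attribute values followed by the label.
csv_text = """sunny,warm,normal,strong,warm,same,yes
sunny,warm,high,strong,warm,same,yes
rainy,cold,high,strong,warm,change,no
sunny,warm,high,strong,cool,change,yes
"""

rows = [[cell.strip() for cell in row] for row in csv.reader(io.StringIO(csv_text)) if row]
print(find_s(rows))
```

To read from an actual file, `io.StringIO(csv_text)` can be replaced with an opened file object. Candidate-Elimination extends this idea by also maintaining a general boundary that is specialized by the negative examples.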
CIE Assessment Pattern (50 Marks – Lab)
RBT Levels      Weekly Assessment (20)   Test(s) (30)
L1 Remember     -                        -
L2 Understand   5                        5
L3 Apply        5                        10
L4 Analyze      5                        10
L5 Evaluate     5                        5
L6 Create       -                        -
SEE Assessment Pattern (50 Marks – Lab)
RBT Levels      Exam Marks Distribution (50)
L1 Remember     -
L2 Understand   10
L3 Apply        20
L4 Analyze      10
L5 Evaluate     10
L6 Create       -
Suggested Learning Resources:
Text Books:
1. “Python Machine Learning: Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow 2”
by Sebastian Raschka and Vahid Mirjalili, ISBN 978-1789955750, 3rd Edition, 2019.
Reference Books:
1. “Machine Learning with Python Cookbook: Practical Solutions from Preprocessing to Deep Learning” by Kyle
Gallatin and Chris Albon, ISBN 978-1098135720, O'Reilly Media, 2023.
2. “Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked Examples, and Case Studies”
by John D. Kelleher, Brian Mac Namee, and Aoife D'Arcy, ISBN 9780262044691, The MIT Press, 2020.
