SEMESTER-V
COURSE 12: SUPERVISED ML WITH PYTHON
Theory Credits: 3 3 hrs/week
Aim and objectives of Course:
The purpose of this course is to serve as an introduction to Supervisedmachine learning with
Python.
We will explore several classifications, regression algorithms and see howthey can help
us perform a variety of Supervised machine learning tasks.
Learning outcomes of Course:
Able to understand introduction to machine learning concepts.
Able to Loading datasets, build models and model persistence.
Understand Feature extraction from data sets.
Able to do Regression & Classification.
Able to compare SVM with other
classifiers. UNIT I:
Machine Learning Basics: What is machine learning? Key terminology, Key tasksof machine
learning, How to choose right algorithm, steps in developing a machinelearning, why python?
Getting started with Numpy library Classifying with k- Nearest Neighbors: The k-Nearest
Neighbors classification algorithm, Parsing and importing data from a text file, Creating
scatter plots with Matplotlib, Normalizingnumeric values
UNIT II:
Splitting datasets one feature at a time-Decision trees: Introducing decision trees, measuring
consistency in a dataset, using recursion to construct a decision tree, plotting trees in
Matplotlib
UNIT III:
Classifying with probability theory-Naïve Bayes: Using probability distributions for
classification, learning the naïve Bayes classifier, Parsing data from RSS feeds, using naïve
Bayes to reveal regional attitudes
UNIT IV:
Logistic regression: Classification with logistic regression and the sigmoidfunction, Using
optimization to find the best regression coefficients, the gradientdescent optimization
algorithm, Dealing with missing values in the our data UNIT V:
Support vector machines: Introducing support vector machines, using the SMO algorithm for
optimization, using kernels to “transform” data, Comparing support vector machines with
other classifiers
TEXT BOOK:
1. Machine learning in action, Peter Harrington by Manning publications
Supervised ML with Python Lab
SEMESTER-V
COURSE 12: SUPERVISED ML WITH PYTHON
Theory Credits: 3 3 hrs/week
SEMESTER-V
COURSE 12: SUPERVISED ML WITH PYTHON
Practical Credits: 1 2 hrs/week
Details of Lab/Practical/Experiments/Tutorials syllabus:
1. Implement and demonstrate the FIND-S algorithm for finding the most specific hypothesis
based on a given set of training data samples. Read the training data from a
CSV file. For a given set of training data examples stored in a .CSV
file, implement anddemonstrate the Candidate-Elimination algorithm to output a description
of the set of all hypotheses consistent with the training examples.
2. Write a program to demonstrate the working of the decision tree based ID3 algorithm.
3. Write a program to implement the naïve Bayesian classifier for a sample training data set
stored as a CSV file.
4. Assuming a set of documents that need to be classified, use the naïve BayesianClassifier
model to perform this task. Built-in Java classes/API can be used towrite the program.
Calculate the accuracy, precision, and recall for your dataset.