UNIVERSITY OF PETROLEUM AND ENERGY STUDIES, DEHRADUN
Assignment #2, December 2020
Programme Name: B. Tech (CSE- AI/ML) Semester : III
Course Name : Machine Learning
Course Code : CSAI2005 Max. Marks : 30
Nos. of page(s) : 01
Instructions : Answer the following questions
S. No. Marks CO
Q1 Define:
a) Regression
6 CO1
b) Classification
c) Clustering
Q2 Explain the similarity and dissimilarity techniques during preprocessing of data
mining. 5 CO2
Q3 Briefly explain the rule Induction Using CHAID 5 CO4
Q4 Frequent pattern mining may generate many superfluous patterns. Therefore, it
important to develop methods that mine compressed patterns. Suppose a user
would like to obtain only k patterns (where k is a small integer). Outline an efficient
10 CO2
method that generates the k most representative patterns, where more distinct
patterns are referred over very similar patterns. Illustrate the effectiveness of your
method using a small data set.
Q5 Explain the Techniques to Improve Classification Accuracy.
5 CO3