DWM Assignment Ques

Uploaded by

yashshende208

CHAPTER 1

5 MARKS
SR. NO QUESTION

1 Difference between OLTP and OLAP
YEAR: MAY_22, MAY_23, DEC_23

2 Every data structure in the data warehouse contains the time element. Why?
YEAR: DEC_22, DEC_24

3 What are the basic building blocks of a data warehouse?
YEAR: MAY_23

4 Difference between ER modeling and dimensional modeling
YEAR: DEC_24

10 MARKS
SR. NO QUESTION

1 Suppose that a data warehouse consists of the three dimensions time, doctor and patient, and the two measures count and charge, where charge is the fee that a doctor charges a patient for a visit.
i) Draw a star schema diagram for the above data warehouse.
ii) Starting with the base cuboid [day, doctor, patient], what specific OLAP operations should be performed in order to list the total fee collected by each doctor in 2010?
iii) To obtain the same list, write an SQL query assuming the data are stored in a relational database with the schema fee (day, month, year, doctor, hospital, patient, count, charge).
YEAR: DEC_19

2 The college wants to record the marks for the courses completed by students using the dimensions a) Course b) Student c) Time and a measure of aggregate marks. Create a cube and describe the following operations: i) Roll up ii) Drill down iii) Slice iv) Dice
YEAR: MAY_22, DEC_23

3 Consider the quarterly sales of four companies C1, C2, C3, C4. The dimensions are:
a) Time
b) Shopping category (Mens, Womens, Electronics, Home)
c) Company
Create a cube and describe all five OLAP operations.
YEAR: DEC_22

4 For a supermarket chain, consider the dimensions Product, Store, Time and Promotion. The schema contains the three facts units_sales, dollar_sales, and cost_dollar. Design a star schema and calculate the maximum number of base fact table records for the values given below:
Time period: 5 years
Stores: 300 reporting daily sales
Product: 40000 products in each store (about 4000 sell daily in each store)
Promotion: a sold item may be in only one promotion in a store on a given day.
YEAR: DEC_22, DEC_24

5 Differentiate between star schema and snowflake schema. Design a star schema for company sales with three dimensions such as Location, Item and Time.
YEAR: MAY_23

6 What is dimensional modeling? Design the data warehouse dimensional model for a wholesale furniture company. The data warehouse has to analyze the company's situation at least with respect to Furniture, Customer, and Time. Moreover, the company needs to analyze the furniture with respect to its type, category, and material, and the customer with respect to their spatial location, by considering at least cities, regions, and states. The company is interested in learning the quantity, income, and discount of its sales.
YEAR: DEC_23
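
For part iii) of the doctor/patient question in Chapter 1, one possible query can be sketched and checked with Python's built-in sqlite3 module. Only the fee schema comes from the question; the sample rows, the doctor names, and the helper total_fee_by_doctor are hypothetical. The query mirrors the OLAP steps asked for in part ii): roll up time from day to year, roll up patient to all, and slice on year = 2010.

```python
import sqlite3

# Hypothetical sample rows: (day, month, year, doctor, hospital, patient, count, charge).
# Only the column layout is taken from the question.
SAMPLE_ROWS = [
    (1, 1, 2010, "Dr. A", "H1", "P1", 1, 100.0),
    (2, 1, 2010, "Dr. A", "H1", "P2", 1, 150.0),
    (3, 2, 2010, "Dr. B", "H2", "P3", 1, 200.0),
    (4, 2, 2009, "Dr. B", "H2", "P1", 1, 999.0),  # not in 2010, must be ignored
]

# Total fee collected by each doctor in 2010.
QUERY = """
SELECT doctor, SUM(charge) AS total_fee
FROM fee
WHERE year = 2010
GROUP BY doctor
ORDER BY doctor
"""

def total_fee_by_doctor(rows):
    """Load rows into an in-memory fee table and run the query."""
    con = sqlite3.connect(":memory:")
    con.execute(
        'CREATE TABLE fee (day INTEGER, month INTEGER, year INTEGER, '
        'doctor TEXT, hospital TEXT, patient TEXT, "count" INTEGER, charge REAL)'
    )
    con.executemany("INSERT INTO fee VALUES (?, ?, ?, ?, ?, ?, ?, ?)", rows)
    result = con.execute(QUERY).fetchall()
    con.close()
    return result

totals = total_fee_by_doctor(SAMPLE_ROWS)
print(totals)
```

The GROUP BY on doctor alone is what collapses the patient and day levels; the WHERE clause plays the role of the slice.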
CHAPTER 2

5 MARKS
SR. NO QUESTION

1 Explain major issues in data mining.
YEAR: MAY_22, MAY_23, MAY_24

2 Applications of data mining
YEAR: DEC_23

3 Short note on techniques of data loading
YEAR: DEC_23

10 MARKS
SR. NO QUESTION

1 Describe the steps involved in data mining when viewed as a process of knowledge discovery.
YEAR: DEC_19, MAY_23

2 Develop a model to predict the salary of college graduates with 10 years of work experience using linear regression.
YEAR: DEC_19

3 Suppose that the data for analysis includes the attribute salary. We have the following values for salary (in thousands of dollars), shown in increasing order: 30, 36, 47, 50, 52, 52, 56, 60, 63, 70, 70, 110
i) What are the mean, median, mode and midrange of the data?
ii) Find the first quartile (Q1) and the third quartile (Q3) of the data.
iii) Show a boxplot of the data.
YEAR: DEC_19

4 Discuss the different steps involved in data processing.
YEAR: MAY_22, DEC_23

5 Discuss the different types of attributes.
YEAR: DEC_22, DEC_24

6 Discuss different data visualization techniques.
YEAR: MAY_23, DEC_23

7 Explain the KDD process with a neat diagram. Also state any five applications of data mining.
YEAR: DEC_23, MAY_24
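
The salary question in Chapter 2 is plain arithmetic, so a short Python check may help when verifying a hand-worked answer. The data values are from the question. The quartile rule used here (median of the lower and upper halves) is an assumption: textbooks and libraries use several conventions and can give slightly different Q1 and Q3. The helper name quartiles_by_halves is hypothetical.

```python
from statistics import mean, median, multimode

# Salary values in thousands of dollars, as listed in the question.
salary = [30, 36, 47, 50, 52, 52, 56, 60, 63, 70, 70, 110]

def quartiles_by_halves(sorted_values):
    """Q1 and Q3 as medians of the lower and upper halves.

    Assumed convention; other quartile definitions exist.
    """
    n = len(sorted_values)
    lower = sorted_values[: n // 2]
    upper = sorted_values[(n + 1) // 2:]
    return median(lower), median(upper)

data = sorted(salary)
q1, q3 = quartiles_by_halves(data)

summary = {
    "mean": mean(data),
    "median": median(data),
    "modes": multimode(data),              # every value tied for most frequent
    "midrange": (data[0] + data[-1]) / 2,  # average of min and max
    "q1": q1,
    "q3": q3,
}

# Five-number summary that a boxplot is drawn from: min, Q1, median, Q3, max.
five_number = (data[0], q1, summary["median"], q3, data[-1])

print(summary)
print(five_number)
```

Under this convention the interquartile range is Q3 - Q1, and the usual 1.5 x IQR rule would mark the largest value as a candidate outlier when drawing the boxplot.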
CHAPTER 3

5 MARKS
SR. NO QUESTION

1 What are the various methods for estimating a classifier's accuracy?
YEAR: MAY_22, DEC_22, DEC_23

2 What are the various issues regarding classification and prediction?
YEAR: MAY_22

3 Explain the holdout and random subsampling methods to evaluate the accuracy of a classifier.
YEAR: DEC_24

10 MARKS
SR. NO QUESTION

1 Why is tree pruning useful in decision tree induction? What is a drawback of using a separate set of tuples to evaluate pruning?
YEAR: DEC_19

2 Apply the Naive Bayes classifier to classify the tuple <Red, SUV, Domestic> for the given dataset below.
YEAR: DEC_22

3 Explain the decision tree based classification approach with an example. Discuss metrics for evaluating classifier performance.
YEAR: MAY_23

4 A data sample is given below. Find whether Patient X has flu or not using the Naive Bayes classifier.
If X = (chills=Y, runny nose=N, headache=Mild, fever=Y, flu=?)
YEAR: DEC_23

5 Describe in detail how to evaluate the accuracy of a classifier.
YEAR: MAY_24

6 A company wants to predict whether a customer will subscribe to a premium membership based on their demographic and browsing behavior data. The dataset contains information about customers, including age, gender, income, browsing time, and subscription status.
YEAR: MAY_24

7 Given the training data for height classification, classify the tuple t = <Rohit, M, 1.95> using Naïve Bayes classification.
YEAR: DEC_24
CHAPTER 4

5 MARKS
SR. NO QUESTION

1 Explain the K-means clustering algorithm and draw a flowchart.
YEAR: DEC_23

2 Explain the FP-Growth algorithm.
YEAR: DEC_24

10 MARKS
SR. NO QUESTION

1 Show the dendrogram created by the complete link clustering algorithm for the given set of points.
YEAR: DEC_19

2 The table below shows the six data points. Apply agglomerative clustering to find clusters. Use the Euclidean distance measure. Consider single linkage.
YEAR: MAY_22

3 Suppose that the data mining task is to cluster the following points into 3 clusters: A1(2,10), A2(2,5), A3(8,4), B1(5,8), B2(7,5), B3(6,4), C1(1,2), C2(4,9). The distance function is Euclidean distance. Suppose we initially assign A1, B1, C1 as the center of each cluster respectively. Use the k-means algorithm to show only
a) The three cluster centers after the first round of execution
b) The final three clusters
YEAR: DEC_22

4 Use the agglomerative algorithm on the following data and plot a dendrogram using link approach. The following figure contains sample data items indicating the distance between the elements.
YEAR: DEC_22

5 Explain the K-means clustering algorithm. Discuss its advantages and limitations. Apply the K-means algorithm to the following data set with 3 clusters.
Data set = {2,3,6,8,9,12,15,18,22}
YEAR: MAY_23

6 Consider the data given below. Create the adjacency matrix. Apply the complete link algorithm to cluster the given data set and draw the dendrogram.
YEAR: MAY_23

7 The following table gives the fat and protein content of items. Apply single linkage clustering and construct the dendrogram.
YEAR: MAY_24

8 Consider four objects with two attributes (X and Y). These four objects are to be grouped together into two clusters using the k-means clustering algorithm. Following are the objects with their attribute values.
YEAR: DEC_24
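
The eight-point k-means question in Chapter 4 (initial centers A1, B1, C1) can be traced with a small pure-Python sketch. The points and the initial centers are from the question; the function names assign, recompute and kmeans are hypothetical, and keeping the old center for an empty cluster is an assumption that does not come into play for this data.

```python
from math import dist

# Points and initial centers as stated in the question.
POINTS = {
    "A1": (2, 10), "A2": (2, 5), "A3": (8, 4), "B1": (5, 8),
    "B2": (7, 5), "B3": (6, 4), "C1": (1, 2), "C2": (4, 9),
}
INITIAL = ["A1", "B1", "C1"]

def assign(points, centers):
    """Put each point into the cluster of its nearest center (Euclidean)."""
    clusters = [[] for _ in centers]
    for name, p in points.items():
        nearest = min(range(len(centers)), key=lambda i: dist(p, centers[i]))
        clusters[nearest].append(name)
    return clusters

def recompute(points, clusters, old_centers):
    """New center = mean of the members; an empty cluster keeps its old center."""
    centers = []
    for members, old in zip(clusters, old_centers):
        if not members:
            centers.append(old)
            continue
        xs = [points[m][0] for m in members]
        ys = [points[m][1] for m in members]
        centers.append((sum(xs) / len(xs), sum(ys) / len(ys)))
    return centers

def kmeans(points, centers):
    """Repeat assign/recompute until the assignment stops changing."""
    history, previous = [], None
    while True:
        clusters = assign(points, centers)
        if clusters == previous:
            return clusters, centers, history
        centers = recompute(points, clusters, centers)
        history.append(centers)
        previous = clusters

final_clusters, final_centers, history = kmeans(
    POINTS, [POINTS[name] for name in INITIAL]
)
first_round = history[0]
print("centers after round 1:", first_round)
print("final clusters:", final_clusters)
```

history[0] answers part a) and final_clusters answers part b); the same loop can be reused for the one-dimensional data set in question 5 by storing each value as a one-element tuple.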
CHAPTER 5

5 MARKS
SR. NO QUESTION

1 Elucidate market basket analysis with an example.
YEAR: DEC_19, DEC_22, MAY_24

10 MARKS
SR. NO QUESTION

1 Consider the transaction database given below. Use the Apriori algorithm with min-support count = 2 and min-confidence = 60% to find all frequent itemsets and strong association rules.

2 Demonstrate multidimensional and multilevel association rule mining with suitable examples.

3 A database has four transactions. Let min sup = 60% and min conf = 80%. Find all the frequent itemsets using the Apriori algorithm and also list all the strong association rules.

4 A database has five transactions. Let minimum support = 3. Find all frequent itemsets using the FP-growth algorithm.

5 Apply the Apriori algorithm on the following dataset to find strong association rules. Minimum support threshold (s = 33.33%) and minimum confidence threshold (c = 60%).

6 Use the Apriori algorithm with min-support count = 2 and min-confidence = 60% to find all frequent itemsets and strong association rules.

7 Consider the following transaction database with minimum support 50% and minimum confidence 66%. Find the frequent patterns and strong association rules.

8 For the table given, perform the Apriori algorithm and show the frequent itemsets and strong association rules. Assume minimum support of 30% and minimum confidence of 70%.

9 Given the following data, apply the Apriori algorithm. Find the frequent itemsets and strong association rules. Given support threshold = 50%, confidence = 60%.

YEAR

DEC_19,
DEC_19, MAY_23, DEC_23,
MAY_24, DEC_24

MAY_22

DEC_22

MAY_23
MAY_23

DEC_23

MAY_24
MAY_24

DEC_24
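
Most Chapter 5 questions ask for Apriori with a minimum support count and a minimum confidence. The transaction tables are not reproduced here, so the sketch below runs on a small hypothetical database (TRANSACTIONS) purely for illustration; the thresholds 2 and 60% are the ones named in questions 1 and 6. Function names are hypothetical, and this is a plain level-wise version, not an optimized one.

```python
from itertools import combinations

# Hypothetical transactions for illustration only; not taken from any exam table.
TRANSACTIONS = [{"A", "B", "C"}, {"A", "B"}, {"A", "C"}, {"B", "D"}]

def apriori(transactions, min_count):
    """Return {itemset: support count} for all frequent itemsets."""
    transactions = [frozenset(t) for t in transactions]

    def support_count(itemset):
        return sum(1 for t in transactions if itemset <= t)

    items = {item for t in transactions for item in t}
    counted = {frozenset([i]): support_count(frozenset([i])) for i in items}
    level = {c: n for c, n in counted.items() if n >= min_count}

    frequent, k = {}, 1
    while level:
        frequent.update(level)
        k += 1
        # Join step: unions of frequent (k-1)-itemsets that have exactly k items.
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
        counted = {c: support_count(c) for c in candidates}
        level = {c: n for c, n in counted.items() if n >= min_count}
    return frequent

def strong_rules(frequent, min_conf):
    """Return (lhs, rhs, confidence) for rules meeting min_conf."""
    rules = []
    for itemset, count in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                confidence = count / frequent[frozenset(lhs)]
                if confidence >= min_conf:
                    rhs = tuple(sorted(itemset - frozenset(lhs)))
                    rules.append((lhs, rhs, confidence))
    return rules

frequent = apriori(TRANSACTIONS, min_count=2)
rules = strong_rules(frequent, min_conf=0.60)
print(frequent)
print(rules)
```

For a question that gives support as a percentage, the count threshold would be that percentage of the number of transactions, rounded up.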
CHAPTER 6

5 MARKS
SR. NO QUESTION

1 Explain Web usage mining in detail.
YEAR: MAY_22, DEC_23, MAY_24

2 Explain page rank techniques in detail.
YEAR: MAY_23

3 Short note on Web content mining.
YEAR: DEC_23

4 Discuss different applications of Web mining.
YEAR: DEC_24

10 MARKS
SR. NO QUESTION

1 What is spatial data? Explain CLARANS Extension.

2 What is Web structure mining? List the approaches used to structure the web pages to improve the effectiveness of search engines and crawlers. Explain the Page Rank techniques in detail.

3 Is web mining different from classical data mining? Justify your answer. Describe types of web mining.

4 What is web mining? Explain web structure mining and web usage mining in detail.

5 Explain the page rank algorithm with an example.

6 What is Web mining? Differentiate between Web mining and data mining. Explain types of Web mining.

YEAR

DEC_19, DEC_22

DEC_22
MAY_23

DEC_23, MAY_24, DEC_24

DEC_24
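
For the Chapter 6 questions that ask for the page rank algorithm with an example, a minimal power-iteration sketch is given here. The three-page link graph LINKS is hypothetical, the damping factor 0.85 is the commonly quoted default rather than something stated in the questions, and the code assumes every page has at least one outgoing link (no dangling pages).

```python
# Hypothetical link graph: page -> pages it links to.
LINKS = {"A": ["B", "C"], "B": ["C"], "C": ["A"]}

def pagerank(links, damping=0.85, tol=1e-12, max_iter=1000):
    """Iterate PR(p) = (1 - d)/N + d * sum(PR(q) / outdegree(q)) over pages q linking to p.

    Assumes every page appears as a key and has at least one outgoing link.
    """
    pages = sorted(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}  # start from a uniform distribution
    for _ in range(max_iter):
        new_rank = {p: (1 - damping) / n for p in pages}
        for src, outs in links.items():
            share = damping * rank[src] / len(outs)  # rank split evenly over outlinks
            for dst in outs:
                new_rank[dst] += share
        if max(abs(new_rank[p] - rank[p]) for p in pages) < tol:
            return new_rank
        rank = new_rank
    return rank

ranks = pagerank(LINKS)
for page, score in sorted(ranks.items(), key=lambda kv: -kv[1]):
    print(page, round(score, 4))
```

In this graph C collects links from both A and B, so it ends up with the highest score, and A, which receives all of C's rank, comes second.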
