CDT305 A

C 1100CDT3051222011100IET301122101 Pages: 3

Reg No.: _______________ Name: __________________________

APJ ABDUL KALAM TECHNOLOGICAL UNIVERSITY
Fifth Semester B.Tech Degree Regular and Supplementary Examination December 2022 (2019 Scheme)

Course Code: CDT 305

Course Name: DATA ANALYTICS
Max. Marks: 100 Duration: 3 Hours

PART A
(Answer all questions; each question carries 3 marks) Marks

1 How is a data warehouse different from a database? (3)

2 Explain the following attribute types with examples: (3)
(i) Ordinal attribute
(ii) Nominal attribute
(iii) Numeric attribute
3 Explain any three methods for measuring the central tendency of data. (3)
4 Why do we need to pre-process the data? (3)
5 Compare a rare pattern and a negative pattern with the help of an example. (3)
6 Explain the join and prune actions in the Apriori algorithm. (3)
7 What is a dendrogram? How is a dendrogram constructed in agglomerative and divisive hierarchical clustering techniques? (3)
8 Explain density-reachability and density-connectivity concepts in DBSCAN. (3)
9 What are stop words? How is a stop list created? (3)
10 What is a language model? (3)
PART B
(Answer one full question from each module; each question carries 14 marks)

Module -1
11 a) Compare roll-up and drill-down operations in OLAP with an example. (8)
b) Given two objects represented by the tuples (10, 8, 4, 12) and (20, 10, 16, 8): (6)
(i) Compute the Euclidean distance between the tuples.
(ii) Compute the Manhattan distance between the two objects.
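As a way to check a hand calculation for the two tuples above, the following is a minimal sketch of both distance measures. The function names `euclidean` and `manhattan` and the variable names are illustrative assumptions, not part of the question paper.

```python
import math

def euclidean(x, y):
    """Square root of the sum of squared coordinate differences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def manhattan(x, y):
    """Sum of absolute coordinate differences."""
    return sum(abs(a - b) for a, b in zip(x, y))

# The two objects given in the question.
p = (10, 8, 4, 12)
q = (20, 10, 16, 8)

d_euclid = euclidean(p, q)      # sqrt(100 + 4 + 144 + 16) = sqrt(264)
d_manhattan = manhattan(p, q)   # 10 + 2 + 12 + 4 = 28

print(f"Euclidean: {d_euclid:.3f}, Manhattan: {d_manhattan}")
```

The coordinate differences are 10, 2, 12 and 4 in absolute value, so the Euclidean distance is the square root of 264 (about 16.25) and the Manhattan distance is 28.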
12 a) Explain any four probability distributions. (8)
b) Explain the iterative analytics process model. (6)


Module -2
13 a) Explain covariance analysis of numeric attributes with an example. (9)
b) Suppose that the data for analysis includes the marks of students in a class. The mark values for the data tuples are (33, 37, 37, 35, 45, 36, 30, 42, 32, 31, 32, 28, 36, 31, 32, 34, 32, 32, 33, 44, 32, 36, 38, 40, 40, 36, 40, 41). Construct the five-number summary for the dataset. (5)
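The five-number summary (minimum, Q1, median, Q3, maximum) for the marks above can be sketched as follows. Quartile conventions differ between textbooks and libraries. This sketch assumes the median-of-halves convention, where Q1 and Q3 are the medians of the lower and upper halves of the sorted data, so other conventions may give slightly different quartiles. The function name `five_number_summary` is hypothetical.

```python
from statistics import median

# Marks listed in the question (28 values).
marks = [33, 37, 37, 35, 45, 36, 30, 42, 32, 31, 32, 28, 36, 31,
         32, 34, 32, 32, 33, 44, 32, 36, 38, 40, 40, 36, 40, 41]

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the median-of-halves rule."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]
    # For an odd count the middle value is left out of both halves.
    upper = data[half:] if n % 2 == 0 else data[half + 1:]
    return (data[0], median(lower), median(data), median(upper), data[-1])

summary = five_number_summary(marks)
print(summary)
```

Under this convention the sorted data has 35 and 36 in the two middle positions, giving a median of 35.5, with Q1 = 32 and Q3 = 39, a minimum of 28 and a maximum of 45.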
14 a) Explain any six strategies for data transformation. (6)
b) Explain any two methods for data normalization with examples. (8)
Module -3
15 a) Consider a database having the nine transactions listed below. Let min_sup=2 and min_conf=60%. Find the frequent itemsets in the database using the Apriori algorithm. (10)
TID ITEMS
TR1 jam, biscuit, chocolate
TR2 biscuit, butter
TR3 biscuit, milk
TR4 jam, biscuit, butter
TR5 jam, milk
TR6 biscuit, milk
TR7 jam, milk
TR8 jam, biscuit, milk, chocolate
TR9 jam, biscuit, milk
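The level-wise search over the transaction table above can be sketched as follows. This is an illustrative sketch, not a reference implementation. It assumes min_sup=2 is an absolute support count, and it uses a simplified join (union of any two frequent (k-1)-itemsets that yields a k-itemset) in place of the prefix-based join found in many textbooks. The prune step then discards any candidate with an infrequent (k-1)-subset. The names `apriori` and `support_count` are hypothetical.

```python
from itertools import combinations

# Transactions TR1..TR9 from the table.
transactions = [
    {"jam", "biscuit", "chocolate"},
    {"biscuit", "butter"},
    {"biscuit", "milk"},
    {"jam", "biscuit", "butter"},
    {"jam", "milk"},
    {"biscuit", "milk"},
    {"jam", "milk"},
    {"jam", "biscuit", "milk", "chocolate"},
    {"jam", "biscuit", "milk"},
]
MIN_SUP = 2  # assumed to be an absolute count

def support_count(itemset, transactions):
    """Number of transactions that contain every item of itemset."""
    return sum(1 for t in transactions if itemset <= t)

def apriori(transactions, min_sup):
    """Return a dict mapping each frequent itemset to its support count."""
    items = sorted({item for t in transactions for item in t})
    current = {}
    for item in items:
        candidate = frozenset([item])
        count = support_count(candidate, transactions)
        if count >= min_sup:
            current[candidate] = count

    frequent = {}
    k = 1
    while current:
        frequent.update(current)
        k += 1
        # Join: combine frequent (k-1)-itemsets into candidate k-itemsets.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        current = {}
        for c in candidates:
            count = support_count(c, transactions)
            if count >= min_sup:
                current[c] = count
    return frequent

frequent = apriori(transactions, MIN_SUP)
for itemset, count in sorted(frequent.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

With these assumptions all five single items are frequent, six pairs survive, and the largest frequent itemsets are {jam, biscuit, milk} and {jam, biscuit, chocolate}, each with support count 2.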

b) Explain any two key measures to quantify the strength of an association rule. Why is the Apriori algorithm slow? (4)
16 a) Explain the pattern-growth approach for mining frequent itemsets with an example. (10)
b) What is market basket analysis? (4)
Module -4
17 a) Explain decision tree induction with the help of an example. (8)
b) Explain any two attribute selection measures with an example. (6)
18 a) Given the following distance matrix, construct the dendrogram using agglomerative clustering with complete linkage and average linkage. (10)

     A    B    C    D    E
A    0    8    4    6   10
B    8    0    6    5    9
C    4    6    0    8    2
D    6    5    8    0    7
E   10    9    2    7    0
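The merge order and merge heights that define the dendrogram for this matrix can be sketched as follows. This is a naive illustration that recomputes cluster distances from the original matrix at every step. The function names `cluster_distance` and `agglomerate` are hypothetical, and ties are broken by taking the first minimum encountered, which is an assumption of this sketch.

```python
from itertools import combinations

labels = ["A", "B", "C", "D", "E"]
dist = [
    [0, 8, 4, 6, 10],
    [8, 0, 6, 5, 9],
    [4, 6, 0, 8, 2],
    [6, 5, 8, 0, 7],
    [10, 9, 2, 7, 0],
]

def cluster_distance(c1, c2, linkage):
    """Distance between two clusters given as tuples of point indices."""
    pairwise = [dist[i][j] for i in c1 for j in c2]
    if linkage == "complete":
        return max(pairwise)                  # farthest pair
    if linkage == "average":
        return sum(pairwise) / len(pairwise)  # mean over all pairs
    raise ValueError(f"unknown linkage: {linkage}")

def agglomerate(linkage):
    """Return the list of (merged cluster name, merge height) in merge order."""
    clusters = [(i,) for i in range(len(labels))]
    merges = []
    while len(clusters) > 1:
        # Pick the closest pair of clusters; first minimum wins on ties.
        a, b = min(combinations(clusters, 2),
                   key=lambda pair: cluster_distance(pair[0], pair[1], linkage))
        height = cluster_distance(a, b, linkage)
        clusters = [c for c in clusters if c not in (a, b)] + [a + b]
        merges.append(("".join(labels[i] for i in sorted(a + b)), height))
    return merges

complete_merges = agglomerate("complete")
average_merges = agglomerate("average")
print("complete:", complete_merges)
print("average: ", average_merges)
```

With complete linkage the merges are CE at 2, BD at 5, ABD at 8 and ABCDE at 10. With average linkage the first two merges are the same, but the third step is a tie: A is at average distance 7 from both {B, D} and {C, E}. Either choice gives a final merge height of 44/6 (about 7.33), so the tie changes the shape of the dendrogram but not its merge heights.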

b) How does the divisive hierarchical clustering method work? Explain any three challenges. (4)
Module -5
19 a) Explain Boolean Retrieval with an example. (10)
b) Compare unigram and bigram language models. (4)
20 a) Explain tokenization, stemming and lemmatization with an example. (9)
b) What is case-folding? Explain with an example. (5)
***
