0% found this document useful (0 votes)

19 views3 pages

DATA MINING (Gtu Sem-6) Assignment

The document provides a comprehensive overview of data mining, covering its definitions, functionalities, and classifications, as well as the integration with databases. It discusses data pre-processing techniques, the concept of clustering, and various mining methods including classification, prediction, and web mining. Each chapter includes specific tasks, examples, and calculations related to data mining processes and techniques.

Uploaded by

rathodkrishna2502

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views3 pages

DATA MINING (Gtu Sem-6) Assignment

Uploaded by

rathodkrishna2502

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Data Mining

Chapter 1: Introduction to Data Mining

1. Define data mining and explain its functionalities.

2. Explain the different classifications of data mining systems.

3. Describe the task primitives of data mining.

4. How does a data mining system integrate with a database or data warehouse?

5. Discuss the key issues in data mining.

6. A company uses data mining to classify customer transactions. Given that 10% of the
transactions are fraudulent, estimate the number of fraudulent transactions from a
dataset of 50,000 transactions.

7. Explain the Knowledge Discovery in Databases (KDD) process.

Chapter 2: Data Pre-processing

1. What is data summarization? Explain its importance in data mining.

2. Discuss various data cleaning techniques with examples.

3. Explain the process of data integration and transformation.

4. Suppose a dataset contains missing values. If 20% of a 1,000-record dataset has missing
values, how many records need imputation?

5. Describe the concept of data reduction and its techniques.

6. What is dimensionality reduction? Explain the CUR decomposition method.

7. Differentiate between feature extraction, feature transformation, and feature selection.

Chapter 3: Concept Description, Mining Frequent Patterns, Associations, and Correlations

1. What is concept description in data mining? How is it useful?

2. Explain data generalization and summarization-based characterization.

3. What are frequent item-set mining methods? Explain any two.

4. Discuss the various types of association rules and correlation analysis.

5. Explain advanced association rule techniques and their importance.

6. How do we measure the quality of rules in data mining?

Chapter 4: Classification and Prediction

1. Differentiate between classification and prediction.

2. Explain the issues related to classification and prediction.

3. Describe statistical-based and distance-based classification algorithms.

4. Explain decision tree-based and neural network-based classification algorithms.

5. What are rule-based classification techniques? Give an example.

6. Discuss the evaluation metrics used to measure classifier accuracy.

7. Explain logistic regression and its role in prediction.

8. How can tools like WEKA and DB Miner be used in classification?

9. Given the transactions:

T1: {Milk, Bread, Butter}

T2: {Milk, Bread}

T3: {Milk, Butter}

T4: {Bread, Butter}

1. Calculate the support and confidence for the rule {Milk} → {Bread}.

2. Find frequent itemsets using Apriori algorithm with a minimum support of 50%.

Chapter 5: Cluster Analysis

1. What is clustering in data mining? Explain its importance.

2. Discuss the problem definition of clustering and its applications.

3. Explain the K-Means algorithm and its additional issues.

4. What is the PAM algorithm? How does it differ from K-Means?

5. Differentiate between agglomerative and divisive hierarchical clustering methods.

6. Explain outlier detection and its importance in clustering.

7. How do we perform clustering on high-dimensional data?

8. A hierarchical clustering algorithm merges two clusters with distances D(A, B) = 5 and
D(B, C) = 7. Compute the new distance using:

1. Single linkage method

2. Complete linkage method

9. Discuss clustering techniques for graph and network data.

Chapter 6: Web Mining and Other Data Mining Techniques

1. What is web mining? Explain its different types.

2. Discuss web content mining, web usage mining, and web structure mining.

3. Explain the structure and issues related to web logs.

4. What is spatial data mining? How is it different from temporal mining?

5. Describe the concepts of multimedia mining.

6. What are the applications of distributed and parallel data mining?

7. A website has the following clickstream data:

Page A → Page B (50 clicks)

Page B → Page C (30 clicks)

Page C → Page A (20 clicks)

8. Compute the transition probability matrix for web usage mining

Data Mining Essentials for Analysts
No ratings yet
Data Mining Essentials for Analysts
2 pages
KDD and Data Mining Explained
No ratings yet
KDD and Data Mining Explained
46 pages
Summarizing Transactional Data Insights
No ratings yet
Summarizing Transactional Data Insights
22 pages
Unit1 - Intoduction To Data Mining
No ratings yet
Unit1 - Intoduction To Data Mining
10 pages
Lecture 01 11jan
No ratings yet
Lecture 01 11jan
29 pages
Data Warehousing and Data Mining Dr.P.rizwan Ahmed
0% (1)
Data Warehousing and Data Mining Dr.P.rizwan Ahmed
20 pages
Data Science & Big Data Analysis Module 1,2,3,4,5
No ratings yet
Data Science & Big Data Analysis Module 1,2,3,4,5
70 pages
Data Mining Concepts Overview
No ratings yet
Data Mining Concepts Overview
28 pages
Combine 056
No ratings yet
Combine 056
57 pages
Introduction To Data Mining
No ratings yet
Introduction To Data Mining
11 pages
DM Answers CAT-1
No ratings yet
DM Answers CAT-1
18 pages
DM - Unit I-Updated
No ratings yet
DM - Unit I-Updated
65 pages
Data Mining 1
No ratings yet
Data Mining 1
39 pages
Intro of Data Mining
No ratings yet
Intro of Data Mining
27 pages
Introduction
No ratings yet
Introduction
27 pages
Unit 1
No ratings yet
Unit 1
148 pages
Data Mining: Concepts and Techniques
100% (2)
Data Mining: Concepts and Techniques
27 pages
Data Mining: Concepts and Techniques
No ratings yet
Data Mining: Concepts and Techniques
27 pages
Week-1-Introduction To Data Mining
No ratings yet
Week-1-Introduction To Data Mining
43 pages
Data Mining: Concepts and Techniques
No ratings yet
Data Mining: Concepts and Techniques
25 pages
Chapter 1. Introduction
No ratings yet
Chapter 1. Introduction
323 pages
Data Mining
No ratings yet
Data Mining
20 pages
Fundamentals of Data Science Notes (Module - 1)
No ratings yet
Fundamentals of Data Science Notes (Module - 1)
19 pages
Introduction to Data Mining
No ratings yet
Introduction to Data Mining
55 pages
Data Mining
No ratings yet
Data Mining
9 pages
Data Mining: Concepts and Techniques
No ratings yet
Data Mining: Concepts and Techniques
31 pages
Data Mining UNIT - 1 (Important)
No ratings yet
Data Mining UNIT - 1 (Important)
7 pages
DM 1
No ratings yet
DM 1
47 pages
Data Mining Concepts and Applications
No ratings yet
Data Mining Concepts and Applications
20 pages
Synopsis Print
No ratings yet
Synopsis Print
4 pages
Introduction To Data Mining-Week1
No ratings yet
Introduction To Data Mining-Week1
43 pages
Module 2 (A) - Introduction To Data Mining
No ratings yet
Module 2 (A) - Introduction To Data Mining
37 pages
DM Answers
No ratings yet
DM Answers
22 pages
Week 02 PDF
No ratings yet
Week 02 PDF
39 pages
Data Mining for Beginners
No ratings yet
Data Mining for Beginners
26 pages
Data Mining and Web Mining Overview
No ratings yet
Data Mining and Web Mining Overview
62 pages
01 Intro
No ratings yet
01 Intro
26 pages
DM Overview
No ratings yet
DM Overview
52 pages
DWDM LS1 Fall 24 25
No ratings yet
DWDM LS1 Fall 24 25
42 pages
Data Mining: An Overview From A Database Perspective
No ratings yet
Data Mining: An Overview From A Database Perspective
30 pages
Data Mining Module 1 Theory
No ratings yet
Data Mining Module 1 Theory
4 pages
01 Introduction
No ratings yet
01 Introduction
36 pages
Unit 3
100% (1)
Unit 3
22 pages
Data Mining-1
No ratings yet
Data Mining-1
7 pages
Unit 1
No ratings yet
Unit 1
7 pages
Data Mining Mids
No ratings yet
Data Mining Mids
24 pages
01 - Introduction To Datamining
No ratings yet
01 - Introduction To Datamining
19 pages
Data Mining Concepts and Techniques Overview
No ratings yet
Data Mining Concepts and Techniques Overview
53 pages
Data Mining Introduction
No ratings yet
Data Mining Introduction
32 pages
Knowledge Management UNIT-3 Notes
No ratings yet
Knowledge Management UNIT-3 Notes
17 pages
Data Ming Unit 2
No ratings yet
Data Ming Unit 2
8 pages
Comprehensive Guide to Data Mining
No ratings yet
Comprehensive Guide to Data Mining
32 pages
Data Mining Techniques and Applications
No ratings yet
Data Mining Techniques and Applications
33 pages
Chap1 Introduction
No ratings yet
Chap1 Introduction
21 pages
Web Mining Unit 1
No ratings yet
Web Mining Unit 1
25 pages
IJAAFMR220203
No ratings yet
IJAAFMR220203
14 pages
Using Asset Turnover and Profit Margin T
No ratings yet
Using Asset Turnover and Profit Margin T
15 pages
Internet Riches The Simple Money Making Secrets of Online Millionaires 1st Edition Edition Scott Fox Instant Download
No ratings yet
Internet Riches The Simple Money Making Secrets of Online Millionaires 1st Edition Edition Scott Fox Instant Download
144 pages
Spearman Rho Newcorrelation
No ratings yet
Spearman Rho Newcorrelation
22 pages
أثر الخدمات المصرفية الالكترونية على تحسين جودة الخدمات بالمصارف دراسة حالة الوكالات العمومية و الخاصة بولاية البليدة
No ratings yet
أثر الخدمات المصرفية الالكترونية على تحسين جودة الخدمات بالمصارف دراسة حالة الوكالات العمومية و الخاصة بولاية البليدة
15 pages
Principles of Industrial Instrumentation Third Edition Dipak Patranabis ebook unlocked 2025 file
100% (6)
Principles of Industrial Instrumentation Third Edition Dipak Patranabis ebook unlocked 2025 file
127 pages
Bayesian Regression Analysis
No ratings yet
Bayesian Regression Analysis
85 pages
RA - Assignment 1
No ratings yet
RA - Assignment 1
1 page
The Effects of Implementing Catch Up Fridays Among Stem Students Well Being
No ratings yet
The Effects of Implementing Catch Up Fridays Among Stem Students Well Being
68 pages
Eng10 q4 Mod1 English 10
No ratings yet
Eng10 q4 Mod1 English 10
37 pages
CAG Repeat Length and The Age of Onset in Huntington Disease (HD) - A Review and Validation Study of Statistical Approaches
No ratings yet
CAG Repeat Length and The Age of Onset in Huntington Disease (HD) - A Review and Validation Study of Statistical Approaches
23 pages
Organisational Culture On Productivity - Namibia
No ratings yet
Organisational Culture On Productivity - Namibia
130 pages
Uncertainty in Qualitative Testing
No ratings yet
Uncertainty in Qualitative Testing
24 pages
Marketing Automation in CRM
No ratings yet
Marketing Automation in CRM
58 pages
CHO - Statistics
No ratings yet
CHO - Statistics
6 pages
EM GaussianMixture Example
No ratings yet
EM GaussianMixture Example
2 pages
UT Dallas Syllabus For Stat1342.001.09s Taught by Yuly Koshevnik (Yxk055000)
No ratings yet
UT Dallas Syllabus For Stat1342.001.09s Taught by Yuly Koshevnik (Yxk055000)
5 pages
Lecture 3 - Advanced Topics
No ratings yet
Lecture 3 - Advanced Topics
28 pages
Online Education Challenges in Bangladesh
No ratings yet
Online Education Challenges in Bangladesh
14 pages
Lecture Notes For Chapter 4 Introduction To Data Mining: by Tan, Steinbach, Kumar
No ratings yet
Lecture Notes For Chapter 4 Introduction To Data Mining: by Tan, Steinbach, Kumar
101 pages
Introduction to Statistics Basics
No ratings yet
Introduction to Statistics Basics
30 pages
Online Food Delivery Satisfaction Factors
No ratings yet
Online Food Delivery Satisfaction Factors
82 pages
Credit Card Research Paper
No ratings yet
Credit Card Research Paper
12 pages
EDS Rasch Demo
No ratings yet
EDS Rasch Demo
27 pages
Schramm, W. (1971, December) - Notes On Case Studies of Instructional Mediaprojects. Working
No ratings yet
Schramm, W. (1971, December) - Notes On Case Studies of Instructional Mediaprojects. Working
43 pages
Statistics for Students
No ratings yet
Statistics for Students
18 pages
Probability Distributions and Expectations
No ratings yet
Probability Distributions and Expectations
4 pages
Deeper Understanding, Faster Calculation - Exam P Insights and Shortcuts
90% (10)
Deeper Understanding, Faster Calculation - Exam P Insights and Shortcuts
432 pages
Impellizzeri MSSE Asimmetry 1 PDF
No ratings yet
Impellizzeri MSSE Asimmetry 1 PDF
7 pages
Database Design and Implementation LRU-k
No ratings yet
Database Design and Implementation LRU-k
10 pages

DATA MINING (Gtu Sem-6) Assignment

Uploaded by

DATA MINING (Gtu Sem-6) Assignment

Uploaded by

Data Mining

Chapter 1: Introduction to Data Mining

1. Define data mining and explain its functionalities.

2. Explain the different classifications of data mining systems.

3. Describe the task primitives of data mining.

5. Discuss the key issues in data mining.

7. Explain the Knowledge Discovery in Databases (KDD) process.

Chapter 2: Data Pre-processing

1. What is data summarization? Explain its importance in data mining.

2. Discuss various data cleaning techniques with examples.

3. Explain the process of data integration and transformation.

5. Describe the concept of data reduction and its techniques.

6. What is dimensionality reduction? Explain the CUR decomposition method.

7. Differentiate between feature extraction, feature transformation, and feature selection.

Chapter 3: Concept Description, Mining Frequent Patterns, Associations, and Correlations

1. What is concept description in data mining? How is it useful?

2. Explain data generalization and summarization-based characterization.

3. What are frequent item-set mining methods? Explain any two.

4. Discuss the various types of association rules and correlation analysis.

6. How do we measure the quality of rules in data mining?

Chapter 4: Classification and Prediction

1. Differentiate between classification and prediction.

2. Explain the issues related to classification and prediction.

3. Describe statistical-based and distance-based classification algorithms.

4. Explain decision tree-based and neural network-based classification algorithms.

5. What are rule-based classification techniques? Give an example.

6. Discuss the evaluation metrics used to measure classifier accuracy.

7. Explain logistic regression and its role in prediction.

8. How can tools like WEKA and DB Miner be used in classification?

9. Given the transactions:

T1: {Milk, Bread, Butter}

T2: {Milk, Bread}

T3: {Milk, Butter}

T4: {Bread, Butter}

Chapter 5: Cluster Analysis

1. What is clustering in data mining? Explain its importance.

2. Discuss the problem definition of clustering and its applications.

3. Explain the K-Means algorithm and its additional issues.

4. What is the PAM algorithm? How does it differ from K-Means?

5. Differentiate between agglomerative and divisive hierarchical clustering methods.

6. Explain outlier detection and its importance in clustering.

7. How do we perform clustering on high-dimensional data?

1. Single linkage method

2. Complete linkage method

9. Discuss clustering techniques for graph and network data.

Chapter 6: Web Mining and Other Data Mining Techniques

1. What is web mining? Explain its different types.

3. Explain the structure and issues related to web logs.

4. What is spatial data mining? How is it different from temporal mining?

5. Describe the concepts of multimedia mining.

6. What are the applications of distributed and parallel data mining?

7. A website has the following clickstream data:

Page A → Page B (50 clicks)

Page B → Page C (30 clicks)

Page C → Page A (20 clicks)

8. Compute the transition probability matrix for web usage mining

You might also like