BigData Mining and Analytics

BDA

Uploaded by

lekha.cce


SEMESTER - I

24PBDPC1   BIG DATA MINING AND ANALYTICS   L T P C
02         SDG NO. 4                       3 0 0 3

OBJECTIVES:
• To understand the computational approaches to modelling and feature extraction.
• To understand the need for and application of MapReduce.
• To understand the various search algorithms applicable to Big Data.
• To analyse and interpret streaming data.
• To learn how to handle large data sets in main memory and the various clustering techniques applicable to Big Data.

UNIT I DATA MINING AND LARGE SCALE FILES 9

Introduction to Statistical Modeling – Machine Learning – Computational
Approaches to Modeling – Summarization – Feature Extraction – Statistical
Limits on Data Mining – Distributed File Systems – MapReduce – Algorithms
Using MapReduce – Efficiency of Cluster Computing Techniques.
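
As an illustration of the MapReduce topic in this unit, the following is a minimal single-machine sketch of the map, shuffle and reduce steps applied to word counting. It is not tied to Hadoop or any other framework; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen only for this example.

```python
from collections import defaultdict


def map_phase(document):
    """Emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group emitted values by key, as a framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle and reduce in sequence on a list of strings."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = word_count(["big data mining", "mining big streams"])
```

In a real cluster the map and reduce calls would run in parallel on different nodes; here they run sequentially so that the data flow is easy to follow.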

UNIT II SIMILAR ITEMS 9

Nearest Neighbor Search – Shingling of Documents – Similarity-Preserving
Summaries – Locality-Sensitive Hashing for Documents – Distance Measures
– Theory of Locality-Sensitive Functions – LSH Families – Methods for High
Degrees of Similarity.
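
The shingling and similarity-preserving summary topics can be sketched as follows. This is a minimal illustration, assuming character shingles of length 3 and a family of seeded SHA-1 hashes standing in for random permutations; the helper names and the default of 200 hash functions are assumptions made for this example only.

```python
import hashlib


def shingles(text, k=3):
    """Return the set of character k-shingles of a string."""
    return {text[i:i + k] for i in range(len(text) - k + 1)}


def jaccard(a, b):
    """Exact Jaccard similarity of two sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def _hash(shingle, seed):
    """Deterministic 64-bit hash of a shingle under a given seed."""
    digest = hashlib.sha1(f"{seed}:{shingle}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def minhash_signature(shingle_set, num_hashes=200):
    """MinHash signature: the minimum hash value per seeded hash function.

    The shingle set must be non-empty (text at least k characters long).
    """
    return [min(_hash(s, seed) for s in shingle_set)
            for seed in range(num_hashes)]


def estimated_similarity(sig_a, sig_b):
    """Fraction of signature positions on which two signatures agree."""
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


doc_a = shingles("big data mining and analytics")
doc_b = shingles("big data mining and analysis")
exact = jaccard(doc_a, doc_b)
estimate = estimated_similarity(minhash_signature(doc_a),
                                minhash_signature(doc_b))
```

The fraction of agreeing signature positions approximates the Jaccard similarity of the underlying shingle sets, which is what allows short signatures to stand in for whole documents.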

UNIT III MINING DATA STREAMS 9

Stream Data Model – Sampling Data in a Stream – Filtering Streams –
Counting Distinct Elements in a Stream – Estimating Moments – Counting
Ones in a Window – Decaying Windows.
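
Stream filtering is commonly illustrated with a Bloom filter, so a minimal sketch is given here. The class name, the default sizes (1024 bits, 3 hash functions) and the use of seeded SHA-256 as the hash family are assumptions for this example, not values prescribed by the syllabus.

```python
import hashlib


class BloomFilter:
    """Approximate set membership: no false negatives, some false positives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        # One byte per bit position, for clarity rather than compactness.
        self.bits = bytearray(num_bits)

    def _positions(self, item):
        """Yield one bit position per seeded hash function."""
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        """Set every bit position that the item hashes to."""
        for pos in self._positions(item):
            self.bits[pos] = 1

    def might_contain(self, item):
        """True if all of the item's bit positions are set."""
        return all(self.bits[pos] for pos in self._positions(item))


allowed = BloomFilter()
for address in ["alice@example.com", "bob@example.com"]:
    allowed.add(address)
```

A stream element that fails `might_contain` is certainly not in the allowed set and can be dropped immediately; an element that passes may still be a false positive and needs a second check if exactness matters.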

UNIT IV LINK ANALYSIS AND FREQUENT ITEMSETS 9

PageRank – Efficient Computation – Topic-Sensitive PageRank – Link Spam
– Market Basket Model – A-Priori Algorithm – Handling Larger Datasets in
Main Memory – Limited-Pass Algorithms – Counting Frequent Itemsets.
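
For the PageRank topic, the following is a minimal power-iteration sketch on a small in-memory graph. The function name, the damping value `beta=0.85`, the fixed iteration count, and the choice to spread the rank of dead-end pages evenly over all pages are assumptions made for this example.

```python
def pagerank(links, beta=0.85, iterations=50):
    """Power iteration for PageRank.

    links maps each page to the list of pages it links to.
    """
    pages = sorted(set(links) | {q for targets in links.values() for q in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page receives the teleport share (1 - beta) / n.
        new_rank = {p: (1.0 - beta) / n for p in pages}
        for p in pages:
            targets = links.get(p, [])
            if targets:
                # Split the damped rank of p equally among its out-links.
                share = beta * rank[p] / len(targets)
                for q in targets:
                    new_rank[q] += share
            else:
                # Dead end: spread its damped rank evenly over all pages.
                share = beta * rank[p] / n
                for q in pages:
                    new_rank[q] += share
        rank = new_rank
    return rank


ranks = pagerank({"A": ["B", "C"], "B": ["C"], "C": ["A"]})
```

Because each iteration redistributes exactly the rank that existed before it, the ranks keep summing to 1, and page C, which receives links from both A and B, ends up ranked above B.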

UNIT V CLUSTERING 9

Introduction to Clustering Techniques – Hierarchical Clustering –
Algorithms – K-Means – CURE – Clustering in Non-Euclidean Spaces –
Streams and Parallelism – Case Study: Advertising on the Web –
Recommendation Systems.
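
The K-Means topic can be sketched with the standard assign-and-update loop. This is a minimal illustration that assumes Euclidean points given as tuples and caller-supplied initial centroids; the function names and the fixed iteration count are choices made for this example only.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, centroids, iterations=10):
    """Alternate assignment and update steps for a fixed number of rounds.

    Returns the final centroids and the cluster index chosen for each
    point in the last assignment step.
    """
    centroids = [tuple(c) for c in centroids]
    assignment = []
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        assignment = [
            min(range(len(centroids)),
                key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == j]
            if members:
                centroids[j] = tuple(sum(col) / len(members)
                                     for col in zip(*members))
    return centroids, assignment


points = [(0, 0), (0, 1), (10, 10), (10, 11)]
centers, labels = kmeans(points, centroids=[(0, 0), (10, 10)])
```

A centroid that attracts no points is left where it is in this sketch; practical implementations usually re-seed it or choose initial centroids more carefully.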

TOTAL: 45 PERIODS

TEXT BOOKS:
1. Jure Leskovec, Anand Rajaraman, Jeffrey David Ullman, “Mining of
Massive Datasets”, Cambridge University Press, Second Edition, 2014.
2. Jiawei Han, Micheline Kamber, Jian Pei, “Data Mining: Concepts
and Techniques”, Morgan Kaufmann Publications, Third Edition,
2011.

REFERENCES:
1. Ian H. Witten, Eibe Frank, “Data Mining – Practical Machine Learning
Tools and Techniques”, Morgan Kaufmann Publications, Third Edition,
2011.
2. David Hand, Heikki Mannila and Padhraic Smyth, “Principles of Data
Mining”, MIT Press, 2001.

WEB REFERENCES:
1. https://swayam.gov.in/nd2_arp19_ap60/preview
2. https://nptel.ac.in/content/storage2/nptel_data3/html/mhrd/ict/text/106104189/lec1.pdf

ONLINE RESOURCES:
1. https://examupdates.in/big-data-analytics/
2. https://www.tutorialspoint.com/big_data_analytics/index.htm
3. https://www.tutorialspoint.com/data_mining/index.htm

OUTCOMES:
Upon completion of the course, the student should be able to
1. Design algorithms that employ the MapReduce technique to solve Big
Data problems.
2. Design algorithms for Big Data by selecting an appropriate feature set.
3. Design algorithms for handling petabyte-scale datasets.
4. Design algorithms and propose solutions for Big Data by optimizing
main memory consumption.
5. Design solutions for Big Data problems by suggesting
appropriate clustering techniques.
