0% found this document useful (0 votes)

19 views28 pages

ch01 Intro

This document contains lecture slides for a course on mining massive datasets. The slides cover topics like data mining goals, descriptive vs predictive methods, challenges with data mining, and different types of data and computational models that will be covered in the course.

Uploaded by

ciuciu.denis.2023

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views28 pages

ch01 Intro

Uploaded by

ciuciu.denis.2023

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

Note to other teachers and users of these slides: We would be delighted if you found this our

material useful in giving your own lectures. Feel free to use these slides verbatim, or to modify
them to fit your own needs. If you make use of a significant portion of these slides in your own
lecture, please include this message, or a link to our web site: http://www.mmds.org

Mining of Massive Datasets

Jure Leskovec, Anand Rajaraman, Jeﬀ Ullman
Stanford University
http://www.mmds.org
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 3
Data contains value and knowledge
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 4
¡ But to extract the knowledge
data needs to be
§ Stored
§ Managed
§ And ANALYZED ! this class

Data Mining ≈ Big Data ≈

Predic@ve Analy@cs ≈ Data Science

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 5
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 6
¡ Given lots of data
¡ Discover paFerns and models that are:
§ Valid: hold on new data with some certainty
§ Useful: should be possible to act on the item
§ Unexpected: non-‐obvious to the system
§ Understandable: humans should be able to
interpret the pa=ern

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 7
¡ Descrip@ve methods
§ Find human-‐interpretable pa=erns that
describe the data
§ Example: Clustering

¡ Predic@ve methods
§ Use some variables to predict unknown
or future values of other variables
§ Example: Recommender systems

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 8
¡ A risk with “Data mining” is that an analyst
can “discover” paFerns that are meaningless
¡ StaOsOcians call it Bonferroni’s principle:
§ Roughly, if you look in more places for interesOng
pa=erns than your amount of data will support,
you are bound to ﬁnd crap

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 9
Example:
¡ We want to find (unrelated) people who at least twice
have stayed at the same hotel on the same day
§ 109 people being tracked
§ 1,000 days
§ Each person stays in a hotel 1% of Ome (1 day out of 100)
§ Hotels hold 100 people (so 105 hotels)
§ If everyone behaves randomly (i.e., no terrorists) will the
data mining detect anything suspicious?
¡ Expected number of “suspicious” pairs of people:
§ 250,000
§ … too many combinaOons to check – we need to have some
addiOonal evidence to find “suspicious” pairs of people in
some more efficient way
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 10
Usage

Quality

Context

Streaming

Scalability

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 11
¡ Data mining overlaps with:
§ Databases: Large-‐scale data, simple queries
§ Machine learning: Small data, Complex models
§ CS Theory: (Randomized) Algorithms
¡ Diﬀerent cultures:
§ To a DB person, data mining is an extreme form of
analy@c processing – queries that
examine large amounts of data CS
Machine
Theory
§ Result is the query answer Learning
Data
§ To a ML person, data-‐mining Mining
is the inference of models
§ Result is the parameters of the model Database
¡ In this class we will do both! systems
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 12
¡ This class overlaps with machine learning,
sta@s@cs, ar@ﬁcial intelligence, databases
but more stress on
§ Scalability (big data)
Statistics
§ Algorithms Machine
Learning
§ Compu@ng architectures
§ AutomaOon for handling Data Mining
large data
Database
systems

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 13
¡ We will learn to mine different types of data:
§ Data is high dimensional
§ Data is a graph
§ Data is infinite/never-‐ending
§ Data is labeled
¡ We will learn to use different models of
computa@on:
§ MapReduce
§ Streams and online algorithms
§ Single machine in-‐memory
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 14
¡ We will learn to solve real-‐world problems:
§ Recommender systems
§ Market Basket Analysis
§ Spam detecOon
§ Duplicate document detecOon
¡ We will learn various “tools”:
§ Linear algebra (SVD, Rec. Sys., CommuniOes)
§ OpOmizaOon (stochasOc gradient descent)
§ Dynamic programming (frequent itemsets)
§ Hashing (LSH, Bloom filters)
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 15
High dim. Graph Infinite Machine
Apps
data data data learning

Locality Filtering
PageRank, Recommen
sensiOve data SVM
SimRank der systems
hashing streams

Community Web Decision AssociaOon

Clustering
DetecOon adverOsing Trees Rules

Dimensional Duplicate
Spam Queries on Perceptron,
ity document
DetecOon streams kNN
reducOon detecOon

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 16
How do you want that data?
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 17
¡ TAs:
§ We have 9 great TAs!
§ Sean Choi (Head TA), Sumit ArrawaOa, Jus@n Chen,
Dingyi Li, Anshul Mi=al, Rose Marie Philip, Robi
Robaszkiewicz, Le Yu, Tongda Zhang

¡ Oﬃce hours:
§ Jure: Wednesdays 9-‐10am, Gates 418
§ See course website for TA oﬃce hours
§ For SCPD students we will use Google Hangout
§ We will post Google Hangout links on Piazza
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 19
¡ Course website:
hFp://cs246.stanford.edu
§ Lecture slides (at least 30min before the lecture)
§ Homeworks, soluOons
§ Readings
¡ Readings: Book Mining of Massive Datasets
with A. Rajaraman and J. Ullman
Free online:
hFp://www.mmds.org

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 20
¡ Piazza Q&A website:
§ h=ps://piazza.com/class#winter2013/cs246
§ Use Piazza for all quesOons and public communicaOon
with the course staﬀ
§ If you don’t have @stanford.edu email address, send us
your email and we will manually register you to Piazza

¡ For e-‐mailing us, always use:

§ cs246-‐win1213-‐staﬀ@lists.stanford.edu

¡ We will post course announcements to

Piazza (make sure you check it regularly)
Auditors are welcome to sit-‐in & audit the class
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 21
¡ (1+)4 longer homeworks: 40%
§ TheoreOcal and programming quesOons
§ HW0 (Hadoop tutorial) has just been posted
§ Assignments take lots of @me. Start early!!
¡ How to submit?
§ Homework write-‐up:
§ Stanford students: In class or in Gates submission box
§ SCPD students: Submit write-‐ups via SCPD
§ AFach the HW cover sheet (and SCPD rouOng form)
§ Upload code:
§ Put the code for 1 quesOon into 1 ﬁle and
submit at: h=p://snap.stanford.edu/submit/
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 22
¡ Short weekly quizzes: 20%
§ Short e-‐quizzes on Gradiance
§ You have exactly 7 days to complete it
No late days!
§ First quiz is already online

¡ Final exam: 40%

§ Friday, March 22 12:15pm-‐3:15pm

¡ It’s going to be fun and hard work. ☺

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 23
¡ Homework schedule:
Date

Out In
01/08, Tue HW0

01/10, Thu HW1

01/15, Tue HW0
01/24, Thu HW2 HW1
02/07, Thu HW3 HW2
02/21, Thu HW4 HW3
03/07, Thu HW4
§ 2 late “days” (late periods) for HWs for the quarter:
§ 1 late day expires at the start of next class
§ You can use max 1 late day per assignment
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 24
¡ Algorithms (CS161)
§ Dynamic programming, basic data structures
¡ Basic probability (CS109 or Stat116)
§ Moments, typical distribuOons, MLE, …
¡ Programming (CS107 or CS145)
§ Your choice, but C++/Java will be very useful

¡ We provide some background, but

the class will be fast paced

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 25
¡ 3 recita@on sessions:
§ Hadoop: Thurs. 1/10, 5:15-‐6:30pm
§ We prepared a virtual machine with Hadoop preinstalled
§ HW0 helps you write your ﬁrst Hadoop program
§ Review of probability&stats: 1/17, 5:15-‐6:30pm
§ Review of linear algebra: 1/18, 5:15-‐6:30pm

§ All sessions will be held in Thornton 102,

Thornton Center (Terman Annex)
§ Sessions will be video recorded!

J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 26
¡ InfoSeminar (CS545):
§ h=p://i.stanford.edu/infoseminar
§ Great industrial & academic speakers
§ Topics include data mining and large scale data
processing
¡ CS341: Project in Data Mining (Spring 2013)
§ Research project on big data
§ Groups of 3 students
§ We provide interesOng data, compuOng resources
(Amazon EC2) and mentoring
¡ We have big-‐data RA posi@ons open!
§ I will post details on Piazza
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 27
¡ 3 To-‐do items for you:
§ Register to Piazza
§ Complete HW0: Hadoop tutorial
§ HW0 should take your about 1 hour to complete
(Note this is a “toy” homework to get you started. Real
homeworks will be much more challenging and longer)
§ Register to Gradiance and complete the ﬁrst quiz
§ Use your SUNet ID to register! (so we can match grading records)
§ You have 7 days (sharp!) to do so
§ Quizzes typically take several hours
¡ Addi@onal details/instruc@ons at
hFp://cs246.stanford.edu
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, h=p://www.mmds.org 28

Data Mining Course Overview
No ratings yet
Data Mining Course Overview
29 pages
Big Data Analytics Course Introduction
No ratings yet
Big Data Analytics Course Introduction
28 pages
Ch01 Intro
No ratings yet
Ch01 Intro
19 pages
1 Introduction
No ratings yet
1 Introduction
55 pages
ch02 Mapreduce
No ratings yet
ch02 Mapreduce
7 pages
ch04 Streams1
No ratings yet
ch04 Streams1
4 pages
ch01 Intro
No ratings yet
ch01 Intro
45 pages
Unit 4
No ratings yet
Unit 4
60 pages
Mining Massive Datasets Preface
No ratings yet
Mining Massive Datasets Preface
17 pages
Mining of Massive Datasets: Jure Leskovec Anand Rajaraman Jeffrey D. Ullman
0% (1)
Mining of Massive Datasets: Jure Leskovec Anand Rajaraman Jeffrey D. Ullman
17 pages
ch-09 - Part 1
No ratings yet
ch-09 - Part 1
22 pages
Big Data - Spring 25 - Week01
No ratings yet
Big Data - Spring 25 - Week01
54 pages
Community Detection in Large Networks
No ratings yet
Community Detection in Large Networks
64 pages
Unit 5
No ratings yet
Unit 5
39 pages
(Ebook) Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, Je Rey D. Ullman ISBN 9781107077232, 1107077230 Ready To Read
No ratings yet
(Ebook) Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, Je Rey D. Ullman ISBN 9781107077232, 1107077230 Ready To Read
103 pages
Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman
No ratings yet
Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman
46 pages
ch07 Clustering
No ratings yet
ch07 Clustering
62 pages
Stanford - Slides Mapreduce
No ratings yet
Stanford - Slides Mapreduce
76 pages
Large-Scale Machine Learning Guide
No ratings yet
Large-Scale Machine Learning Guide
33 pages
Stanford CS246: Mining Massive Datasets
No ratings yet
Stanford CS246: Mining Massive Datasets
77 pages
L2 Linkanalysis1 2024
No ratings yet
L2 Linkanalysis1 2024
59 pages
Course Outline and Introduction
No ratings yet
Course Outline and Introduction
37 pages
MapReduce for Big Data Processing
No ratings yet
MapReduce for Big Data Processing
48 pages
Big Data Processing with MapReduce
No ratings yet
Big Data Processing with MapReduce
49 pages
BD - Lecture 3 - Decision Tree
No ratings yet
BD - Lecture 3 - Decision Tree
39 pages
Community Detection in Social Networks
No ratings yet
Community Detection in Social Networks
64 pages
Bloom Filters & Stream Algorithms
No ratings yet
Bloom Filters & Stream Algorithms
4 pages
ch05 Linkanalysis1
No ratings yet
ch05 Linkanalysis1
60 pages
1 Intro
No ratings yet
1 Intro
46 pages
CCS415-CCT416 Course Outline
No ratings yet
CCS415-CCT416 Course Outline
3 pages
MapReduce - 1
No ratings yet
MapReduce - 1
39 pages
18-Sub-Modular Functions
No ratings yet
18-Sub-Modular Functions
51 pages
Mining Data Streams 1
No ratings yet
Mining Data Streams 1
46 pages
MapReduce-Final
No ratings yet
MapReduce-Final
92 pages
4 Frequent Item Set Mining & Association Rules
No ratings yet
4 Frequent Item Set Mining & Association Rules
68 pages
ch07 Clustering
No ratings yet
ch07 Clustering
56 pages
ch06 Assocrules
No ratings yet
ch06 Assocrules
59 pages
Introduction PDF
No ratings yet
Introduction PDF
69 pages
3 Hadoop
No ratings yet
3 Hadoop
111 pages
Data Stream Mining Algorithms Explained
No ratings yet
Data Stream Mining Algorithms Explained
46 pages
Data Mining
100% (5)
Data Mining
89 pages
Intro 1
No ratings yet
Intro 1
43 pages
Association Rules and Frequent Item Sets
No ratings yet
Association Rules and Frequent Item Sets
98 pages
Mining Data Streams (Part 1) : Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman
No ratings yet
Mining Data Streams (Part 1) : Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman
46 pages
CAS CS 565, Data Mining
No ratings yet
CAS CS 565, Data Mining
30 pages
ch06 Assocrules
No ratings yet
ch06 Assocrules
110 pages
Comprehensive Data Mining Textbook
No ratings yet
Comprehensive Data Mining Textbook
24 pages
CS246: Mining Massive Datasets Jure Leskovec,: Stanford University
No ratings yet
CS246: Mining Massive Datasets Jure Leskovec,: Stanford University
56 pages
CS246 Final Exam Review Session
No ratings yet
CS246 Final Exam Review Session
48 pages
Data Mining: Ying Liu, Prof., PH.D
No ratings yet
Data Mining: Ying Liu, Prof., PH.D
57 pages
Stanford CS246: Mining Massive Datasets
No ratings yet
Stanford CS246: Mining Massive Datasets
69 pages
Data Mining and Machine Learning Insights
No ratings yet
Data Mining and Machine Learning Insights
52 pages
Lecture 27
No ratings yet
Lecture 27
21 pages
Syllabus Sem 7
No ratings yet
Syllabus Sem 7
10 pages
Data Mining Course Overview CS 583
No ratings yet
Data Mining Course Overview CS 583
22 pages
Data Mining
No ratings yet
Data Mining
26 pages
Machine Learning and Data Mining Overview
No ratings yet
Machine Learning and Data Mining Overview
40 pages
Support Machine Learning
No ratings yet
Support Machine Learning
161 pages
ICHME7 Abstract Dirk de Bock
No ratings yet
ICHME7 Abstract Dirk de Bock
1 page
Answers - Resit Exam Cognitive Psychology 13
No ratings yet
Answers - Resit Exam Cognitive Psychology 13
3 pages
Shellsort - Wikipedia
No ratings yet
Shellsort - Wikipedia
5 pages
Quicksort Algorithm and Partitioning Explained
No ratings yet
Quicksort Algorithm and Partitioning Explained
14 pages
Final Exam Solution - Test Paper Final Exam Solution - Test Paper
No ratings yet
Final Exam Solution - Test Paper Final Exam Solution - Test Paper
82 pages
Automata Theory Exam Solutions
No ratings yet
Automata Theory Exam Solutions
3 pages
WRAP - Cs RR 099
No ratings yet
WRAP - Cs RR 099
15 pages
2G1505 Automata Theory: Oo Oo
No ratings yet
2G1505 Automata Theory: Oo Oo
3 pages
Turing Machine Exercises and Solutions
No ratings yet
Turing Machine Exercises and Solutions
2 pages
Resit 070103
No ratings yet
Resit 070103
1 page
Solutions Languages Formal
No ratings yet
Solutions Languages Formal
11 pages
Automata Theory Exam Solutions
No ratings yet
Automata Theory Exam Solutions
4 pages
40 Out
No ratings yet
40 Out
80 pages
Open Source DOS Programs in Assembly
No ratings yet
Open Source DOS Programs in Assembly
6 pages
C++ - Do I Need To Call glEnableVertexAttribArray If I Use VAOs - Stack Overflow
No ratings yet
C++ - Do I Need To Call glEnableVertexAttribArray If I Use VAOs - Stack Overflow
1 page
Accessing GPU with C and OpenGL
No ratings yet
Accessing GPU with C and OpenGL
4 pages
Modern OpenGL Resources and Updates
No ratings yet
Modern OpenGL Resources and Updates
2 pages
Ibm PC - What Was The First Multiprocessor x86 Motherboard - Retrocomputing Stack Exchange
No ratings yet
Ibm PC - What Was The First Multiprocessor x86 Motherboard - Retrocomputing Stack Exchange
5 pages
Why x86 Lacks Direct IP Instruction
No ratings yet
Why x86 Lacks Direct IP Instruction
10 pages
Understanding Amorphous Metals and Alloys
No ratings yet
Understanding Amorphous Metals and Alloys
10 pages
Double Buffering in OpenGL Explained
No ratings yet
Double Buffering in OpenGL Explained
2 pages
Prof. Dr. A.C. (Alexandru) Telea - How To Find Us - Find A Member of Staff - University of Groningen
No ratings yet
Prof. Dr. A.C. (Alexandru) Telea - How To Find Us - Find A Member of Staff - University of Groningen
3 pages
SDL Double Buffering: Front vs Back Buffer Access
No ratings yet
SDL Double Buffering: Front vs Back Buffer Access
4 pages
Double Buffering on LPC1788: Solutions
No ratings yet
Double Buffering on LPC1788: Solutions
3 pages
Netherlands Prepaid SIM Guide
No ratings yet
Netherlands Prepaid SIM Guide
15 pages
European Roaming Unions - Prepaid Data SIM Card Wiki - Fandom
No ratings yet
European Roaming Unions - Prepaid Data SIM Card Wiki - Fandom
13 pages
Groningen Rental Deposit Dispute
No ratings yet
Groningen Rental Deposit Dispute
4 pages
Tenant Guide: Rental Deposits
No ratings yet
Tenant Guide: Rental Deposits
2 pages
Gratis Juridisch Advies - Het Juridisch Loket
No ratings yet
Gratis Juridisch Advies - Het Juridisch Loket
4 pages
EU Roaming Regulation 2022 Overview
No ratings yet
EU Roaming Regulation 2022 Overview
12 pages
Dual-Band Flat Antenna For Polarization Diversity With High Isolation
No ratings yet
Dual-Band Flat Antenna For Polarization Diversity With High Isolation
4 pages
DR Label Software Operation - Toc
No ratings yet
DR Label Software Operation - Toc
5 pages
Toa Presentation - Module 3 Functional Concepts and Interior Environment
No ratings yet
Toa Presentation - Module 3 Functional Concepts and Interior Environment
21 pages
How To Write Findings
0% (1)
How To Write Findings
3 pages
CIPW Norm Calculation Guide
No ratings yet
CIPW Norm Calculation Guide
5 pages
Bin Vibrator Inquiry Data Sheet
No ratings yet
Bin Vibrator Inquiry Data Sheet
1 page
Study The Performance of Dissolved Air F
No ratings yet
Study The Performance of Dissolved Air F
5 pages
KCSE 2020 Physics Paper 2 Predictions
No ratings yet
KCSE 2020 Physics Paper 2 Predictions
10 pages
Enamelled Wire Diameter Specifications
No ratings yet
Enamelled Wire Diameter Specifications
1 page
Statistics Chapter 8 Review
No ratings yet
Statistics Chapter 8 Review
4 pages
Signals & Systems 2018-19
No ratings yet
Signals & Systems 2018-19
3 pages
Precalculus
No ratings yet
Precalculus
5 pages
JNTUK R20 ML UNIT-I (Chapter-I)
No ratings yet
JNTUK R20 ML UNIT-I (Chapter-I)
9 pages
Number System Aptitude Questions
No ratings yet
Number System Aptitude Questions
23 pages
H902NXED Board Datasheet
No ratings yet
H902NXED Board Datasheet
1 page
Probabilistic Structural Mechanics
No ratings yet
Probabilistic Structural Mechanics
756 pages
CS7-IR and Continuity Test Report For Electrical Cables 2022-08-21
No ratings yet
CS7-IR and Continuity Test Report For Electrical Cables 2022-08-21
1 page
Overview of Common Input Devices
No ratings yet
Overview of Common Input Devices
7 pages
Angulated Views in Coronary Angiography
No ratings yet
Angulated Views in Coronary Angiography
26 pages
Rectangular Tank Satu Lagi
No ratings yet
Rectangular Tank Satu Lagi
1 page
Fe Sem 01 Eng Maths Syllbus
No ratings yet
Fe Sem 01 Eng Maths Syllbus
7 pages
Quants Geometry
No ratings yet
Quants Geometry
258 pages
Association Between Crowding
No ratings yet
Association Between Crowding
21 pages
Practice Questions-Work and Energy
No ratings yet
Practice Questions-Work and Energy
12 pages
Class 10 Arithmetic Progressions: Answer The Questions
No ratings yet
Class 10 Arithmetic Progressions: Answer The Questions
14 pages
R501: Versatile Fiber Optic Monitor
No ratings yet
R501: Versatile Fiber Optic Monitor
4 pages
Ex6 - SEMIBATCH REACTOR
No ratings yet
Ex6 - SEMIBATCH REACTOR
4 pages
Site Analysis
No ratings yet
Site Analysis
9 pages
Decision Theory
No ratings yet
Decision Theory
17 pages
Halogen-Free Power Cables Guide
No ratings yet
Halogen-Free Power Cables Guide
2 pages

ch01 Intro

Uploaded by

ch01 Intro

Uploaded by

Note to other teachers and users of these slides: We would be delighted if you found this our

Mining of Massive Datasets

Data Mining ≈ Big Data ≈

Community Web Decision AssociaOon

¡ For e-­‐mailing us, always use:

¡ We will post course announcements to

¡ Final exam: 40%

¡ It’s going to be fun and hard work. ☺

01/10, Thu HW1

¡ We provide some background, but

§ All sessions will be held in Thornton 102,

You might also like

¡ For e-‐mailing us, always use: