Machine Learning Zoomcamp
Session #1.1
Introduction to
Machine Learning
DataTalks.Club
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Imagine we have a car classifieds website
Pictures taken from olx.ua
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
I want to
sell my
car
John Torcasio/unsplash (source)
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
📱
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
📱
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
🤔
📱
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
How can we help our user select the best price?
🤔
📱
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
What do we know about cars?
Price
$1.1k
$0.6k
$23k
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
What do we know about cars?
Year Price
1995 $1.1k
1980 $0.6k
2016 $23k
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
What do we know about cars?
Year Make Price
1995 GAZ $1.1k
1980 VAZ $0.6k
2016 BWM $23k
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
What do we know about cars?
Year Make Mileage Price
1995 GAZ 200.000 $1.1k
1980 VAZ 100.000 $0.6k
2016 BWM 5.000 $23k
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
What do we know about cars?
Year Make Mileage ... Price
1995 GAZ 200.000 ... $1.1k
1980 VAZ 100.000 ... $0.6k
2016 BWM 5.000 ... $23k
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Year Make Mileage ... Price
1995 GAZ 200.000 ... $1.1k
👷
1980 VAZ 100.000 ... $0.6k
2016 BWM 5.000 ... $23k
Using this information, an expert can determine the price
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
DATA 👷 PATTERNS
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
DATA 👷 PATTERNS
DATA ML PATTERNS
If an expert can, so can a model!
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Year Make Mileage ... Price
1995 GAZ 200.000 ... $1.1k
1980 VAZ 100.000 ... $0.6k
2016 BWM 5.000 ... $23k
“Features” “Target”
what we know about cars what we want to predict
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Year Make Mileage ... Price
1995 GAZ 200.000 ... $1.1k
1980 VAZ 100.000 ... $0.6k
2016 BWM 5.000 ... $23k
🚗 ... ... ... ... ...
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Machine Learning
Year Make Mileage ... Price
1995 GAZ 200.000 ... $1.1k
1980 VAZ 100.000 ... $0.6k train
model
2016 BWM 5.000 ... $23k
🚗 ... ... ... ... ...
“Features” “Target”
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Using a model
Year Make Mileage ... Price Price
1995 GAZ 200.000 ... $1.1k $1.5k
1980 VAZ 100.000 ... $0.6k predict $0.4k
model
2016 BWM 5.000 ... $23k $20k
🚗 ... ... ... ... ... ...
“Features” “Target” “Predictions”
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
�
�
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Year 1995
Make GAZ
📱 model
Mileage 200.000
... ...
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Year 1995
Make GAZ
📱 model
Mileage 200.000
... ...
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
🥳
📱
DataTalks.Club — mlzoomcamp.com — @Al_Grigor Summary
Model training
Features
ML Model
Target
DataTalks.Club — mlzoomcamp.com — @Al_Grigor Summary
Model training
Year Make Mileage ...
1995 GAZ 200.000 ...
1980 VAZ 100.000 ...
2016 BWM 5.000 ...
... ... ... ...
ML Model
Price
$1.1k
$0.6k
$23k
...
DataTalks.Club — mlzoomcamp.com — @Al_Grigor Summary
Predictions
Features Model Predictions
DataTalks.Club — mlzoomcamp.com — @Al_Grigor Summary
Predictions
Year Make Mileage ... Price
1996 Volvo 100.000 ... $1.1k
1991 GAZ 50.000 ... Model $0.6k
2018 Audi 2.000 ... $23k
... ... ... ... ...
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Next
Machine Learning vs Rule-Based System
● Spam detection example