
Introduction to Machine Learning

Test 1 Machine Learning

Ayoub Asri

2023-09-30

Contents
Rules
Questions
  1. KNN for regression
  2. KNN for classification
  3. KNN
  4. SVM classification plot
  5. SVM for classification

Rules
1. The student must answer all the questions and provide one file (or more than one file) containing code,
results and comments. The comments must be about the results, not the code itself.
2. Only tidymodels is accepted for the modeling stage. Any other tool is allowed in the other steps.
3. The deadline is 10 days from the moment this document was sent, which makes the deadline Thursday,
November 23rd, 2023 at 23:59:59.
4. Any new idea for commenting on the results, for pre-processing, or for improving the results or the
performance of the model will be highly rewarded.
5. Any cheating will have consequences on the final mark.

Questions
1. KNN for regression
The goal of this exercise is to fit a KNN model for a regression problem. We want to determine the price of a
car based on its characteristics.
Q1.
Split the dataset, keeping 90% for the training set.
Create a nearest-neighbor model, tuning the "neighbors" hyperparameter and using the information
contained in all the variables (if possible).
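The tidymodels workflow for this kind of task can be sketched as follows. This is only a sketch, not a full answer: the data frame name `cars_df`, the target column `price`, and the grid bounds are placeholders to be adapted to the actual dataset.

```r
library(tidymodels)

# Placeholder: replace cars_df / price with the real dataset and target
set.seed(123)
split <- initial_split(cars_df, prop = 0.9)
train <- training(split)

# KNN regression spec with the "neighbors" hyperparameter marked for tuning
knn_spec <- nearest_neighbor(neighbors = tune()) %>%
  set_engine("kknn") %>%
  set_mode("regression")

# Recipe using all variables: encode factors, then normalize (KNN is
# distance-based, so scaling matters)
knn_rec <- recipe(price ~ ., data = train) %>%
  step_dummy(all_nominal_predictors()) %>%
  step_normalize(all_numeric_predictors())

knn_wf <- workflow() %>%
  add_recipe(knn_rec) %>%
  add_model(knn_spec)

folds <- vfold_cv(train, v = 5)
knn_res <- tune_grid(knn_wf, resamples = folds,
                     grid = tibble(neighbors = 1:30))
show_best(knn_res, metric = "rmse")
```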

2. KNN for classification
We will try to fit the attrition variable using a KNN model. Load the dataset "hrt.csv".
Q1. Inspect the dataset and split it (80/20 split).
Q2. Create a KNN model to estimate the attrition variable. The student must tune the hyperparameter
"neighbors". The use of a recipe is mandatory, but the choice of recipe can vary.
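A possible starting point for the split and the recipe is sketched below. It assumes the target column in "hrt.csv" is called `attrition`; the exact column names must be checked against the file.

```r
library(tidymodels)
library(readr)

# Assumption: the target column is named "attrition"
hrt <- read_csv("hrt.csv") %>%
  mutate(attrition = as.factor(attrition))

set.seed(123)
split <- initial_split(hrt, prop = 0.8, strata = attrition)
train <- training(split)
test  <- testing(split)

# One reasonable recipe: dummy-encode factors, normalize numeric predictors
rec <- recipe(attrition ~ ., data = train) %>%
  step_dummy(all_nominal_predictors()) %>%
  step_normalize(all_numeric_predictors())

knn_spec <- nearest_neighbor(neighbors = tune()) %>%
  set_engine("kknn") %>%
  set_mode("classification")
```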

3. KNN
In this last part on KNN, we will study the effect of changing the value of "neighbors" on the resulting
estimate in a KNN regression problem.
To illustrate this aspect, we will use the data set from the file "sacramento.csv".
Q1. Plot the scatterplot of the price (target variable) vs sqft (the feature variable). Is there any relationship?
Q2. Split the data (3/4 for training).
Q3. Tune the "neighbors" hyperparameter of the KNN model using a grid of all values between 1 and 200.
Q4. Plot the values of the performance measure for the different values of the hyperparameter.
Q5. Create a function that takes an input value of "neighbors", estimates a model, and returns
(outputs) a plot containing the scatterplot of the y vs x variable plus a line plot of the estimated y (ŷ) vs x.
Use this function to try different values of "neighbors". What does this plot illustrate when we use higher
values of "neighbors"? Smaller values?
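The function asked for in Q5 can be sketched as below. It assumes a `sacramento` data frame with `price` and `sqft` columns is already loaded; column names should be adapted to the actual file.

```r
library(tidymodels)

# Assumption: `sacramento` has numeric columns `price` and `sqft`
plot_knn_fit <- function(k, data = sacramento) {
  # Fit a KNN regression with a fixed number of neighbors
  knn_fit <- nearest_neighbor(neighbors = k) %>%
    set_engine("kknn") %>%
    set_mode("regression") %>%
    fit(price ~ sqft, data = data)

  # Predict on the data sorted by sqft so the line plot is well ordered
  ordered <- arrange(data, sqft)
  preds <- bind_cols(ordered, predict(knn_fit, new_data = ordered))

  ggplot(preds, aes(sqft, price)) +
    geom_point(alpha = 0.4) +
    geom_line(aes(y = .pred), colour = "blue")
}

plot_knn_fit(1)    # small k: wiggly fit (low bias, high variance)
plot_knn_fit(200)  # large k: nearly flat fit (high bias, low variance)
```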

4. SVM classification plot


Run this code to create this fictional data set.
library(tidyverse)
library(tidymodels)

x <- rbind(matrix(rnorm(200), 100, 2), matrix(rnorm(200, mean = 3), 100, 2))
y <- matrix(c(rep(1, 100), rep(-1, 100)))

df <- x %>%
  bind_cols(y)

names(df) <- c("x1", "x2", "y")

df <- df %>%
  mutate(y = as.factor(y))

Q1. Plot the scatterplot of x2 vs x1 and colour the points by the target variable (yellow for -1 and
green for +1).
Q2. Create an SVM model with an RBF kernel, tune its parameters, and finally draw a classification plot
using the final model.
Q3. Interpret each colour and shape in the resulting classification plot.
Q4. State the performance measures of this model (on the training set!).
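Using the `df` created above, the tuning and a decision-region plot could be sketched like this. The grid resolution and the choice of accuracy as the selection metric are illustrative choices, not requirements.

```r
library(tidymodels)

# RBF-kernel SVM with both hyperparameters marked for tuning
svm_spec <- svm_rbf(cost = tune(), rbf_sigma = tune()) %>%
  set_engine("kernlab") %>%
  set_mode("classification")

svm_wf <- workflow() %>%
  add_formula(y ~ x1 + x2) %>%
  add_model(svm_spec)

set.seed(123)
folds   <- vfold_cv(df, v = 5)
svm_res <- tune_grid(svm_wf, resamples = folds, grid = 10)

best  <- select_best(svm_res, metric = "accuracy")
final <- finalize_workflow(svm_wf, best) %>% fit(df)

# One way to draw a classification plot: predict over a dense grid
# of (x1, x2) values and shade by predicted class
grid <- crossing(x1 = seq(min(df$x1), max(df$x1), length.out = 100),
                 x2 = seq(min(df$x2), max(df$x2), length.out = 100))
grid <- bind_cols(grid, predict(final, new_data = grid))

ggplot() +
  geom_tile(data = grid, aes(x1, x2, fill = .pred_class), alpha = 0.3) +
  geom_point(data = df, aes(x1, x2, colour = y))
```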

5. SVM for classification


Use the same data set "hrt.csv" from exercise 2.
Q1. Split the data (80/20 split).
Q2. Define an appropriate recipe.
Q3. Fit 3 SVM models with different kernels (use arbitrary hyperparameter values for each model).
Which is the best model?
Q4. Use an SVM model with an RBF kernel and tune its hyperparameters. Present the predictions and the
performance measures for the best model.
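For Q3, one way to compare the three kernels is sketched below. It assumes the split and recipe from Q1–Q2 are stored as `train`, `test` and `rec`, the target is `attrition`, and the fixed hyperparameter values are arbitrary placeholders.

```r
library(tidymodels)

# Three SVM specs with different kernels and arbitrary fixed hyperparameters
specs <- list(
  linear = svm_linear(cost = 1),
  poly   = svm_poly(cost = 1, degree = 2),
  rbf    = svm_rbf(cost = 1, rbf_sigma = 0.1)
)

# Assumption: `rec` is the recipe from Q2, `train`/`test` the split from Q1
fit_one <- function(spec) {
  workflow() %>%
    add_recipe(rec) %>%
    add_model(spec %>% set_engine("kernlab") %>% set_mode("classification")) %>%
    fit(train)
}

fits <- map(specs, fit_one)

# Compare test-set accuracy across kernels
map_dfr(fits,
        ~ augment(.x, new_data = test) %>%
            accuracy(truth = attrition, estimate = .pred_class),
        .id = "kernel")
```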
