scikit-learn
▪ introduction
▪ installation/distribution
▪ essential/auxiliary libraries
▪ usage
scikit-learn
introduction---
scikit-learn (also known as sklearn) is a free software machine learning library for the Python programming language.
▪ free
▪ open-source
▪ constantly being developed and improved
▪ an active user community
▪ state-of-the-art machine learning algorithms
▪ provides nice documentation
▪ widely used in industry and academia
▪ a wealth of tutorials and code snippets are available online
▪ works well with many scientific Python tools
scikit-learn
dependencies---
scikit-learn relies heavily on NumPy and SciPy for its functions; moreover, it can be used more effectively with other auxiliary packages:
▪ for scientific computation: NumPy, SciPy
▪ for plotting: matplotlib
▪ for interactive development: IPython, Jupyter Notebook
scikit-learn
installation---
▪ can be installed independently
▪ (recommended) can be installed via a number of Python distributions:
  1. Anaconda (free; recommended)
  2. Enthought Canopy (not free)
  3. Python(x,y) (free)
⇛ if you install any of these Python distributions, scikit-learn comes packaged with it.
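A quick way to confirm that scikit-learn is available, whichever route you installed it by (a minimal sketch, run in any Python shell):

import sklearn

# print the installed version; an ImportError here means
# scikit-learn is not installed in the active environment
print(sklearn.__version__)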
scikit-learn
anaconda---
a Python distribution for large-scale data processing, predictive analytics, and scientific computing
⇛ comes with:
▪ NumPy
▪ SciPy
▪ matplotlib
▪ pandas
▪ IPython
▪ Jupyter Notebook
▪ scikit-learn
⇛ available on:
▪ Mac OS
▪ Windows
scikit-learn
libraries---
essentially required, or increase the effectiveness of scikit-learn:
⇛ NumPy
⇛ SciPy
⇛ Jupyter Notebook
⇛ matplotlib
⇛ Pandas

Jupyter Notebook
• provides an interactive environment
• runs code in the browser
• a great tool for exploratory data analysis
• widely used by data scientists
• supports many programming languages

NumPy
• the fundamental package for scientific computing in Python
• provides functionality for:
  • multidimensional arrays
  • high-level mathematical functions, e.g.,
    • linear algebra operations
    • Fourier transforms
    • pseudorandom number generators
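A small sketch of the NumPy features listed above (arrays, linear algebra, Fourier transforms, random numbers):

import numpy as np

a = np.array([[1.0, 2.0], [3.0, 4.0]])  # a 2x2 multidimensional array

print(a @ a)                     # linear algebra: matrix multiplication
print(np.linalg.inv(a))          # linear algebra: matrix inverse
print(np.fft.fft([1, 0, 1, 0]))  # Fourier transform of a small signal

rng = np.random.default_rng(seed=0)  # pseudorandom number generator
print(rng.normal(size=3))        # three samples from a standard normal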
scikit-learn
NumPy, SciPy :: strengths ::
scikit-learn
libraries--- SciPy
• a collection of functions for scientific computing
• provides, among other functionality:
  • advanced linear algebra routines
  • mathematical function optimization
  • signal processing
  • special mathematical functions
  • statistical distributions
• scikit-learn draws from SciPy's collection of functions for implementing its algorithms.
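A small sketch of the SciPy functionality listed above; scipy.sparse in particular matters for scikit-learn, which accepts sparse matrices as input:

import numpy as np
from scipy import sparse, optimize, stats

# a sparse matrix in CSR format (mostly zeros, stored compactly)
eye = sparse.csr_matrix(np.eye(4))
print(eye)

# mathematical function optimization: minimize (x - 3)^2
result = optimize.minimize_scalar(lambda x: (x - 3) ** 2)
print(result.x)             # ~3.0

# statistical distributions: P(X <= 0) for a standard normal
print(stats.norm.cdf(0.0))  # 0.5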
scikit-learn
libraries--- matplotlib
• the primary scientific plotting library in Python
• provides functions for making publication-quality visualizations:
  • line charts
  • histograms
  • scatter plots
  • and so on
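A minimal sketch of the plot types named above:

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(0, 10, 100)
rng = np.random.default_rng(0)

fig, axes = plt.subplots(1, 3, figsize=(9, 3))
axes[0].plot(x, np.sin(x))                                # line chart
axes[1].hist(rng.normal(size=500), bins=20)               # histogram
axes[2].scatter(x, np.sin(x) + 0.1 * rng.normal(size=100))  # scatter plot
plt.show()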
scikit-learn
libraries--- pandas
• a Python library for data wrangling and analysis
• built around a data structure called the DataFrame
  • a DataFrame is a table
  • has methods for manipulating this table, e.g., it allows SQL-like queries and joins on such tables
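A small sketch of a DataFrame and a SQL-like query on it:

import pandas as pd

df = pd.DataFrame({
    "name": ["Ann", "Bob", "Cal"],
    "age":  [34, 28, 45],
})

print(df[df["age"] > 30])     # SQL-like filter: SELECT * WHERE age > 30
print(df.sort_values("age"))  # one of many table-manipulation methods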
Fitting the Linear Regression Model
▪ training set: $\tau = \{(x^{(i)}, y^{(i)})\}_{i=1}^{m}$, with $x^{(i)} \in \mathbb{R}^n$, $y^{(i)} \in \mathbb{R}$
▪ model: $\hat{y}^{(i)} = w_0 + w_1 x_1^{(i)} + w_2 x_2^{(i)} + \cdots + w_n x_n^{(i)}$
▪ model parameters: $w_0, w_1, w_2, \ldots, w_n$
▪ intercept: $w_0$
▪ coefficients: $w_1, w_2, \ldots, w_n$
▪ dataset: the Boston data
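The model equation written out in code (a sketch; w0 is the intercept and w the coefficient vector):

import numpy as np

def predict(x, w0, w):
    # y_hat = w0 + w1*x1 + w2*x2 + ... + wn*xn  for one example x in R^n
    return w0 + np.dot(w, x)

After fitting, scikit-learn's LinearRegression exposes $w_0$ as lr.intercept_ and $w_1, \ldots, w_n$ as lr.coef_.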
the Boston data
• the Boston house-price data of Harrison, D. and Rubinfeld, D. L., 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol. 5, 81-102, 1978.
▪ also used in 'Regression diagnostics: Identifying Influential Data and Sources of Collinearity'
▪ the question: what influences housing prices in Boston?
CRIM ZN INDUS CHAS NOX RM AGE DIS RAD TAX PTRATIO B LSTAT MEDV
0.00632 18 2.31 0 0.538 6.575 65.2 4.09 1 296 15.3 396.9 4.98 24
0.02731 0 7.07 0 0.469 6.421 78.9 4.9671 2 242 17.8 396.9 9.14 21.6
0.02729 0 7.07 0 0.469 7.185 61.1 4.9671 2 242 17.8 392.83 4.03 34.7
0.03237 0 2.18 0 0.458 6.998 45.8 6.0622 3 222 18.7 394.63 2.94 33.4
0.06905 0 2.18 0 0.458 7.147 54.2 6.0622 3 222 18.7 396.9 5.33 36.2
0.02985 0 2.18 0 0.458 6.43 58.7 6.0622 3 222 18.7 394.12 5.21 28.7
0.08829 12.5 7.87 0 0.524 6.012 66.6 5.5605 5 311 15.2 395.6 12.43 22.9
0.14455 12.5 7.87 0 0.524 6.172 96.1 5.9505 5 311 15.2 396.9 19.15 27.1
0.21124 12.5 7.87 0 0.524 5.631 100 6.0821 5 311 15.2 386.63 29.93 16.5
the Boston housing example
▪ the columns CRIM through LSTAT are the features $x_1^{(i)}, x_2^{(i)}, \ldots, x_{13}^{(i)}$; the last column, MEDV, is the target $y^{(i)}$, which the model approximates as $\hat{y}^{(i)}$ (the table is the same as on the previous slide)
▪ size: $506 \times (13 + 1)$
▪ $\hat{y}^{(i)} = w_0 + w_1 x_1^{(i)} + w_2 x_2^{(i)} + \cdots + w_{13} x_{13}^{(i)}$
▪ $f: \mathbb{R}^{13} \to \mathbb{R}$
exploring the data
Steps--- (see the code sketch after this list)
▪ import the dataset loader
▪ create the loader object
▪ explore/understand the data:
  ▪ shape of the data (rows = training examples, columns = features; here 506 rows, 13 features, 1 target)
  ▪ description (DESCR): information about the data
  ▪ feature names/values: the names of the features and their values
  ▪ target names/values
  ▪ file path
  ▪ etc.
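A sketch of these steps in code. Note: this deck predates scikit-learn 1.2, which removed load_boston; on current versions, fetch the data from OpenML instead (e.g., fetch_openml(name="boston", version=1)).

from sklearn.datasets import load_boston  # import the dataset loader

boston = load_boston()                     # create the loader object

print(boston.data.shape)     # (506, 13): rows/examples x columns/features
print(boston.target.shape)   # (506,): one target value (MEDV) per row
print(boston.feature_names)  # names of the 13 features
print(boston.DESCR[:500])    # description: information about the data
print(boston.filename)       # file path of the underlying data file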
training the algorithm
training--- (see the code sketch after this list)
▪ split the data into training (75%) and test (25%) sets
▪ import the model
▪ fit the model to the data
▪ test the model
▪ predict
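A minimal sketch of these training steps on the Boston data, continuing from the loading code above:

from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression  # import the model

# split into training (75%) and test (25%) sets; 0.25 is the default test size
X_train, X_test, y_train, y_test = train_test_split(
    boston.data, boston.target, random_state=0)

lr = LinearRegression()
lr.fit(X_train, y_train)  # fit: derives w_0 (lr.intercept_) and w_1..w_13 (lr.coef_)

print(lr.score(X_test, y_test))  # test the model: R^2 on held-out data
print(lr.predict(X_test[:3]))    # predict MEDV for three unseen examples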
the iris data
Iris Flower---
Data about 150 iris flowers to be classified into 3 varieties: setosa, versicolor, virginica.

Sepal length  Sepal width  Petal length  Petal width  species
5.1           3.3          1.7           0.5          setosa
4.9           3.0          1.4           0.2          versicolor
5.4           3.6          1.4           0.2          setosa
6.0           2.7          5.1           1.5          virginica

size: $150 \times (4 + 1)$
training the algorithm
Steps--- (see the code sketch after this list)
▪ load the data
▪ explore the data
▪ split into training and validation subsets
▪ import the optimizer
▪ fit to the data (derive the model)
▪ check accuracy of the model on the data
▪ predict with the model derived
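A sketch of these steps on the iris data, using LogisticRegression as the classifier (see the documentation link below):

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression

iris = load_iris()                         # load the data
print(iris.data.shape, iris.target_names)  # explore: (150, 4), 3 varieties

X_train, X_test, y_train, y_test = train_test_split(  # split into subsets
    iris.data, iris.target, random_state=0)

clf = LogisticRegression(max_iter=1000)  # import/create the optimizer
clf.fit(X_train, y_train)                # fit to the data (derive the model)

print(clf.score(X_test, y_test))         # check accuracy of the model
print(iris.target_names[clf.predict(X_test[:2])])  # predict with the derived model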
https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html#sklearn.linear_model.LogisticRegression

end