Linear Regression

Anshu Pandey

What is Linear Regression?

• A supervised learning algorithm that learns from a set of training samples.

• It estimates the relationship between a dependent variable (target/label) and one or more independent variables (predictors).
Linear Regression
• Univariate Linear Regression
• Multivariate Linear Regression
• Polynomial Linear Regression
Univariate Linear Regression
• During training, the regression line progressively fits the data better.
Housing Prices Prediction

Area (sq ft)   Price (INR)
1200           20,00,000
1800           42,00,000
3200           44,00,000
3800           25,00,000
4200           62,00,000

[Scatter plot: price in lakh (INR) vs. area in 1000 sq ft]
Housing Prices Prediction
y: dependent variable, criterion variable, or regressand (here, Price).
x: independent variable, predictor variable, or regressor (here, Area).

[Same table and scatter plot as above, with Area as x and Price as y]
Housing Prices Prediction: Linear Regression in One Variable

[Same data as above, now shown with a regression line fitted through the scatter plot]
Variables affecting Regression Equation

[Figure: housing-price scatter plot with fitted regression line, illustrating the variables of the regression equation]
Regression Equation:
h(x) = θ0 + θ1 · x

Parameters: θ0 (intercept) and θ1 (slope).

Cost Function (mean squared error):
J(θ0, θ1) = (1/2m) · Σ_{i=1..m} (h(x(i)) - y(i))²

Goal: choose θ0 and θ1 to minimize J(θ0, θ1).
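The cost function in a minimal NumPy sketch (the function name and the toy data, taken from the housing slides above, are illustrative):

import numpy as np

def cost(theta0, theta1, x, y):
    # J(theta0, theta1): mean squared error over m training samples
    m = len(y)
    predictions = theta0 + theta1 * x        # h(x) for every sample
    return np.sum((predictions - y) ** 2) / (2 * m)

x = np.array([1.2, 1.8, 3.2, 3.8, 4.2])     # area in 1000 sq ft
y = np.array([20, 42, 44, 25, 62])          # price in lakh INR
print(cost(0.0, 10.0, x, y))                # cost of the candidate line h(x) = 10x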
Gradient Descent Algorithm

Repeat until convergence:
θj := θj - α · ∂J(θ0, θ1)/∂θj   (update j = 0 and j = 1 simultaneously)
where α is the learning rate.
Gradient Descent for the Linear Regression Model

Substituting the derivatives of J gives the per-iteration updates:
θ0 := θ0 - α · (1/m) · Σ (h(x(i)) - y(i))
θ1 := θ1 - α · (1/m) · Σ (h(x(i)) - y(i)) · x(i)
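A minimal NumPy sketch of these updates (batch gradient descent; the learning rate, iteration count, and toy housing data are illustrative):

import numpy as np

def gradient_descent(x, y, alpha=0.01, iterations=1000):
    # Fit h(x) = theta0 + theta1*x by repeatedly applying the updates above
    m = len(y)
    theta0, theta1 = 0.0, 0.0
    for _ in range(iterations):
        error = (theta0 + theta1 * x) - y        # h(x_i) - y_i for every sample
        theta0 -= alpha * np.sum(error) / m      # simultaneous update...
        theta1 -= alpha * np.sum(error * x) / m  # ...of both parameters
    return theta0, theta1

x = np.array([1.2, 1.8, 3.2, 3.8, 4.2])   # area in 1000 sq ft
y = np.array([20, 42, 44, 25, 62])        # price in lakh INR
theta0, theta1 = gradient_descent(x, y)
print("h(x) = %.2f + %.2f x" % (theta0, theta1))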
Univariate Linear Regression: Process Visualization
[Figure: the regression line improving over gradient-descent iterations]
Objective of Linear Regression
• Establish whether there is a relationship between two variables.
  Examples: housing prices and house area; number of hours of study and marks obtained; income and spending.
• Predict new possible values.
  Examples: predicting house prices in a particular month from house area; predicting likely marks from hours studied; forecasting sales for the next 3 months.
LINEAR REGRESSION USE CASES

Real Estate: model residential home prices as a function of the home's living area, bathrooms, number of bedrooms, and lot size.

Medicine: analyze the effect of a proposed radiation treatment on reducing tumor sizes, based on patient attributes such as age or weight.

Demand Forecasting: predict demand for goods and services; for example, restaurant chains can predict the quantity of food needed depending on the weather.

Marketing: predict a company's sales based on the previous month's sales and the company's stock prices.
Programming with Python

Simple Linear Regression


Import the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
Import dataset

dataset = pd.read_csv('salary_data.csv')
X = dataset.iloc[:, :-1].values
y = dataset.iloc[:, 1].values
Train test split

from sklearn.model_selection import train_test_split

xtrain, xtest, ytrain, ytest = train_test_split(X, y, test_size=0.2, random_state=0)
Simple Linear Regression

from sklearn import linear_model

alg = linear_model.LinearRegression()
alg.fit(xtrain, ytrain)
Predicting the test results

ypred = alg.predict(xtest)
Visualizing the training results
plt.scatter(xtrain, ytrain, color='g')
plt.plot(xtrain, alg.predict(xtrain), 'r')
plt.title("Training set")
plt.xlabel("Experience")
plt.ylabel("Salary")
plt.show()
Visualizing the test results
plt.scatter(xtest, ytest, color='g')
plt.plot(xtest, alg.predict(xtest), 'r')
plt.title("Test set")
plt.xlabel("Experience")
plt.ylabel("Salary")
plt.show()
Test Score (R² on test data)
accuracy = alg.score(xtest, ytest)   # for a regressor, score returns R², not classification accuracy
print(accuracy)
Coefficient and intercept value
# print the coefficient (slope)
print(alg.coef_)
# print the intercept value
print(alg.intercept_)
Performance Analysis
from sklearn.metrics import mean_squared_error, r2_score
# The mean squared error
print("Mean squared error: %.2f" % mean_squared_error(ytest, ypred))
# Explained variance score: 1 is perfect prediction
print('Variance score: %.2f' % r2_score(ytest, ypred))
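RMSE, the square root of MSE, is a common companion metric because it is in the same units as the target; a minimal sketch reusing the import above:

import numpy as np
rmse = np.sqrt(mean_squared_error(ytest, ypred))
print("Root mean squared error: %.2f" % rmse)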

Programming with Python

Multivariate Linear Regression


One Hot Encoding
When some inputs are categories (e.g. gender) rather than numbers (e.g. age), we need to represent the category values as numbers so they can be used in our linear regression equations.
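Before the scikit-learn route on the later slides, a minimal pandas sketch of the same idea (pd.get_dummies creates one 0/1 column per category; the toy values follow the table on the next slide):

import pandas as pd

df = pd.DataFrame({'Salary': [192451, 118450, 258254],
                   'State': ['New York', 'California', 'California']})
print(pd.get_dummies(df, columns=['State']))   # adds State_California and State_New York columns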

Dummy Variables

Salary    Credit Score   Age   State        |  New York   California
192,451   485            42    New York     |  1          0
118,450   754            35    California   |  0          1
258,254   658            28    California   |  0          1
200,123   755            48    New York     |  1          0
152,485   654            52    California   |  0          1
Encoding Categorical Data
from sklearn.preprocessing import LabelEncoder
from sklearn.preprocessing import OneHotEncoder
labelencoder = LabelEncoder()
# considering X is the dataset from the slide above
# 3 is the column index of State
X[:, 3] = labelencoder.fit_transform(X[:, 3])
# Note: categorical_features was deprecated and later removed from scikit-learn;
# see the ColumnTransformer sketch below for the modern equivalent
onehotencoder = OneHotEncoder(categorical_features=[3])
X = onehotencoder.fit_transform(X).toarray()
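A minimal sketch of the modern equivalent in recent scikit-learn releases, assuming as above that column 3 of X holds the state:

from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder

ct = ColumnTransformer(
    [('state', OneHotEncoder(), [3])],   # one-hot encode column 3 only
    remainder='passthrough')             # pass the other columns through unchanged
X = ct.fit_transform(X)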
Avoiding the Dummy Variable Trap
• X = X[:, 1:]

• NOTE: if a categorical variable produces n dummy variables, remove one of them to avoid the dummy variable trap (perfect multicollinearity). The linear regression implementations in R and Python take care of this, but there is no harm in removing it ourselves.
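With pandas the trap can be avoided at encoding time; a minimal sketch using the drop_first flag:

import pandas as pd

df = pd.DataFrame({'State': ['New York', 'California', 'California']})
# drop_first=True emits n-1 dummy columns per variable, avoiding the trap
print(pd.get_dummies(df, columns=['State'], drop_first=True))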
Feature Scaling

Standardization: z = (x - μ) / σ (rescales each feature to zero mean and unit variance).
Normalization: x' = (x - min) / (max - min) (rescales each feature to the [0, 1] range).
Standard Scale using sklearn
from sklearn.preprocessing import StandardScaler
sc_x = StandardScaler()
sc_y = StandardScaler()
X_std = sc_x.fit_transform(X)
# StandardScaler expects a 2-D array, so reshape the 1-D target first
y_std = sc_y.fit_transform(y.reshape(-1, 1))
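The slides above cover standardization; for the normalization branch, a minimal sketch with sklearn's MinMaxScaler, assuming the same feature matrix X:

from sklearn.preprocessing import MinMaxScaler
mm = MinMaxScaler()
X_norm = mm.fit_transform(X)   # each feature rescaled to the [0, 1] range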

Boston housing data
In [1]: boston = pd.read_csv('[Link]')
In [2]: print(boston.head())
      CRIM    ZN  INDUS  CHAS    NOX     RM   AGE     DIS  RAD    TAX  \
0  0.00632  18.0   2.31     0  0.538  6.575  65.2  4.0900    1  296.0
1  0.02731   0.0   7.07     0  0.469  6.421  78.9  4.9671    2  242.0
2  0.02729   0.0   7.07     0  0.469  7.185  61.1  4.9671    2  242.0
3  0.03237   0.0   2.18     0  0.458  6.998  45.8  6.0622    3  222.0
4  0.06905   0.0   2.18     0  0.458  7.147  54.2  6.0622    3  222.0

   PTRATIO       B  LSTAT  MEDV
0     15.3  396.90   4.98  24.0
1     17.8  396.90   9.14  21.6
2     17.8  392.83   4.03  34.7
3     18.7  394.63   2.94  33.4
4     18.7  396.90   5.33  36.2
Creating feature and target arrays
In [3]: X = boston.drop('MEDV', axis=1).values
In [4]: y = boston['MEDV'].values
Predicting house value from a single feature
In [5]: X_rooms = X[:, 5]
In [6]: type(X_rooms), type(y)
Out[6]: (numpy.ndarray, numpy.ndarray)
In [7]: y = y.reshape(-1, 1)
In [8]: X_rooms = X_rooms.reshape(-1, 1)
Plotting house value vs. number of rooms
In [9]: plt.scatter(X_rooms, y)
In [10]: plt.ylabel('Value of house /1000 ($)')
In [11]: plt.xlabel('Number of rooms')
In [12]: plt.show()
[Figure: scatter plot of house value vs. number of rooms]
Fitting a regression model
In [13]: from numpy import linspace
In [14]: from sklearn import linear_model
In [15]: alg = linear_model.LinearRegression()
In [16]: alg.fit(X_rooms, y)
In [17]: k = linspace(min(X_rooms), max(X_rooms)).reshape(-1, 1)
In [18]: plt.scatter(X_rooms, y, color='blue')
In [19]: plt.plot(k, alg.predict(k), 'b', linewidth=3)
In [20]: plt.show()
[Figure: fitted regression line over the house value vs. rooms scatter plot]
Linear regression on all features
In [1]: from sklearn.model_selection import train_test_split
In [2]: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)
In [3]: alg2 = linear_model.LinearRegression()
In [4]: alg2.fit(X_train, y_train)
In [5]: y_pred = alg2.predict(X_test)
In [6]: alg2.score(X_test, y_test)
Out[6]: 0.71122600574849526
Cross Validation (5 folds)

Split 1: Fold 1 = test set, Folds 2-5 = training → Metric 1
Split 2: Fold 2 = test set, remaining folds = training → Metric 2
Split 3: Fold 3 = test set, remaining folds = training → Metric 3
Split 4: Fold 4 = test set, remaining folds = training → Metric 4
Split 5: Fold 5 = test set, remaining folds = training → Metric 5
Cross-validation and model performance

• 5 folds = 5-fold CV
• 10 folds = 10-fold CV
• k folds = k-fold CV
• More folds = more computationally expensive
Cross-validation in scikit-learn
In [1]: from sklearn.model_selection import cross_val_score
In [2]: alg = linear_model.LinearRegression()
In [3]: cv_results = cross_val_score(alg, X, y, cv=5)
In [4]: print(cv_results)
[ 0.63919994  0.71386698  0.58702344  0.07923081 -0.25294154]
In [5]: np.mean(cv_results)
Out[5]: 0.35327592439587058
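Changing the number of folds is just a parameter change (10-fold costs roughly twice as much as 5-fold); a minimal sketch with the same estimator:

In [6]: cv_results_10 = cross_val_score(alg, X, y, cv=10)
In [7]: print(np.mean(cv_results_10))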

Overfitting & Generalisation
• As we train our model to fit the training data more and more closely, it may become worse at handling new test data we feed to it later.
• This is known as "over-fitting" and results in increased generalization error.

• Large coefficients often lead to overfitting.

• Penalizing large coefficients: Regularization.
How do we minimize generalization error?
• Collect as much sample data as possible.
• Use a random subset of our sample data for training.
• Use the remaining sample data to test how well our model copes with data it was not trained on.
L1 Regularisation (Lasso)
(Least Absolute Shrinkage and Selection Operator)

• Having a large number of samples (n) relative to the number of dimensions (d) increases the quality of our model.
• One way to reduce the effective number of dimensions is to keep those that contribute most to the signal and ignore those that mostly act as noise.
• L1 regularization achieves this by adding a penalty that drives the weights of the noise dimensions to 0.
• L1 regularisation thus encourages a sparse weight vector in which few weights are non-zero and many are zero.
L1 Regularisation (Lasso)
• The LASSO adds the absolute values of the weights to the least-squares cost function: J(w) = Σᵢ (yᵢ - ŷᵢ)² + λ · Σⱼ |wⱼ|
• Depending on the regularization strength, certain weights become exactly zero, which also makes the LASSO useful as a supervised feature-selection technique.
• A limitation of the LASSO is that it selects at most n variables when the number of features m exceeds the number of samples n.
Lasso regression in scikit-learn
In [1]: from sklearn.linear_model import Lasso
In [2]: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)
In [3]: lasso = Lasso(alpha=0.1, normalize=True)  # normalize was removed in scikit-learn 1.2; scale features beforehand instead
In [4]: lasso.fit(X_train, y_train)
In [5]: lasso_pred = lasso.predict(X_test)
In [6]: lasso.score(X_test, y_test)
Out[6]: 0.59502295353285506
L2 Regularisation (Ridge)
• Another way to reduce the complexity of our model and prevent overfitting to outliers is L2 regularization, also known as ridge regression.
• In L2 regularization we introduce an additional term to the cost function that penalizes large weights, shrinking them toward zero.
L2 Regularisation (Ridge)
• Ridge regression is an L2-penalized model in which we simply add the squared sum of the weights to our least-squares cost function: J(w) = Σᵢ (yᵢ - ŷᵢ)² + λ · Σⱼ wⱼ²
• Increasing the value of the hyperparameter λ increases the regularization strength and shrinks the weights of our model.
Ridge regression in scikit-learn
In [1]: from sklearn.linear_model import Ridge
In [2]: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)
In [3]: ridge = Ridge(alpha=0.1, normalize=True)  # normalize was removed in scikit-learn 1.2; scale features beforehand instead
In [4]: ridge.fit(X_train, y_train)
In [5]: ridge_pred = ridge.predict(X_test)
In [6]: ridge.score(X_test, y_test)
Out[6]: 0.69969382751273179
L1 & L2 Regularisation (Elastic Net)
• L1 regularisation minimises the impact of dimensions that have low weights and are thus largely "noise".
• L2 regularisation minimises the impact of outliers in our training data.
• L1 & L2 regularisation can be used together; the combination is referred to as Elastic Net regularisation.
• Unlike ridge, which has a closed-form solution, the L1 penalty is not differentiable at zero, so we cannot solve for w directly; Lasso and Elastic Net weights are found by iterative optimisation such as coordinate descent. A minimal code sketch follows.
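The deck includes no Elastic Net code; a minimal scikit-learn sketch under the same train/test split used on the Lasso and Ridge slides (the alpha and l1_ratio values are illustrative):

from sklearn.linear_model import ElasticNet

# l1_ratio blends the penalties: 1.0 = pure Lasso, 0.0 = pure Ridge
enet = ElasticNet(alpha=0.1, l1_ratio=0.5)
enet.fit(X_train, y_train)
print(enet.score(X_test, y_test))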
Lasso regression for feature selection

• Can be used to select important features of a dataset

• Shrinks the coefficients of less important features to exactly 0.

Lasso regression for feature selection
In [1]: from sklearn.linear_model import Lasso
In [2]: names = boston.drop('MEDV', axis=1).columns
In [3]: lasso = Lasso(alpha=0.1)
In [4]: lasso_coef = lasso.fit(X, y).coef_
In [5]: plt.plot(range(len(names)), lasso_coef)
In [6]: plt.xticks(range(len(names)), names, rotation=60)
In [7]: plt.ylabel('Coefficients')
In [8]: plt.show()
[Figure: Lasso coefficient values plotted for each feature]
Practice Datasets

[Link]

[Link]

[Link]

[Link]

For more information or to set up an appointment, please contact us today.
jointact@[Link]

