Week 3 - Linear Regression
(OLS = ordinary least squares regression)
(e.g. linear regression & basis function regression)
0. Initial set-up
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

sns.set_style('darkgrid')
plt.rcParams['figure.dpi'] = 108
1. The Boston Housing dataset
input = data about towns (506 instances, 13 features)
predicted output = median house value ('MEDV', in $1000s of USD)

# load data, convert to a pandas dataframe
# (note: load_boston was removed in scikit-learn 1.2, so this requires an older version)
from sklearn.datasets import load_boston
boston = load_boston()
ds = pd.DataFrame(boston.data, columns=boston.feature_names)
y = pd.Series(boston.target, name='MEDV')
ds.head()

# focus on a single feature, LSTAT, as the input
features = ['LSTAT']
# response variable (output) = 'MEDV'
#ds['LSTAT'] = ds['LSTAT'].apply(lambda x: x/100.)
for f in features:
    plt.figure()
    plt.scatter(ds[f], y, marker='.')
    plt.xlabel(f)
    plt.ylabel(r'Median House Value ($\times 10^3$ USD)')

# randomly split data into a training set & a test set
# (to assess generalisation error)
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(ds, y, test_size=0.2, random_state=90051)
print("Training set has {} instances. Test set has {} instances.".format(X_train.shape[0], X_test.shape[0]))
# Training set has 404 instances. Test set has 102 instances.

# select the subset of features used for the single-feature model
X_train_s = X_train[features].values
X_test_s = X_test[features].values

2. Linear algebra solutions
Fit a linear regression to the single-feature data.
(How? solve for the weights in closed form with linear algebra, i.e. the normal equations; see the sketch below.)
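A quick sketch (added for reference; standard OLS, where X denotes the design matrix with the prepended column of ones constructed in the code below) of where the closed-form solution comes from:

\min_{\mathbf{w}} \; \|\mathbf{y} - \mathbf{X}\mathbf{w}\|_2^2
\;\Rightarrow\; \nabla_{\mathbf{w}} = -2\,\mathbf{X}^\top(\mathbf{y} - \mathbf{X}\mathbf{w}) = \mathbf{0}
\;\Rightarrow\; \mathbf{X}^\top\mathbf{X}\,\hat{\mathbf{w}} = \mathbf{X}^\top\mathbf{y}
\;\Rightarrow\; \hat{\mathbf{w}} = (\mathbf{X}^\top\mathbf{X})^{-1}\mathbf{X}^\top\mathbf{y}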
# Prepend a column of 1's to the design matrices
# (this folds the bias term into the weights vector)
X_train_b = np.column_stack((np.ones_like(X_train_s), X_train_s))
X_test_b = np.column_stack((np.ones_like(X_test_s), X_test_s))
print('Design matrix shape:', X_train_s.shape)
# Design matrix shape: (404, 1)

# solve the normal equations (X^T X) w = X^T y for the weights
w = np.linalg.solve((X_train_b.T @ X_train_b), (X_train_b.T @ y_train))
print('Weights:', w)
# Weights: [34.51530004 -0.95801769]

# note: '@' is Python's matrix-multiplication operator, equivalent to np.dot()
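As an aside (not in the original workshop code), np.linalg.lstsq solves the same least-squares problem without forming X^T X explicitly, which is more robust when the design matrix is ill-conditioned; a minimal equivalent call:

# least-squares solve directly on the design matrix; should agree with w above
w_lstsq, residuals, rank, sv = np.linalg.lstsq(X_train_b, y_train, rcond=None)
print('Weights (lstsq):', w_lstsq)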
# plot the fitted line alongside the training data
# X = design matrix, w = weights vector
def predict(X, w):
    return np.dot(X, w)

X_grid = np.linspace(X_train_s.min(), X_train_s.max(), num=1001)
x = np.column_stack((np.ones_like(X_grid), X_grid))
y = predict(x, w)
plt.plot(X_grid, y, 'k-', label='Prediction')
plt.scatter(X_train_s, y_train, color='b', marker='.', label='Train')
#plt.scatter(X_test_s, y_test, color='r', marker='.', label='Test')
plt.legend()
plt.ylabel("$y$ (Median House Value)")
plt.xlabel("$x$ (LSTAT)")
plt.show()

# calculate mean squared error over the training set & the test set
def mean_squared_error(y_true, y_pred):
    return np.mean((y_pred - y_true)**2)

y_pred_train = predict(X_train_b, w)
y_pred_test = predict(X_test_b, w)

print('Train MSE:', mean_squared_error(y_pred_train, y_train))
print('Test MSE:', mean_squared_error(y_pred_test, y_test))
# Train MSE: 38.6322164416081
# Test MSE: 38.00420488101306

4. Linear regression using sklearn
# use sklearn to fit the linear regression directly
from sklearn.linear_model import LinearRegression
lr = LinearRegression().fit(X_train_s, y_train)

# lr stores the weight for the bias term separately
lr.intercept_
# 34.5153000408642
# ...and the weights for the non-bias terms
lr.coef_
# array([-0.95801769])
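These match the normal-equations solution from Part 2, which can be confirmed programmatically (a small added sanity check, assuming w from Part 2 is still in scope):

# sklearn packs the fit as (intercept_, coef_); compare against w = [bias, slope]
print(np.allclose(np.r_[lr.intercept_, lr.coef_], w))
# expected: True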
# predict with sklearn using ALL 13 features
lr_full = LinearRegression().fit(X_train, y_train)
y_pred_train = lr_full.predict(X_train)
y_pred_test = lr_full.predict(X_test)

print('Train MSE:', mean_squared_error(y_pred_train, y_train))
print('Test MSE:', mean_squared_error(y_pred_test, y_test))
# Train MSE: 20.059284291202285
# Test MSE: 30.72694987338853

# calculate mean squared error with sklearn's implementation instead
from sklearn.metrics import mean_squared_error as sk_mse
print('Train MSE:', sk_mse(y_pred_train, y_train))
print('Test MSE:', sk_mse(y_pred_test, y_test))
# Train MSE: 20.059284291202285
# Test MSE: 30.72694987338853

5. Basis expansion
basis expansion = extend linear regression by mapping the original features to new (transformed) features
(purpose: model a non-linear relationship in the original features with a model that is still linear in the new features)
(How? perform linear regression on the new features, i.e. fit $y \approx \mathbf{w}^\top \boldsymbol{\varphi}(\mathbf{x})$.)
Then the weights for the transformed features should be
$\hat{\mathbf{w}} = (\boldsymbol{\Phi}^\top \boldsymbol{\Phi})^{-1} \boldsymbol{\Phi}^\top \mathbf{y}$
(by applying the normal equation), where $\boldsymbol{\Phi}$ is the transformed design matrix.
One option for the mapping is the "polynomial basis function" of a single feature, e.g.
$\boldsymbol{\varphi}(x) = [1, x, x^2, \ldots, x^d]^\top$,
which includes the bias term as the constant 1.
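As a tiny added illustration of the transform used on the real data below, a degree-2 polynomial basis maps a single value x = 3 to [1, 3, 9]:

from sklearn.preprocessing import PolynomialFeatures
# transform one single-feature instance with a degree-2 polynomial basis
print(PolynomialFeatures(degree=2).fit_transform([[3.0]]))
# expected: [[1. 3. 9.]]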
# compute the transformed design matrix
# (purpose: get the new features by transformation)
from sklearn.preprocessing import PolynomialFeatures
degree = 2
poly = PolynomialFeatures(degree=degree)
Phi_train = poly.fit_transform(X_train_s)
Phi_test = poly.fit_transform(X_test_s)
print("Original design matrix (first 5 rows):\n", X_train_s[0:5], "\n")
print("Transformed design matrix (first 5 rows):\n", Phi_train[0:5])
# output: original vs transformed design matrix (each row gains the columns 1, x, x^2)

# linear regression on the transformed features
# (fit_intercept=False because the bias column of 1's is already included in Phi)
lr_poly = LinearRegression(fit_intercept=False).fit(Phi_train, y_train)
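To connect this back to the closed-form expression above, the same weights can be recovered by solving the normal equations on the transformed design matrix (a small added check, not part of the original workshop code):

# normal equations on Phi: (Phi^T Phi) w = Phi^T y
w_phi = np.linalg.solve(Phi_train.T @ Phi_train, Phi_train.T @ y_train)
print(w_phi)
print(lr_poly.coef_)  # should be approximately the same values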
# plot the prediction from lr_poly
X_grid = np.linspace(X_train_s.min(), X_train_s.max(), num=1001)
Phi_grid = poly.fit_transform(X_grid[:,np.newaxis])
y = lr_poly.predict(Phi_grid)
plt.plot(X_grid, y, 'k-', label='Prediction')
plt.scatter(X_train_s, y_train, color='b', marker='.', label='Train')
plt.scatter(X_test_s, y_test, color='r', marker='.', label='Test')
plt.legend()
plt.ylabel("$y$ (Median House Value)")
plt.xlabel("$x$ (LSTAT)")
plt.show()
# The graph shows the polynomial regression is a better fit than the plain linear model.

# compute mean squared error on the train set / test set
y_pred_train_poly = lr_poly.predict(Phi_train)
y_pred_test_poly = lr_poly.predict(Phi_test)
print('Train MSE for polynomial features of degree {}: {:.3f}'.format(degree, mean_squared_error(y_pred_train_poly, y_train)))
print('Test MSE for polynomial features of degree {}: {:.3f}'.format(degree, mean_squared_error(y_pred_test_poly, y_test)))
print('Train MSE using linear features only: {:.3f}'.format(mean_squared_error(lr.predict(X_train_s), y_train)))
print('Test MSE using linear features only: {:.3f}'.format(mean_squared_error(lr.predict(X_test_s), y_test)))
# output: a large decrease in train error, but a smaller decrease in test error
# adjust the degree of the polynomial used for basis expansion
# degree = 0 ~ 11
degrees = list(range(12))
models = list()
train_mses = list()  # mean squared errors
test_mses = list()

X_grid = np.linspace(min(X_train_s.min(), X_test_s.min()),
                     max(X_train_s.max(), X_test_s.max()),
                     num=1001)

# set up a blank figure to hold the grid of subplots
plt.figure(figsize=(20,16))
for i, degree in enumerate(degrees):
    plt.subplot(len(degrees)//2, 2, i+1)

    # Transform features
    poly = PolynomialFeatures(degree=degree)
    Phi_train, Phi_test = poly.fit_transform(X_train_s), poly.fit_transform(X_test_s)
    Phi_grid = poly.fit_transform(X_grid[:,np.newaxis])

    # Fit model
    lr_poly = LinearRegression().fit(Phi_train, y_train)
    models.append(lr_poly)

    # Evaluate
    train_mse = mean_squared_error(lr_poly.predict(Phi_train), y_train)
    train_mses.append(train_mse)
    test_mse = mean_squared_error(lr_poly.predict(Phi_test), y_test)
    test_mses.append(test_mse)

    # Plot the fitted curve against the train/test points
    plt.plot(X_grid, lr_poly.predict(Phi_grid), 'k', label='Prediction')
    plt.scatter(X_train_s, y_train, color='b', marker='.', label='Train')
    plt.scatter(X_test_s, y_test, color='r', marker='.', label='Test')
    plt.title('Degree {} | Train MSE {:.3f}, Test MSE {:.3f}'.format(degree, train_mse, test_mse))
    plt.legend()

plt.suptitle('Polynomial regression for different polynomial degrees', y=1.05, fontsize=32)
plt.tight_layout()
# plot "mean squared error vs polynomial degree"
plt.plot(degrees, train_mses, color='b', label='Train')
plt.plot(degrees, test_mses, color='r', label='Test')
plt.title('MSE vs. polynomial degree')
plt.ylabel('MSE')
plt.xlabel('Polynomial degree')
plt.legend()
plt.show()
6. Bonus: ridge regression
ridge regression = linear regression regularised with an L2 penalty
(purpose: manage the bias-variance trade-off, i.e. control overfitting)
(How? add a penalty term $\alpha \|\mathbf{w}\|_2^2$ to the least-squares cost, which encourages small weights; see the sketch below)
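A minimal sketch of the closed-form ridge solution (an addition to these notes; here Phi is assumed to be a design matrix that already contains the bias column, and alpha is the penalty strength):

# ridge objective: ||y - Phi w||^2 + alpha * ||w||^2
# closed-form solution: w = (Phi^T Phi + alpha * I)^{-1} Phi^T y
def ridge_weights(Phi, y, alpha):
    d = Phi.shape[1]
    return np.linalg.solve(Phi.T @ Phi + alpha * np.eye(d), Phi.T @ y)

Note that sklearn's Ridge (used below) does not penalise the intercept by default, so its coefficients will not match this formula exactly unless fit_intercept=False.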
# rescale the LSTAT feature
from sklearn.linear_model import Ridge
X_train_s = X_train_s / 100.0
X_test_s = X_test_s / 100.0

# replace "LinearRegression()" with "Ridge(alpha=0.002)" (all other code is the same as Part 5)
degrees = list(range(12))
models = list()
train_mses = list()
test_mses = list()
X_grid = np.linspace(min(X_train_s.min(), X_test_s.min()),
                     max(X_train_s.max(), X_test_s.max()),
                     num=1001)
plt.figure(figsize=(20,16))
for i, degree in enumerate(degrees):
    plt.subplot(len(degrees)//2, 2, i+1)

    # Transform features
    poly = PolynomialFeatures(degree=degree)
    Phi_train, Phi_test = poly.fit_transform(X_train_s), poly.fit_transform(X_test_s)
    Phi_grid = poly.fit_transform(X_grid[:,np.newaxis])

    # Fit model
    lr_poly = Ridge(alpha=0.002).fit(Phi_train, y_train)
    models.append(lr_poly)

    # Evaluate
    train_mse = mean_squared_error(lr_poly.predict(Phi_train), y_train)
    train_mses.append(train_mse)
    test_mse = mean_squared_error(lr_poly.predict(Phi_test), y_test)
    test_mses.append(test_mse)

    plt.plot(X_grid, lr_poly.predict(Phi_grid), 'k', label='Prediction')
    plt.scatter(X_train_s, y_train, color='b', marker='.', label='Train')
    #plt.scatter(X_test_s, y_test, color='r', marker='.', label='Test')
    plt.title('Degree {} | Train MSE {:.3f}'.format(degree, train_mse))
    plt.legend()

plt.suptitle('Polynomial ridge regression for different polynomial degrees', y=1.05, fontsize=32)
plt.tight_layout()
# after applying ridge regression, the model no longer overfits for large polynomial degrees

# plot "mean squared error vs degree" for ridge
plt.plot(degrees, train_mses, color='b', label='Train')
plt.plot(degrees, test_mses, color='r', label='Test')
plt.title('MSE vs. polynomial degree')
plt.ylabel('MSE')
plt.xlabel('Polynomial degree')
plt.legend()
plt.show()

# plot "L2 norm of weights vs degree"
w_L2 = [np.sum(m.coef_**2) for m in models]
plt.plot(degrees, w_L2)
plt.xlabel('Polynomial degree')
plt.ylabel(r'$\| \mathbf{w} \|_2^2$')
plt.show()
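The penalty strength alpha = 0.002 is simply given above; a common way to choose it (a sketch beyond this workshop's code, reusing the Phi_train left over from the last loop iteration) is cross-validation on the training set with sklearn's RidgeCV:

from sklearn.linear_model import RidgeCV
# search a log-spaced grid of alphas using RidgeCV's built-in leave-one-out style CV
ridge_cv = RidgeCV(alphas=np.logspace(-4, 2, 25)).fit(Phi_train, y_train)
print('Selected alpha:', ridge_cv.alpha_)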