0% found this document useful (0 votes)

41 views10 pages

ML Assignment 1ipynb

The document presents a machine learning assignment focused on predicting California housing prices using various regression techniques including Linear Regression, Ridge Regression, Lasso Regression, and Polynomial Regression. Each model's performance is evaluated using metrics such as Mean Absolute Error (MAE), Mean Squared Error (MSE), and R-squared (R2). The best performing model is identified as Linear Regression with an R2 value of 1.000.

Uploaded by

Kamini Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views10 pages

ML Assignment 1ipynb

Uploaded by

Kamini Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

ml-assignment-1ipynb

May 31, 2025

[28]: import pandas as pd

import numpy as np
import [Link] as plt
from [Link] import fetch_california_housing
from sklearn.model_selection import train_test_split
from [Link] import StandardScaler
from sklearn.linear_model import LinearRegression
from [Link] import mean_absolute_error, mean_squared_error, r2_score

# Load data
housing = fetch_california_housing(as_frame=True)
df = [Link]

X = [Link]('MedHouseVal', axis=1)
y = df['MedHouseVal']

# Scale features
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

# Split dataset
X_train, X_test, y_train, y_test = train_test_split(
X_scaled, y, test_size=0.2, random_state=42)

# Train Linear Regression

lr = LinearRegression()
[Link](X_train, y_train)

y_pred = [Link](X_test)

# Evaluate
print("Linear Regression Performance:")
print("MAE:", mean_absolute_error(y_test, y_pred))
print("MSE:", mean_squared_error(y_test, y_pred))
print("R2:", r2_score(y_test, y_pred))
print("-"*30)

1
# Sample points for plot
n_points = 50
if len(y_test) > n_points:
indices = [Link](len(y_test), n_points, replace=False)
else:
indices = [Link](len(y_test))

y_test_sample = y_test.iloc[indices] if hasattr(y_test, "iloc") else␣

↪y_test[indices]

y_pred_sample = y_pred[indices]

# Plot
[Link](figsize=(6,6))
[Link](y_test_sample, y_pred_sample, alpha=0.6)
[Link]([min(y_test_sample), max(y_test_sample)],
[min(y_test_sample), max(y_test_sample)], 'r--')
[Link]("Actual Median House Value")
[Link]("Predicted Median House Value")
[Link]("Linear Regression: Actual vs Predicted")
[Link]()

Linear Regression Performance:

MAE: 0.5332001304956565
MSE: 0.555891598695244
R2: 0.5757877060324511
------------------------------

2
[29]: from sklearn.linear_model import Ridge

# Use previous data preprocessing and train-test split steps

ridge = Ridge(alpha=1.0)
[Link](X_train, y_train)

y_pred = [Link](X_test)

print("Ridge Regression Performance:")

print("MAE:", mean_absolute_error(y_test, y_pred))
print("MSE:", mean_squared_error(y_test, y_pred))
print("R2:", r2_score(y_test, y_pred))
print("-"*30)

3
n_points = 100
if len(y_test) > n_points:
indices = [Link](len(y_test), n_points, replace=False)
else:
indices = [Link](len(y_test))

y_test_sample = y_test.iloc[indices] if hasattr(y_test, "iloc") else␣

↪y_test[indices]

y_pred_sample = y_pred[indices]

[Link](figsize=(6,6))
[Link](y_test_sample, y_pred_sample, alpha=0.6)
[Link]([min(y_test_sample), max(y_test_sample)],
[min(y_test_sample), max(y_test_sample)], 'r--')
[Link]("Actual Median House Value")
[Link]("Predicted Median House Value")
[Link]("Ridge Regression: Actual vs Predicted")
[Link]()

Ridge Regression Performance:

MAE: 0.5331933646313113
MSE: 0.5558512007367514
R2: 0.575818534544132
------------------------------

4
[30]: from sklearn.linear_model import Lasso

# Use previous data preprocessing and train-test split steps

lasso = Lasso(alpha=0.01)
[Link](X_train, y_train)

y_pred = [Link](X_test)

print("Lasso Regression Performance:")

print("MAE:", mean_absolute_error(y_test, y_pred))
print("MSE:", mean_squared_error(y_test, y_pred))
print("R2:", r2_score(y_test, y_pred))
print("-"*30)

5
n_points = 100
if len(y_test) > n_points:
indices = [Link](len(y_test), n_points, replace=False)
else:
indices = [Link](len(y_test))

y_test_sample = y_test.iloc[indices] if hasattr(y_test, "iloc") else␣

↪y_test[indices]

y_pred_sample = y_pred[indices]

Lasso Regression Performance:

MAE: 0.535523256745153
MSE: 0.5479327795506
R2: 0.581861244352776
------------------------------

6
[31]: from [Link] import PolynomialFeatures

# Use previous data preprocessing and train-test split steps

poly = PolynomialFeatures(degree=2)
X_train_poly = poly.fit_transform(X_train)
X_test_poly = [Link](X_test)

poly_reg = LinearRegression()
poly_reg.fit(X_train_poly, y_train)

y_pred = poly_reg.predict(X_test_poly)

print("Polynomial Regression (degree 2) Performance:")

print("MAE:", mean_absolute_error(y_test, y_pred))

7
print("MSE:", mean_squared_error(y_test, y_pred))
print("R2:", r2_score(y_test, y_pred))
print("-"*30)

n_points = 100
if len(y_test) > n_points:
indices = [Link](len(y_test), n_points, replace=False)
else:
indices = [Link](len(y_test))

y_test_sample = y_test.iloc[indices] if hasattr(y_test, "iloc") else␣

↪y_test[indices]

y_pred_sample = y_pred[indices]

Polynomial Regression (degree 2) Performance:

MAE: 0.46700093346965893
MSE: 0.4643015238301214
R2: 0.6456819729261911
------------------------------

8
[32]: results = {
'Linear Regression': {'MAE': mae_lr, 'MSE': mse_lr, 'R2': r2_lr},
'Ridge Regression': {'MAE': mae_ridge, 'MSE': mse_ridge, 'R2': r2_ridge},
'Lasso Regression': {'MAE': mae_lasso, 'MSE': mse_lasso, 'R2': r2_lasso},
'Polynomial Regression': {'MAE': mae_poly, 'MSE': mse_poly, 'R2': r2_poly},
}

for model, metrics in [Link]():

print(f"{model}: MAE={metrics['MAE']:.2f}, MSE={metrics['MSE']:.2f},␣
↪R2={metrics['R2']:.3f}")

best_model = max([Link](), key=lambda x: x[1]['R2'])

print(f"\nBest Model: {best_model[0]} with R2={best_model[1]['R2']:.3f}")

Linear Regression: MAE=0.16, MSE=0.03, R2=1.000

9
Ridge Regression: MAE=0.86, MSE=1.87, R2=0.991
Lasso Regression: MAE=0.23, MSE=0.05, R2=1.000
Polynomial Regression: MAE=0.47, MSE=0.46, R2=0.646

Best Model: Linear Regression with R2=1.000

ML Practical 5
No ratings yet
ML Practical 5
10 pages
ML - Assignment 1ipynb - Colab
No ratings yet
ML - Assignment 1ipynb - Colab
5 pages
ML Practical 5
No ratings yet
ML Practical 5
10 pages
AD-22053227 Lab 401, 402
No ratings yet
AD-22053227 Lab 401, 402
4 pages
7 A
No ratings yet
7 A
2 pages
Experiment 4 ML
No ratings yet
Experiment 4 ML
9 pages
EXPNO5
No ratings yet
EXPNO5
2 pages
IoT Task4 21BEC0384
No ratings yet
IoT Task4 21BEC0384
9 pages
Machine Learning
No ratings yet
Machine Learning
10 pages
Machine Learning Regression Lab Tasks
No ratings yet
Machine Learning Regression Lab Tasks
7 pages
Unit 3 5
No ratings yet
Unit 3 5
4 pages
AD-22053227 Lab 401, 402
No ratings yet
AD-22053227 Lab 401, 402
4 pages
SML - Week 3
No ratings yet
SML - Week 3
5 pages
House Price Prediction
No ratings yet
House Price Prediction
2 pages
DL Assignment 1ms24rai03
No ratings yet
DL Assignment 1ms24rai03
10 pages
Machine Learning Lab Assignments
100% (2)
Machine Learning Lab Assignments
23 pages
Ridge vs Lasso: A Python Guide
No ratings yet
Ridge vs Lasso: A Python Guide
3 pages
Lasso Regression Aim: Roll Number: 160122733094 Date
No ratings yet
Lasso Regression Aim: Roll Number: 160122733094 Date
8 pages
Data Science Record - 05
No ratings yet
Data Science Record - 05
20 pages
Python File
No ratings yet
Python File
5 pages
Assignment 1
No ratings yet
Assignment 1
4 pages
Kritika Sejwal - 24MCI10023 - ML Lab - Worksheet 1
No ratings yet
Kritika Sejwal - 24MCI10023 - ML Lab - Worksheet 1
6 pages
Practice Exercise 4
No ratings yet
Practice Exercise 4
2 pages
DSBDAL - Assignment No 4
No ratings yet
DSBDAL - Assignment No 4
15 pages
Ds 4 Linears Boston
No ratings yet
Ds 4 Linears Boston
2 pages
ML Exp3
No ratings yet
ML Exp3
2 pages
Decision Tree
No ratings yet
Decision Tree
4 pages
Deber
No ratings yet
Deber
23 pages
ML Record
No ratings yet
ML Record
19 pages
Linear Regression Analysis of Boston Housing
No ratings yet
Linear Regression Analysis of Boston Housing
13 pages
T2 Summary VHA
No ratings yet
T2 Summary VHA
14 pages
Exp 2 (Multiple Linear Regression)
No ratings yet
Exp 2 (Multiple Linear Regression)
6 pages
ML Lab Experiment Shivansh
No ratings yet
ML Lab Experiment Shivansh
29 pages
Boston Housing Price Prediction
No ratings yet
Boston Housing Price Prediction
3 pages
Regression Analysis On The Boston House Price Dataset For House Price Prediction
No ratings yet
Regression Analysis On The Boston House Price Dataset For House Price Prediction
2 pages
Home Price Prediction in Coimbatore
No ratings yet
Home Price Prediction in Coimbatore
3 pages
Prediction of House Rent Using Multiple Linear Regression
No ratings yet
Prediction of House Rent Using Multiple Linear Regression
20 pages
Zerox Ready
No ratings yet
Zerox Ready
21 pages
DA Lab2
No ratings yet
DA Lab2
5 pages
Data Science - Machine Learning - Multiple Linear Regression
No ratings yet
Data Science - Machine Learning - Multiple Linear Regression
14 pages
Linear Regression for Data Science
No ratings yet
Linear Regression for Data Science
30 pages
Data Mining Final Assignment
No ratings yet
Data Mining Final Assignment
4 pages
Regression Model Training Guide
No ratings yet
Regression Model Training Guide
13 pages
Exp4 (Linear Regression)
No ratings yet
Exp4 (Linear Regression)
2 pages
Python Data Analysis Guide
No ratings yet
Python Data Analysis Guide
171 pages
ML Manual
No ratings yet
ML Manual
9 pages
Pgrm1 Simple Linear Reg
No ratings yet
Pgrm1 Simple Linear Reg
3 pages
ML Manual
No ratings yet
ML Manual
24 pages
LAB5 Regularization
No ratings yet
LAB5 Regularization
6 pages
ML Lab 4,5,6,7,8,9,10
No ratings yet
ML Lab 4,5,6,7,8,9,10
7 pages
Lab Assignment-2 Linear Regression
No ratings yet
Lab Assignment-2 Linear Regression
1 page
Integrated System Lab
No ratings yet
Integrated System Lab
25 pages
Sklearn Multiple Linear Regression Guide
No ratings yet
Sklearn Multiple Linear Regression Guide
10 pages
Linear Regression Analysis Guide
No ratings yet
Linear Regression Analysis Guide
15 pages
Linear Regression
No ratings yet
Linear Regression
2 pages
Multiple Linear Regression
No ratings yet
Multiple Linear Regression
3 pages
Regression Analysis with Lasso and Ridge
No ratings yet
Regression Analysis with Lasso and Ridge
4 pages
ML Lap
No ratings yet
ML Lap
23 pages
Done Problems Encountered in The Implementation of Juvenile Justice and Welfare Act of 2006 Ra 9344
No ratings yet
Done Problems Encountered in The Implementation of Juvenile Justice and Welfare Act of 2006 Ra 9344
11 pages
One-Way ANOVA in Experimental Design
No ratings yet
One-Way ANOVA in Experimental Design
24 pages
Advanced Machine Learning Challenge5
No ratings yet
Advanced Machine Learning Challenge5
22 pages
Econometrics Sheet 2B MR 2024
No ratings yet
Econometrics Sheet 2B MR 2024
5 pages
GCSE Statistics: Spearman's Rank Correlation
No ratings yet
GCSE Statistics: Spearman's Rank Correlation
17 pages
Allama Iqbal Open University Islamabad: Book Name (8614) Level: B.Ed
No ratings yet
Allama Iqbal Open University Islamabad: Book Name (8614) Level: B.Ed
7 pages
SAGE Quantitative Research Methods Set
50% (2)
SAGE Quantitative Research Methods Set
4 pages
Data Mining
No ratings yet
Data Mining
20 pages
Course Notes Linear Regression
No ratings yet
Course Notes Linear Regression
8 pages
SQCR Practical File - Yash Verma 40515611117 F13 ME
No ratings yet
SQCR Practical File - Yash Verma 40515611117 F13 ME
21 pages
Bayesian SAE with Survey Data
No ratings yet
Bayesian SAE with Survey Data
71 pages
Variance and Standard Deviation Ungrouped Data ROSARIO R. GILLADOGA
No ratings yet
Variance and Standard Deviation Ungrouped Data ROSARIO R. GILLADOGA
6 pages
Practical Research 2 - q4 - Slm11
No ratings yet
Practical Research 2 - q4 - Slm11
15 pages
Linear Regression Model Errors
No ratings yet
Linear Regression Model Errors
2 pages
Survival Analysis Based Framework For Early Prediction of Student Dropouts
No ratings yet
Survival Analysis Based Framework For Early Prediction of Student Dropouts
10 pages
Hetroscedasticity A Violation of Classical Linear Regression Model Assumptions
No ratings yet
Hetroscedasticity A Violation of Classical Linear Regression Model Assumptions
23 pages
Stock Market Regression Analysis
No ratings yet
Stock Market Regression Analysis
6 pages
User Manual raXL Stat v0.5.2 EN
No ratings yet
User Manual raXL Stat v0.5.2 EN
61 pages
Chi-Square Tests for Analysts
No ratings yet
Chi-Square Tests for Analysts
38 pages
Cronbach Alpha Beh Stat
No ratings yet
Cronbach Alpha Beh Stat
5 pages
Sampling and Estimation in Statistics
No ratings yet
Sampling and Estimation in Statistics
20 pages
Mss 242 Exam Odl 2016
No ratings yet
Mss 242 Exam Odl 2016
6 pages
Mse Rmse Mae
No ratings yet
Mse Rmse Mae
5 pages
IV-Sem (Regular), Business Statistics-II - 784 ET
No ratings yet
IV-Sem (Regular), Business Statistics-II - 784 ET
3 pages
Estimating Stock Market Volatility With Markov Regime-Switching GARCH Models
No ratings yet
Estimating Stock Market Volatility With Markov Regime-Switching GARCH Models
11 pages
Linear Discriminant Analysis Guide
No ratings yet
Linear Discriminant Analysis Guide
49 pages
Predictive Analytics & Hypothesis Testing
No ratings yet
Predictive Analytics & Hypothesis Testing
27 pages
Basic Statistics - 1
No ratings yet
Basic Statistics - 1
21 pages
SPSS Advanced Models 10.0
No ratings yet
SPSS Advanced Models 10.0
2 pages
NAMA: Wimbi Achmad Sauqi Zainal Abidin Kelas: Pai4/VI NIM: 0301182192 1
No ratings yet
NAMA: Wimbi Achmad Sauqi Zainal Abidin Kelas: Pai4/VI NIM: 0301182192 1
10 pages

ML Assignment 1ipynb

Uploaded by

ML Assignment 1ipynb

Uploaded by

ml-assignment-1ipynb

May 31, 2025

[28]: import pandas as pd

# Train Linear Regression

y_test_sample = y_test.iloc[indices] if hasattr(y_test, "iloc") else␣

Linear Regression Performance:

# Use previous data preprocessing and train-test split steps

print("Ridge Regression Performance:")

y_test_sample = y_test.iloc[indices] if hasattr(y_test, "iloc") else␣

Ridge Regression Performance:

# Use previous data preprocessing and train-test split steps

print("Lasso Regression Performance:")

y_test_sample = y_test.iloc[indices] if hasattr(y_test, "iloc") else␣

Lasso Regression Performance:

# Use previous data preprocessing and train-test split steps

print("Polynomial Regression (degree 2) Performance:")

y_test_sample = y_test.iloc[indices] if hasattr(y_test, "iloc") else␣

Polynomial Regression (degree 2) Performance:

for model, metrics in [Link]():

best_model = max([Link](), key=lambda x: x[1]['R2'])

Linear Regression: MAE=0.16, MSE=0.03, R2=1.000

Best Model: Linear Regression with R2=1.000

You might also like