
ML Exp 2


PART A

(PART A: TO BE REFERRED TO BY STUDENTS)

Experiment No. 2
A.1 Aim:

To implement Linear Regression.

A.2 Prerequisite:
Python Basic Concepts

A.3 Outcome:
Students will be able to implement Linear Regression.

A.4 Theory:

Machine learning, a subset of artificial intelligence (AI), plays a prominent role in our daily
lives. Data science engineers and developers working in various domains widely use machine
learning algorithms to simplify their tasks.

What is a Regression Problem?

The majority of machine learning algorithms fall under the supervised learning category. In
supervised learning, an algorithm learns to predict a result from previously observed input values
and the outputs recorded for them. Suppose we have an input variable ‘x’ and an output variable
‘y’, where y is a function of x (y = f(x)). A supervised learning algorithm reads pairs of ‘x’ and
‘y’ values so that it can later predict ‘y’ accurately for a new value of ‘x’. A regression problem
is one in which the output variable takes a real or continuous value. Regression tries to draw the
line of best fit through the observed data points.

Linear Regression
Linear regression is a quite simple statistical regression method used for predictive analysis. It
models the relationship between continuous variables. Because it assumes a linear relationship
between the independent variable (X-axis) and the dependent variable (Y-axis), it is called linear
regression. If there is a single input variable (x), it is called simple linear regression; if
there is more than one input variable, it is called multiple linear regression. The linear
regression model gives a sloped straight line describing the relationship between the variables.
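
The difference between simple and multiple linear regression can be sketched in a few lines. The function below is a minimal illustration and not part of the lab code; the name `predict` and the coefficient values are assumptions chosen only for demonstration.

```python
def predict(intercept, coefficients, features):
    """Return intercept + sum(coefficient_i * feature_i) for one sample."""
    if len(coefficients) != len(features):
        raise ValueError("coefficients and features must have the same length")
    return intercept + sum(c * x for c, x in zip(coefficients, features))


# Simple linear regression: a single input variable.
simple = predict(1.0, [2.0], [3.0])  # 1 + 2*3

# Multiple linear regression: more than one input variable.
multiple = predict(1.0, [2.0, 0.5], [3.0, 4.0])  # 1 + 2*3 + 0.5*4
```

The same function covers both cases; only the number of coefficients changes.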

To calculate the best-fit line, linear regression uses the traditional slope-intercept form:

y = a0 + a1*x

where
y = dependent variable.
x = independent variable.
a0 = intercept of the line.
a1 = linear regression coefficient (slope of the line).
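
For a single input variable, the intercept a0 and the coefficient a1 of the best-fit line can be estimated by ordinary least squares. The sketch below shows one way to do this in plain Python; the function name `fit_line` and the sample points are illustrative assumptions, not taken from the lab manual.

```python
def fit_line(xs, ys):
    """Estimate (a0, a1) for y = a0 + a1*x by ordinary least squares."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations of x.
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; the slope is undefined")
    a1 = sxy / sxx
    a0 = y_mean - a1 * x_mean
    return a0, a1


# Illustrative points chosen to lie exactly on y = 1 + 2*x.
a0, a1 = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
```

Because the sample points lie exactly on y = 1 + 2*x, the fit recovers a0 = 1 and a1 = 2.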

Need for Linear Regression

As mentioned above, linear regression estimates the relationship between a dependent variable
and an independent variable. Let’s understand this with an easy example:

Let’s say we want to estimate the salary of an employee based on years of experience. We have
recent company data, which indicates the relationship between experience and salary. Here,
years of experience is the independent variable and the salary of an employee is the dependent
variable, because the salary depends on the experience of the employee. Using this insight, we
can predict the future salary of an employee based on current and past information.

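A regression line can show a positive linear relationship (y increases as x increases, so the
slope a1 is positive) or a negative linear relationship (y decreases as x increases, so the slope
a1 is negative).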
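
Whether a relationship is positive or negative can be read from the sign of the fitted slope. The sketch below is a minimal illustration; the function name `relationship_direction` and the sample values are assumptions, not part of the lab manual.

```python
def relationship_direction(xs, ys):
    """Classify the linear relationship between xs and ys by the slope's sign."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # When the x values are not all identical, the sign of this sum
    # equals the sign of the least-squares slope a1.
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    if sxy > 0:
        return "positive"
    if sxy < 0:
        return "negative"
    return "none"


rising = relationship_direction([1, 2, 3], [2, 4, 6])
falling = relationship_direction([1, 2, 3], [6, 4, 2])
```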

PART B
(PART B: TO BE COMPLETED BY STUDENTS)

(Students must submit the soft copy as per the following segments within two hours of the
practical. The soft copy must be uploaded to Blackboard or, if Blackboard access is not available,
emailed to the concerned lab in-charge faculty at the end of the practical.)

Roll. No. BE-A10 Name: Nishad Sutar

Class: BE-Comps A Batch: A1
Date of Experiment: 14/07/2025 Date of Submission: 21/07/2025
Grade:

B.1 Software Code written by student:

# Import required libraries
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.datasets import fetch_california_housing
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import mean_squared_error, r2_score

# Load the dataset

housing = fetch_california_housing()
df = pd.DataFrame(housing.data, columns=housing.feature_names)
df['MedianHouseValue'] = housing.target

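# Preview the first rows and summarize column types (print so the preview shows in a script)
print(df.head())
df.info()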

# Data Wrangling
print("Missing values in dataset:\n", df.isnull().sum())

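# Feature Engineering

# Create new ratio features.
# Note: AveRooms is a per-household average and Population is the block-group
# total, so 'RoomsPerPerson' is a rough proxy, not a true per-person count.

df['RoomsPerPerson'] = df['AveRooms'] / df['Population']
df['BedroomsPerRoom'] = df['AveBedrms'] / df['AveRooms']

# Replace infinite values (from division by zero) with NaN, then drop those rows

df.replace([np.inf, -np.inf], np.nan, inplace=True)
df.dropna(inplace=True)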

# Optional log transformation (can reduce skewness)

df['LogPopulation'] = np.log1p(df['Population'])
df['LogAveOccup'] = np.log1p(df['AveOccup'])

# Drop raw columns if using log-transformed versions

df.drop(['Population', 'AveOccup'], axis=1, inplace=True)

# Feature Matrix and Target

X = df.drop('MedianHouseValue', axis=1)
y = df['MedianHouseValue']

# Train/Test Split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# Feature Scaling
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)

# Model Training
model = LinearRegression()
model.fit(X_train_scaled, y_train)

# Predictions
y_pred = model.predict(X_test_scaled)

# Evaluation
print(f"Intercept (a0): {model.intercept_}")
print(f"First Coefficient (a1): {model.coef_[0]}")
print(f"Mean Squared Error: {mean_squared_error(y_test, y_pred):.2f}")
print(f"R² Score: {r2_score(y_test, y_pred):.2f}")

# Plotting

plt.figure(figsize=(12, 5))

# Actual vs Predicted
plt.subplot(1, 2, 1)
plt.scatter(y_test, y_pred, alpha=0.5, color='blue')
plt.xlabel("Actual Median House Value")
plt.ylabel("Predicted Median House Value")
plt.title("Actual vs Predicted Values")
plt.grid(True)

plt.tight_layout()
plt.show()
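
The two metrics printed in the evaluation step can also be written out by hand, which may help when interpreting them. The sketch below is supplementary and not part of the submitted code; the function names and sample values are illustrative assumptions.

```python
def mean_squared_error_manual(y_true, y_pred):
    """Average of the squared differences between actual and predicted values."""
    n = len(y_true)
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n


def r2_score_manual(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Made-up values for illustration only.
actual = [3.0, 5.0, 7.0]
predicted_values = [2.5, 5.0, 7.5]

mse = mean_squared_error_manual(actual, predicted_values)
r2 = r2_score_manual(actual, predicted_values)
```

A lower mean squared error and an R² closer to 1 both indicate predictions that lie closer to the actual values.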

B.2 Input and Output:

B.3 Observations and learning:
In this experiment, I successfully implemented the Linear Regression algorithm, a fundamental
supervised learning technique used for predictive analysis. I observed that this model works by
establishing a linear relationship between a dependent (target) variable and one or more independent
(predictor) variables. The core task was to find the "line of best fit" that most accurately represents the
data points, which is mathematically described by the slope-intercept formula, y = a0 + a1*x. I noted the
distinction between simple linear regression, which involves a single independent variable, and multiple
linear regression, which uses several. The example of predicting an employee's salary based on years of
experience clearly illustrated how this algorithm can be used to forecast continuous values in real-world
scenarios.
B.4 Conclusion:
In conclusion, this experiment fulfilled its aim of implementing a Linear Regression model. Through this
process, I have gained a practical understanding of how to predict continuous outcomes by modeling the
relationships between variables. The experiment reinforces that Linear Regression is a straightforward
and powerful statistical tool that serves as a cornerstone of machine learning. Its simplicity and
interpretability make it an essential algorithm for data scientists to master for tasks involving forecasting
and understanding data relationships.
B.5 Question of Curiosity
(To be answered by student based on the practical performed and learning/observations)
