Regression
Regression in machine learning is a technique used to find the relationship between
independent and dependent variables, with the main purpose of predicting a continuous
outcome. A model is trained on known data to learn the patterns that relate the inputs
to the output. Once these patterns are identified, the model can make accurate
predictions for new data points or input values.
Types of Regression
1. Linear Regression
2. Logistic Regression
Linear Regression
Linear regression is a type of supervised machine-learning algorithm that learns from a
labelled dataset and fits the most optimal linear function to the data points, which can
then be used for prediction on new data. It assumes that there is a linear relationship
between the input and output, meaning the output changes at a constant rate as the
input changes. This relationship is represented by a straight line.
For example, suppose we want to predict a student's exam score based on how many
hours they studied. We observe that as students study more hours, their scores go up.
In this example:
Independent variable (input): Hours studied, because it is the factor we control or
observe.
Dependent variable (output): Exam score, because it depends on how many hours
were studied.
Equation of the Best-Fit Line
For simple linear regression (with one independent variable), the best-fit line is
represented by the equation
y=mx+b
Where:
y is the predicted value (dependent variable)
x is the input (independent variable)
m is the slope of the line (how much y changes when x changes)
b is the intercept (the value of y when x = 0)
The best-fit line will be the one that optimizes the values of m (slope) and b (intercept)
so that the predicted y values are as close as possible to the actual data points.
Worked example (X = study hours, Y = test score), with Mean(X) = 4 and Mean(Y) = 50:

Study hours (X) | Test score (Y) | Deviation (X) | Deviation (Y) | Product of deviations | Square of deviation for X
2               | 40             | -2            | -10           | 20                    | 4
4               | 50             |  0            |   0           |  0                    | 0
6               | 60             |  2            |  10           | 20                    | 4
Calculate m = Sum of products of deviations / Sum of squares of deviations for X
Calculate b = Mean(Y) − (m × Mean(X))
Calculations
Sum of Product of Deviations = 20 + 0 + 20 = 40
Sum of Square of Deviations for X = 4 + 0 + 4 = 8
m = Sum of Product of Deviations / Sum of Square of Deviations for X
m = 40/8 = 5
b = Mean(Y) − (m × Mean(X)) = 50 − (5 × 4) = 30
Final Regression Equation
Y = 5X + 30
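As a quick check, the same slope and intercept can be recovered with NumPy (a minimal sketch using the three observations from the table above):

import numpy as np

# The three observations from the worked example
x = np.array([2, 4, 6])     # study hours
y = np.array([40, 50, 60])  # test scores

# Deviation formulas used in the hand calculation above
m = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b = y.mean() - m * x.mean()

print(m, b)  # 5.0 30.0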
Study_hours.py
import pandas as pd
from sklearn.linear_model import LinearRegression
# Dataset
data = {
'StudyHours': [2, 3, 4, 5, 6, 7, 8],
'Marks': [40, 50, 55, 65, 70, 80, 85]
}
df = pd.DataFrame(data)
# Train model
X = df[['StudyHours']]
y = df['Marks']
regr = LinearRegression()
regr.fit(X, y)
# User input
study_hours = float(input("Enter study hours: "))
# Wrap input in DataFrame with the same column name
input_data = pd.DataFrame({'StudyHours': [study_hours]})
predicted_marks = regr.predict(input_data)
print(f"Study Hours: {study_hours}")
print(f"Predicted Marks: {predicted_marks[0]:.2f}")
Output
Enter study hours: 6
Study Hours: 6.0
Predicted Marks: 71.07
Q1: Fit a linear regression model for data set (x, y): (1, 1.5), (2, 3.0), (3, 4.5), (4, 6.0) and predict y for x = 5
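As a sketch of one way to answer Q1, reusing the pattern from Study_hours.py:

import pandas as pd
from sklearn.linear_model import LinearRegression

# Q1 dataset
df = pd.DataFrame({'x': [1, 2, 3, 4], 'y': [1.5, 3.0, 4.5, 6.0]})

model = LinearRegression()
model.fit(df[['x']], df['y'])

# The points lie exactly on y = 1.5x, so the prediction for x = 5 is 7.5
print(model.predict(pd.DataFrame({'x': [5]}))[0])  # 7.5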
Non-Linear Regression
Non-linear regression is a type of regression in machine learning where the relationship between the input X
and the output Y is not a straight line. Instead, the data follows a curved pattern.
In such cases, a straight line (linear regression) does not fit well, so we use equations such as polynomial,
exponential, logarithmic, or other non-linear functions.
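A minimal sketch of one common approach, polynomial regression, using made-up curved data (the numbers below are illustrative, not from the notes):

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import PolynomialFeatures

# Hypothetical curved data: y grows roughly with the square of x
X = np.array([[1], [2], [3], [4], [5]])
y = np.array([2, 5, 10, 17, 26])  # follows x^2 + 1

# Transform x into [x, x^2] so a linear model can fit the curve
poly = PolynomialFeatures(degree=2, include_bias=False)
X_poly = poly.fit_transform(X)

model = LinearRegression()
model.fit(X_poly, y)

print(model.predict(poly.transform([[6]])))  # close to 37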
Multiple Linear Regression
Simple linear regression is a statistical method used for predictive analysis. It models
the relationship between a dependent variable and a single independent variable by
fitting a linear equation to the data. Multiple linear regression extends this concept by
modelling the relationship between a dependent variable and two or more independent
variables. This technique allows us to understand how multiple features collectively
affect the outcome.
Steps for Multiple Linear Regression
The steps to perform multiple linear regression are similar to those of simple linear
regression, but the difference comes in the evaluation process. We can use it to find out
which factor has the highest influence on the predicted output and how the different
variables are related to each other. The equation for multiple linear regression is:
y = β0 + β1X1 + β2X2 + ⋯ + βnXn
Where:
y is the dependent variable
X1, X2, ⋯, Xn are the independent variables
β0 is the intercept
β1, β2, ⋯, βn are the slopes
The goal of the algorithm is to find the best fit line equation that can predict the values
based on the independent variables. A regression model learns from the dataset with
known X and y values and uses it to predict y values for unknown X.
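Before the hand calculation in Q1 below, here is a minimal sketch of multiple linear regression in scikit-learn, using made-up data with two features (the column names and numbers are illustrative):

import pandas as pd
from sklearn.linear_model import LinearRegression

# Hypothetical data: marks predicted from two features
df = pd.DataFrame({
    'StudyHours': [2, 3, 5, 7, 8],
    'SleepHours': [8, 7, 6, 5, 6],
    'Marks':      [45, 52, 65, 78, 82]
})

model = LinearRegression()
model.fit(df[['StudyHours', 'SleepHours']], df['Marks'])

# Intercept (beta0) and one slope per feature (beta1, beta2)
print(model.intercept_, model.coef_)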
Q1. Find β0, β1, β2 using the given data.

Product 1 Sales (X1) | Product 2 Sales (X2) | Weekly Sales (Y)
1                    | 4                    | 1
2                    | 5                    | 6
3                    | 8                    | 8
4                    | 2                    | 12

Adding a column of 1s for the intercept gives the design matrix X and target vector Y:

    [1 1 4]        [ 1]
X = [1 2 5]    Y = [ 6]
    [1 3 8]        [ 8]
    [1 4 2]        [12]
Step 1. Transpose of X

     [1 1 1 1]
X' = [1 2 3 4]
     [4 5 8 2]
Step 2. Multiply X' by X

       [ 4  10  19]
X'·X = [10  30  46]
       [19  46 109]
Step 3. Multiply X' by Y

       [ 27]
X'·Y = [ 85]
       [122]
Step 4. Inverse of X'·X

            [ 3.153  −0.590  −0.300]
(X'·X)^-1 = [−0.590   0.204   0.016]
            [−0.300   0.016   0.054]
Step 5. Substitute into the equation β = (X'·X)^-1 · X'·Y

    [ 3.153  −0.590  −0.300]   [ 27]   [−1.699]
β = [−0.590   0.204   0.016] · [ 85] = [ 3.483]
    [−0.300   0.016   0.054]   [122]   [−0.054]

β0 = −1.699, β1 = 3.483, β2 = −0.054
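The whole hand calculation can be checked in a few lines with NumPy's linear algebra routines (a sketch using the matrices above):

import numpy as np

# Design matrix (intercept column of 1s, then X1 and X2) and targets
X = np.array([[1, 1, 4],
              [1, 2, 5],
              [1, 3, 8],
              [1, 4, 2]], dtype=float)
Y = np.array([1, 6, 8, 12], dtype=float)

# Normal equation: beta = (X'X)^-1 X'Y
beta = np.linalg.inv(X.T @ X) @ X.T @ Y
print(beta)  # approximately [-1.70  3.48 -0.05]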
Logistic Regression
Logistic regression is a type of supervised machine-learning algorithm that also learns from labelled datasets but is
mainly used for classification problems instead of predicting continuous values. It assumes that the output is categorical,
such as Yes/No or 0/1, and maps the data points using a logistic function (sigmoid curve) to estimate probabilities
between 0 and 1. This probability is then used to decide the class of new data points. For example, we may want to
predict whether a student will pass or fail based on how many hours they studied. We observe that as study hours
increase, the probability of passing also increases, which is captured by the S-shaped logistic curve.
Sigmoid Function
Y = 1 / (1 + e^−(a0 + a1·X))
Where:
a0 → Intercept (similar to b in linear regression).
a1 → Coefficient/weight of the feature X.
X → Input (independent variable).
Output → A probability between 0 and 1.
Example :
Study Hours (X) Output(Y) = Pass/Fail
2 0
3 0
4 0
5 1
6 1
7 1
8 1
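This table can also be fitted directly with scikit-learn (a sketch; the learned coefficients will generally differ from the a0 and a1 given below):

import pandas as pd
from sklearn.linear_model import LogisticRegression

# Pass/Fail data from the table above (1 = Pass, 0 = Fail)
df = pd.DataFrame({
    'StudyHours': [2, 3, 4, 5, 6, 7, 8],
    'Pass':       [0, 0, 0, 1, 1, 1, 1]
})

clf = LogisticRegression()
clf.fit(df[['StudyHours']], df['Pass'])

# Probability of passing with 5 study hours, and the predicted class
new_point = pd.DataFrame({'StudyHours': [5]})
print(clf.predict_proba(new_point)[0][1])
print(clf.predict(new_point)[0])  # 1 means Pass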
Given :
a0 = -1.5
a1 = 0.6
Input (X) = 5
y = 1 / (1 + e^−(a0 + a1·X))
y = 1 / (1 + e^−(−1.5 + 0.6×5))
y = 1 / (1 + e^−1.5)
y = 1 / (1 + 0.2231)
y = 1 / 1.2231
y = 0.8175
Note: Since the value of Y is greater than 0.5, the student is predicted to Pass.
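The same sigmoid calculation in Python (a sketch; the small difference from 0.8175 comes from rounding e^−1.5 to 0.2231 in the hand calculation):

import math

# Coefficients and input from the worked example
a0, a1 = -1.5, 0.6
X = 5

# Sigmoid function: probability that the student passes
y = 1 / (1 + math.exp(-(a0 + a1 * X)))
print(round(y, 4))                    # 0.8176 (hand calculation: 0.8175)
print("Pass" if y > 0.5 else "Fail")  # Pass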
What is Bayes theorem?
Bayes' theorem is a fundamental concept in probability theory that plays a crucial role in
various machine learning algorithms, especially in the fields of Bayesian statistics and
probabilistic modelling. It provides a way to update probabilities based on new evidence
or information. In the context of machine learning, Bayes' theorem is often used in
Bayesian inference and probabilistic models.
The theorem can be mathematically expressed as:
P(A|B) = P(B|A) · P(A) / P(B)
Where:
P(A∣B) → Posterior Probability
The probability that event A is true after seeing evidence B.
P(B∣A) → Likelihood
The probability of seeing evidence B if hypothesis A is true.
P(A) → Prior Probability
The probability we assign to A before seeing any evidence.
P(B) → Marginal Probability (Evidence Probability)
The total probability of observing evidence B, across all possible hypotheses.
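As a small numeric illustration with made-up numbers (the classic disease-test setup; none of these figures are from the notes):

# Hypothetical numbers: 1% of people have a disease (prior),
# the test detects it 90% of the time (likelihood),
# and 5% of healthy people also test positive.
p_A = 0.01               # P(A): prior
p_B_given_A = 0.90       # P(B|A): likelihood
p_B_given_not_A = 0.05   # P(B|not A)

# P(B): total probability of a positive test, across both hypotheses
p_B = p_B_given_A * p_A + p_B_given_not_A * (1 - p_A)

# Bayes' theorem: P(A|B) = P(B|A) * P(A) / P(B)
p_A_given_B = p_B_given_A * p_A / p_B
print(round(p_A_given_B, 3))  # about 0.154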