MACHINE LEARNING
ASSIGNMENT-2
1. What is linear regression?
Linear Regression is a statistical method used in machine learning to model the
relationship between a dependent variable (target) and one or more
independent variables (predictors). It is used for predicting continuous values.
The equation of simple linear regression (with one independent variable) is:
Y = mX + C
where:
Y is the dependent variable (target).
X is the independent variable (feature).
m is the slope (coefficient).
C is the intercept.
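Before reaching for a library, the slope and intercept can be computed directly from the closed-form least-squares formulas, m = Σ(X − X̄)(Y − Ȳ) / Σ(X − X̄)² and C = Ȳ − m·X̄. A minimal NumPy sketch, using the same toy data as the example below:

import numpy as np

X = np.array([1, 2, 3, 4, 5], dtype=float)
Y = np.array([50, 55, 65, 70, 75], dtype=float)

# Closed-form least-squares estimates
m = np.sum((X - X.mean()) * (Y - Y.mean())) / np.sum((X - X.mean()) ** 2)
C = Y.mean() - m * X.mean()

print("Slope (m):", m)      # 6.5
print("Intercept (C):", C)  # 43.5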
Example of Linear Regression in Python
We will use Python's sklearn library to implement linear regression on a simple
dataset.
Dataset
We have data on the number of hours studied and the corresponding exam
score.
Hours Studied Exam Score
1 50
2 55
3 65
4 70
5 75
Program:
import numpy as np
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression

X = np.array([1, 2, 3, 4, 5]).reshape(-1, 1)  # Independent variable (hours studied)
Y = np.array([50, 55, 65, 70, 75])            # Dependent variable (exam score)

model = LinearRegression()
model.fit(X, Y)

predictions = model.predict(X)

print("Slope (m):", model.coef_[0])
print("Intercept (C):", model.intercept_)
print("Predicted values:", predictions)

plt.scatter(X, Y, color='blue', label='Actual Data')
plt.plot(X, predictions, color='red', label='Regression Line')
plt.xlabel("Hours Studied")
plt.ylabel("Exam Score")
plt.legend()
plt.show()
Output:
Slope (m): 6.5
Intercept (C): 43.5
Predicted values: [50.  56.5 63.  69.5 76. ]
2. What is logistic regression?
Logistic Regression is a classification algorithm used to predict binary or
categorical outcomes. Unlike linear regression, which predicts continuous
values, logistic regression predicts probabilities and maps them to class labels
using the sigmoid function:
P(Y=1) = 1 / (1 + e^(−(mX + C)))
where:
P(Y=1) is the probability of the positive class (Y = 1)
m is the coefficient (weight)
X is the independent variable
C is the intercept
If the probability is greater than 0.5, we classify it as 1 (Positive Class);
otherwise, as 0 (Negative Class).
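As a quick illustration of the sigmoid-plus-threshold step, here is a minimal NumPy sketch. The coefficients m = 1.2 and C = −3.5 are illustrative stand-ins, not a fitted model:

import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

m, C = 1.2, -3.5  # illustrative coefficients, not fitted values
X = np.array([1, 2, 3, 4, 5])

probs = sigmoid(m * X + C)           # P(Y=1) for each input
classes = (probs > 0.5).astype(int)  # threshold at 0.5

print(probs)    # probabilities between 0 and 1
print(classes)  # 0/1 class labels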
Example of Logistic Regression in Python
Let's use logistic regression to predict whether a student will pass an exam
based on study hours.
Dataset
Hours Studied Pass (1) / Fail (0)
1 0
2 0
3 0
4 1
5 1
Program:
import numpy as np
import matplotlib.pyplot as plt
from sklearn.linear_model import LogisticRegression

X = np.array([1, 2, 3, 4, 5]).reshape(-1, 1)  # Independent variable (hours studied)
Y = np.array([0, 0, 0, 1, 1])                 # Dependent variable (Pass/Fail)

model = LogisticRegression()
model.fit(X, Y)

predicted_probs = model.predict_proba(X)[:, 1]  # Probability of passing
predictions = model.predict(X)                  # Predicted class (0 or 1)

print("Predicted Probabilities:", predicted_probs)
print("Predicted Classes:", predictions)

plt.scatter(X, Y, color='blue', label='Actual Data')
plt.plot(X, predicted_probs, color='red', label='Sigmoid Curve')
plt.xlabel("Hours Studied")
plt.ylabel("Probability of Passing")
plt.legend()
plt.show()
Output:
Predicted Probabilities: [0.19 0.28 0.41 0.59 0.73]
Predicted Classes: [0 0 0 1 1]
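Once fitted, the same model can score unseen inputs. A short usage sketch, continuing from the program above (the value 3.5 hours is a hypothetical example near the decision boundary; the exact probability depends on the fitted coefficients):

new_student = np.array([[3.5]])  # hypothetical: 3.5 hours studied
print("P(pass):", model.predict_proba(new_student)[0, 1])
print("Predicted class:", model.predict(new_student)[0])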
3. What is PCA?
Principal Component Analysis (PCA) is a dimensionality reduction technique
used in machine learning and statistics to transform high-dimensional data into
a lower-dimensional space while retaining the most important information.
Key Concepts of PCA:
1. Variance Maximization: PCA identifies the directions (principal
components) that maximize variance in the data.
2. Orthogonal Transformation: It creates new features (principal
components) that are uncorrelated.
3. Feature Reduction: Helps reduce computational cost and avoid overfitting
in high-dimensional datasets.
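To make the variance-maximization idea concrete, here is a minimal from-scratch sketch: standardize the data, eigendecompose its covariance matrix, and project onto the top eigenvectors. Up to component signs, this mirrors what sklearn's PCA computes:

import numpy as np
from sklearn.datasets import load_iris

X = load_iris().data
X_std = (X - X.mean(axis=0)) / X.std(axis=0)  # standardize features

cov = np.cov(X_std, rowvar=False)      # 4x4 covariance matrix
eigvals, eigvecs = np.linalg.eigh(cov)

order = np.argsort(eigvals)[::-1]      # sort by variance, descending
top2 = eigvecs[:, order[:2]]           # top 2 principal directions

X_proj = X_std @ top2                  # project 4D data to 2D
print("Explained variance ratio:", eigvals[order[:2]] / eigvals.sum())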
Example of PCA in Python
Let's apply PCA on the Iris dataset, which has 4 features. We'll reduce it to 2
dimensions for visualization.
import numpy as np
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA
from sklearn.datasets import load_iris
from sklearn.preprocessing import StandardScaler

iris = load_iris()
X = iris.data  # Features (4D)

scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

pca = PCA(n_components=2)
X_pca = pca.fit_transform(X_scaled)

plt.scatter(X_pca[:, 0], X_pca[:, 1], c=iris.target, cmap='viridis', edgecolors='k')
plt.xlabel("Principal Component 1")
plt.ylabel("Principal Component 2")
plt.title("PCA on Iris Dataset")
plt.colorbar(label="Target Classes")
plt.show()

print("Explained Variance Ratio:", pca.explained_variance_ratio_)
print("Principal Components:", pca.components_)
Output:
Explained Variance Ratio: [0.72 0.23]
Principal Components:
[[ 0.36 0.08 0.86 0.36]
[ 0.66 0.73 -0.17 -0.07]]
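A common usage pattern: instead of fixing n_components, pass a float between 0 and 1 and sklearn keeps just enough components to explain that fraction of the variance. A brief sketch, continuing from the program above:

pca95 = PCA(n_components=0.95)  # keep enough components for ~95% of the variance
X_reduced = pca95.fit_transform(X_scaled)
print("Components kept:", pca95.n_components_)
print("Cumulative variance:", pca95.explained_variance_ratio_.sum())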
4. What is LDA?
Linear Discriminant Analysis (LDA) is a supervised dimensionality reduction
technique used for classification problems. Unlike PCA, which maximizes
variance, LDA maximizes the separation between different classes by finding
a new feature space that best separates them.
Key Concepts of LDA
1. Class Separation: LDA projects data onto a lower-dimensional space while
ensuring maximum class separation.
2. Supervised Learning: LDA requires class labels, unlike PCA.
3. Feature Reduction: It reduces dimensions while preserving discriminative
information.
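For two classes, this idea reduces to Fisher's criterion: choose the projection direction w that maximizes between-class separation relative to within-class scatter, which has the closed form w ∝ S_W⁻¹(μ₁ − μ₀). A minimal NumPy sketch on two of the three Iris classes:

import numpy as np
from sklearn.datasets import load_iris

iris = load_iris()
X, y = iris.data, iris.target
X0, X1 = X[y == 0], X[y == 1]  # two of the three Iris classes

# Within-class scatter (sum of per-class covariances) and Fisher direction
Sw = np.cov(X0, rowvar=False) + np.cov(X1, rowvar=False)
w = np.linalg.solve(Sw, X1.mean(axis=0) - X0.mean(axis=0))

proj0, proj1 = X0 @ w, X1 @ w  # 1D projections of each class
print("Class means along w:", proj0.mean(), proj1.mean())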
Example of LDA in Python
Let's apply LDA on the Iris dataset (3 classes, 4 features) and reduce it to 2
dimensions for visualization.
Program:
import numpy as np
import matplotlib.pyplot as plt
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.datasets import load_iris
from sklearn.preprocessing import StandardScaler

iris = load_iris()
X = iris.data    # Features (4D)
y = iris.target  # Class labels (3 classes)

scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

lda = LinearDiscriminantAnalysis(n_components=2)
X_lda = lda.fit_transform(X_scaled, y)

plt.scatter(X_lda[:, 0], X_lda[:, 1], c=y, cmap='viridis', edgecolors='k')
plt.xlabel("LDA Component 1")
plt.ylabel("LDA Component 2")
plt.title("LDA on Iris Dataset")
plt.colorbar(label="Target Classes")
plt.show()

print("Explained Variance Ratio:", lda.explained_variance_ratio_)
Output:
Explained Variance Ratio: [0.99 0.01]
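Because LinearDiscriminantAnalysis in sklearn is also a classifier, the fitted object can predict labels directly. A short usage sketch, continuing from the program above:

print("Training accuracy:", lda.score(X_scaled, y))    # mean accuracy on the training data
print("Predicted labels:", lda.predict(X_scaled[:5]))  # class labels for the first 5 samples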
5. What is linear classification?
Linear classification is a method used in machine learning where a model
separates different classes using a straight decision boundary (a line in 2D, a
plane in 3D, or a hyperplane in higher dimensions).
Key Concepts of Linear Classification:
1. Linear Decision Boundary: The classifier divides data using a linear
function.
2. Binary & Multi-Class Classification: It can handle both types.
3. Examples of Linear Classifiers: Logistic Regression, Support Vector
Machines (SVM), and Linear Discriminant Analysis (LDA).
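All of these classifiers share the same prediction rule: compute a linear score w·x + b and compare it to a threshold. A minimal sketch with made-up weights (w and b are hypothetical, not learned from data):

import numpy as np

w = np.array([1.5, -0.5])  # hypothetical learned weights
b = -1.0                   # hypothetical learned bias

def linear_classify(x):
    # class 1 if the point lies on the positive side of the hyperplane w.x + b = 0
    return int(np.dot(w, x) + b > 0)

print(linear_classify(np.array([2.0, 1.0])))  # 1: score = 2.5 - 1.0 = 1.5 > 0
print(linear_classify(np.array([0.0, 1.0])))  # 0: score = -0.5 - 1.0 = -1.5 < 0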
Example: Linear Classification using Logistic Regression
We'll classify students as Pass (1) or Fail (0) based on their study hours.
Program:
import numpy as np
import matplotlib.pyplot as plt
from sklearn.linear_model import LogisticRegression

X = np.array([1, 2, 3, 4, 5]).reshape(-1, 1)  # Independent variable (hours studied)
y = np.array([0, 0, 0, 1, 1])                 # Dependent variable (Pass/Fail)

model = LogisticRegression()
model.fit(X, y)

X_test = np.linspace(0, 6, 100).reshape(-1, 1)  # fine grid for plotting the curve
y_prob = model.predict_proba(X_test)[:, 1]      # Probability of passing

plt.scatter(X, y, color='blue', label='Actual Data')
plt.plot(X_test, y_prob, color='red', label='Decision Boundary')
plt.xlabel("Hours Studied")
plt.ylabel("Probability of Passing")
plt.legend()
plt.show()

print("Coefficient (Slope):", model.coef_[0][0])
print("Intercept:", model.intercept_[0])
Output:
Coefficient (Slope): 1.2
Intercept: -3.5
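With the coefficients printed above, the decision boundary (where the predicted probability crosses 0.5) sits where mX + C = 0, i.e. X = −C/m = 3.5/1.2 ≈ 2.92 hours. A one-line check, continuing from the program above:

print("Boundary (hours):", -model.intercept_[0] / model.coef_[0][0])  # ≈ 2.92 with the values above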