Supervised Learning
Supervised Learning is a type of machine learning where the model is trained using labeled
data (input with correct output).

The main focus is on two key tasks:


Classification → predicting categories or classes (e.g., spam or not spam).

Regression → predicting continuous values (e.g., house price, temperature).

It works by mapping input features to output labels through a learning process.

It plays a major role in AI applications such as:


Medical diagnosis
Fraud detection
Speech and image recognition
Business forecasting
Classification
Definition: Classification is a supervised learning task where the goal is to assign an input
into one of several predefined categories (classes).
Nature of Output: The output is categorical (e.g., Yes/No, Pass/Fail, Fraud/Not Fraud).
Purpose: Helps in decision-making by grouping data into meaningful classes.

Process:
1. Collect labeled training data.
2. Train a model to identify class patterns.
3. Use the model to classify new, unseen data.

Examples:

Email filtering → Spam / Not Spam

Medical diagnosis → Disease / No Disease

Image recognition → Dog / Cat / Human
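As a minimal sketch of this three-step process, assuming scikit-learn is installed, a classifier can be trained on a tiny invented spam dataset:

# Minimal classification sketch (scikit-learn assumed; data invented for illustration).
from sklearn.tree import DecisionTreeClassifier

# Step 1: labeled training data, features are [word count, number of links]
X_train = [[120, 0], [30, 5], [200, 1], [15, 8]]
y_train = ["not spam", "spam", "not spam", "spam"]

# Step 2: train a model to identify class patterns
model = DecisionTreeClassifier()
model.fit(X_train, y_train)

# Step 3: classify new, unseen data
print(model.predict([[25, 6]]))  # likely "spam" given the training pattern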
Classification Model
Definition: A classification model is a mathematical or computational representation that learns from
labeled data to predict the correct class for new inputs.

Input & Output:

Input → Features (X1, X2, …, Xn)

Output → Class label (Y)

Training Process:
Feed the model with training data.
Learn patterns and relationships between features and labels.
Adjust model parameters to minimize errors.

Prediction: For any unseen input, the model assigns it to the most probable class.
Evaluation: Performance is measured using metrics such as accuracy, precision, recall, and F1-score.

Example:
Input: Age, income, browsing history
Output: Likelihood of buying a product (Yes/No)
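The evaluation step can be sketched directly, assuming scikit-learn; the label arrays below are invented (1 = buys the product, 0 = does not):

# Computing standard classification metrics (scikit-learn assumed; labels invented).
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0, 1, 0, 0]   # actual outcomes
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]   # model predictions

print("Accuracy :", accuracy_score(y_true, y_pred))
print("Precision:", precision_score(y_true, y_pred))
print("Recall   :", recall_score(y_true, y_pred))
print("F1-score :", f1_score(y_true, y_pred))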
Classification – Learning Steps
Step 1: Data Collection
Gather a labeled dataset containing input features and their correct class labels.

Step 2: Data Preprocessing


Clean missing values, remove noise, and normalize data for better accuracy.

Step 3: Train-Test Split


Divide data into training set (to build the model) and testing set (to evaluate performance).

Step 4: Choose Algorithm


Select a suitable classification algorithm (e.g., Decision Tree, kNN, SVM).

Step 5: Model Training


Train the model using the training dataset to learn input-output relationships.

Step 6: Model Evaluation


Test the model with unseen data and calculate metrics (accuracy, precision, recall).

Step 7: Hyperparameter Tuning


Adjust parameters (like learning rate, depth, neighbors) to improve performance.

Step 8: Deployment & Prediction


Deploy the trained model to classify new real-world data.
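The eight steps can be strung together in code. This is one possible arrangement, assuming scikit-learn and its bundled Iris dataset; any labeled dataset would do:

# End-to-end sketch of the learning steps (scikit-learn assumed).
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)   # Steps 1-2: labeled, already-clean data

# Step 3: train-test split
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# Steps 4-5: choose an algorithm and train it
model = DecisionTreeClassifier().fit(X_train, y_train)

# Step 6: evaluate on unseen data
print("Accuracy:", accuracy_score(y_test, model.predict(X_test)))

# Step 7: hyperparameter tuning (here, tree depth)
grid = GridSearchCV(DecisionTreeClassifier(), {"max_depth": [2, 3, 5, None]}, cv=5)
grid.fit(X_train, y_train)
print("Best depth:", grid.best_params_)

# Step 8: grid.best_estimator_ would then be deployed on new real-world data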
Common Classification Algorithms
1. k-Nearest Neighbors (kNN)
Classifies a sample based on the majority class of its nearest neighbors.
Simple and effective for small datasets.

2. Decision Tree
Represents decisions in a tree-like structure of rules.
Easy to interpret and visualize.

3. Random Forest
An ensemble method combining multiple decision trees.
Reduces overfitting and improves accuracy.

4. Support Vector Machine (SVM)
Finds the best separating hyperplane between classes.
Effective for high-dimensional data.

5. Naïve Bayes
Based on Bayes' theorem with independence assumptions.
Works well for text classification (e.g., spam filtering).

6. Neural Networks
Mimics the human brain's neuron connections.
Useful for complex tasks like image and speech recognition.
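To make the list concrete, here is a brief sketch trying five of these algorithms on one dataset (neural networks are omitted for brevity), assuming scikit-learn:

# Several of the listed classifiers on the same data (scikit-learn assumed).
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
from sklearn.naive_bayes import GaussianNB

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

models = {
    "kNN": KNeighborsClassifier(n_neighbors=5),
    "Decision Tree": DecisionTreeClassifier(),
    "Random Forest": RandomForestClassifier(n_estimators=100),
    "SVM": SVC(),
    "Naive Bayes": GaussianNB(),
}
for name, clf in models.items():
    clf.fit(X_train, y_train)
    print(name, clf.score(X_test, y_test))  # test-set accuracy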
Regression
Definition: Regression is a supervised learning technique used to predict continuous numerical values rather than categories.

Nature of Output: Output is a real-valued number (e.g., price, temperature, sales).

Purpose:
Identify relationships between input features and output variable.
Make future predictions based on historical data.

Process:
1. Collect a labeled dataset with numeric outcomes.
2. Train a regression model to fit a line or curve.
3. Predict continuous values for unseen inputs.

Examples:
Predicting house price based on location, size, and age.
Forecasting stock market trends.
Estimating student marks based on study hours.
Difference from Classification:

Classification → categorical outputs (Yes/No).

Regression → continuous outputs (numbers).
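The difference shows up directly in code. In this minimal sketch, assuming scikit-learn, the same invented house features feed a classifier (category out) and a regressor (number out):

# Classifier vs. regressor on the same invented data (scikit-learn assumed).
from sklearn.tree import DecisionTreeClassifier, DecisionTreeRegressor

X = [[50, 1], [120, 3], [80, 2], [200, 4]]              # [area m^2, rooms]
y_class = ["cheap", "expensive", "cheap", "expensive"]  # categorical labels
y_value = [95_000, 310_000, 180_000, 520_000]           # continuous prices

clf = DecisionTreeClassifier().fit(X, y_class)
reg = DecisionTreeRegressor().fit(X, y_value)

print(clf.predict([[100, 2]]))  # classification: a category
print(reg.predict([[100, 2]]))  # regression: a number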
Common Regression Algorithms
1. Linear Regression
Models relationship between input (X) and output (Y) as a straight line.

2. Multiple Regression
Uses multiple independent variables to predict a dependent variable.

3. Polynomial Regression
Fits curved data by adding higher-order polynomial terms.

4. Logistic Regression
Used for binary classification, outputs probability (0–1).

5. Ridge & Lasso Regression
Add penalty terms to reduce overfitting and handle multicollinearity.
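For item 5, a short sketch, assuming scikit-learn; alpha sets the penalty strength and the value 1.0 here is arbitrary:

# Ridge (L2 penalty) and Lasso (L1 penalty) regression (scikit-learn assumed).
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge, Lasso

# Synthetic data from scikit-learn's generator, for illustration only
X, y = make_regression(n_samples=100, n_features=5, noise=10, random_state=0)

ridge = Ridge(alpha=1.0).fit(X, y)   # shrinks coefficients toward zero
lasso = Lasso(alpha=1.0).fit(X, y)   # can set some coefficients exactly to zero

print("Ridge coefficients:", ridge.coef_)
print("Lasso coefficients:", lasso.coef_)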
Linear Regression
Equation: Y = mX + c (simple linear form).

Goal: Minimize the difference between predicted and actual values.

Method Used: Least Squares Method, which minimizes the sum of squared errors.

Applications:
Predicting salary based on experience.
Estimating sales from advertising spend.
Advantages: Simple, interpretable, works well for linear data.
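A minimal least-squares sketch using only NumPy; the experience/salary numbers are invented:

# Simple linear regression Y = mX + c via least squares (NumPy assumed).
import numpy as np

x = np.array([1, 2, 3, 4, 5])        # years of experience (invented)
y = np.array([30, 35, 42, 48, 55])   # salary in $1000s (invented)

# np.polyfit with degree 1 solves the least-squares line
m, c = np.polyfit(x, y, 1)
print(f"Y = {m:.2f}X + {c:.2f}")

# Predict salary for 6 years of experience
print("Prediction:", m * 6 + c)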
Multiple Regression
Definition: Extends linear regression by using two or more independent variables.

Equation: Y = b0 + b1X1 + b2X2 + … + bnXn.

Purpose: Capture influence of multiple factors on a dependent variable.

Example: Predicting house price using area, location, number of rooms, and age.

Challenges: Risk of multicollinearity (when predictors are highly correlated).
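A hedged sketch of the house-price example, assuming scikit-learn; the feature rows are invented:

# Multiple regression: price from area, location score, rooms, age (data invented).
from sklearn.linear_model import LinearRegression

X = [[80, 7, 2, 10], [120, 9, 3, 5], [60, 4, 1, 30], [200, 8, 5, 2]]
y = [180_000, 340_000, 90_000, 560_000]       # prices (invented)

model = LinearRegression().fit(X, y)
print("Coefficients b1..b4:", model.coef_)    # one coefficient per feature
print("Intercept b0:", model.intercept_)
print("Prediction:", model.predict([[100, 6, 3, 15]]))

If two predictors were highly correlated, their individual coefficients would become unstable; that is the multicollinearity risk noted above.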


Regression Assumptions
Linearity → Relationship between predictors and output must be linear.

Independence → Observations should not influence each other.

Normality → Residuals (errors) should follow a normal distribution.

Homoscedasticity → Constant variance of errors across data points.

No Multicollinearity → Predictors should not be highly correlated.
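Some of these assumptions can be spot-checked from the residuals. A rough sketch, assuming NumPy and SciPy, with invented data and conventional rule-of-thumb thresholds:

# Rough residual diagnostics for a fitted line (NumPy and SciPy assumed).
import numpy as np
from scipy import stats

x = np.array([1, 2, 3, 4, 5, 6, 7, 8])
y = np.array([2.1, 4.3, 5.9, 8.2, 9.8, 12.1, 14.2, 15.9])   # invented

m, c = np.polyfit(x, y, 1)
residuals = y - (m * x + c)

# Normality: Shapiro-Wilk test (p > 0.05 means no evidence against normality)
print("Shapiro-Wilk p-value:", stats.shapiro(residuals).pvalue)

# Homoscedasticity, crude check: residual spread in first vs. second half
print("Spread:", residuals[:4].std(), "vs.", residuals[4:].std())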


Polynomial Regression
Definition: Regression technique where the relationship is modeled as an nth-degree polynomial.

Equation: Y = b0 + b1X + b2X^2 + b3X^3 + … + bnX^n.

Use Case: When data shows a non-linear pattern.

Example: Growth rate of a plant over time (curved trend).

Limitation: High-degree polynomials may lead to overfitting.
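A short sketch with NumPy; the plant-growth numbers are invented to show a curved trend:

# Polynomial regression: fit a degree-2 curve with NumPy (assumed).
import numpy as np

days   = np.array([1, 2, 3, 4, 5, 6])
height = np.array([1.0, 1.8, 3.2, 5.1, 7.9, 11.2])   # invented, curved growth

# np.polyfit returns coefficients from highest degree down: [b2, b1, b0]
b2, b1, b0 = np.polyfit(days, height, 2)
print(f"Y = {b2:.2f}X^2 + {b1:.2f}X + {b0:.2f}")

# Caution: a degree-5 fit on 6 points would pass through every point,
# fitting the noise rather than the trend, which is overfitting.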


Logistic Regression
Purpose: Used for classification problems, not regression (despite its name).
Output: Probability between 0 and 1.
Sigmoid Function: Converts linear output into probability.

Decision Boundary:

Probability > 0.5 → Class 1

Probability ≤ 0.5 → Class 0

Applications:
Medical diagnosis (disease/no disease).
Credit risk prediction (default/no default).
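A compact sketch, assuming scikit-learn, with the sigmoid also written out in NumPy to show how a linear score becomes a probability; the credit data is invented:

# Logistic regression: the sigmoid maps a linear score to a probability.
import numpy as np
from sklearn.linear_model import LogisticRegression

def sigmoid(z):
    # Maps any real number into (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

print(sigmoid(0.0))   # 0.5, exactly on the decision boundary

# Invented credit data: [income in $1000s, debt in $1000s]; 1 = default
X = [[30, 20], [80, 5], [25, 30], [90, 2], [40, 25], [70, 10]]
y = [1, 0, 1, 0, 1, 0]

model = LogisticRegression().fit(X, y)
p = model.predict_proba([[50, 15]])[0, 1]    # probability of default
print("P(default):", p, "-> class", int(p > 0.5))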
Maximum Likelihood Estimation (MLE)
Definition: A statistical method used to estimate parameters of a model.
Idea: Choose parameters that make the observed data most likely.

Steps:
Define likelihood function based on model.
Calculate probability of data given parameters.
Adjust parameters to maximize likelihood.

Applications in ML:
Used in Logistic Regression for parameter estimation.
Widely applied in probabilistic models (Naïve Bayes, Hidden Markov Models).

Advantage: Produces efficient and consistent parameter estimates.
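A minimal worked sketch of the MLE idea for a coin-flip (Bernoulli) model, using only NumPy: it scores candidate parameter values by log-likelihood and recovers the familiar closed-form answer, the sample fraction of heads:

# MLE sketch: pick the p that makes the observed coin flips most likely.
import numpy as np

flips = np.array([1, 1, 0, 1, 0, 1, 1, 1, 0, 1])   # invented: 7 heads in 10

def log_likelihood(p, data):
    # log P(data | p) for independent Bernoulli(p) observations
    return np.sum(data * np.log(p) + (1 - data) * np.log(1 - p))

# Evaluate the likelihood over a grid of candidate parameters and keep the best
candidates = np.linspace(0.01, 0.99, 99)
scores = [log_likelihood(p, flips) for p in candidates]
p_hat = candidates[int(np.argmax(scores))]

print("MLE estimate:", p_hat)           # ~0.70
print("Sample mean :", flips.mean())    # the closed-form MLE agrees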

You might also like