Machine learning algorithms are essentially sets of instructions that allow
computers to learn from data, make predictions, and improve their performance
over time without being explicitly programmed. They are broadly categorized
into three types:
Supervised Learning: Algorithms learn from labeled data, where the input-
output relationship is known.
Unsupervised Learning: Algorithms work with unlabeled data to identify
patterns or groupings.
Reinforcement Learning: Algorithms learn by interacting with an environment
and receiving feedback in the form of rewards or penalties.
Supervised Learning Algorithms
Supervised learning algorithms are trained on datasets where each example is paired
with a target or response variable, known as the label. The goal is to learn a
mapping function from input data to the corresponding output labels, enabling
the model to make accurate predictions on unseen data. Supervised learning
problems are generally categorized into two main types: Classification and
Regression. The most widely used supervised learning algorithms are:
1. Linear Regression
Linear regression is used to predict a continuous value by finding the best-fit
straight line between the input (independent variable) and the output (dependent
variable).
It minimizes the difference between actual and predicted values using the
"least squares" method to best fit the data.
Example: predicting a person’s weight based on their height, or predicting house
prices based on size.
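A minimal sketch using scikit-learn; the house sizes and prices below are made up purely for illustration:

# Minimal linear regression sketch using scikit-learn (illustrative data).
import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical data: house sizes (square feet) and prices.
sizes = np.array([[600], [800], [1000], [1200], [1500]])
prices = np.array([150000, 200000, 240000, 275000, 340000])

model = LinearRegression()
model.fit(sizes, prices)  # least-squares fit of a straight line

print(model.predict([[1100]]))  # predicted price for a 1100 sq ft house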
2. Logistic Regression
Logistic regression predicts probabilities and assigns data points to binary
classes (e.g., spam or not spam).
It uses a logistic function (S-shaped curve) to model the relationship between
input features and class probabilities.
Used for classification tasks (binary or multi-class).
Outputs probabilities to classify data into categories.
Example: predicting whether a customer will buy a product online (yes/no) or
diagnosing whether a person has a disease (sick/not sick).
Note: Despite its name, logistic regression is used for classification tasks, not
regression.
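A short sketch with scikit-learn's LogisticRegression; the customer features and labels here are invented for illustration:

# Logistic regression sketch: binary classification with scikit-learn.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical features: [minutes on site, pages viewed]; label: bought (1) or not (0).
X = np.array([[1, 2], [3, 5], [10, 8], [12, 10], [2, 1], [9, 7]])
y = np.array([0, 0, 1, 1, 0, 1])

clf = LogisticRegression()
clf.fit(X, y)

print(clf.predict([[8, 6]]))        # predicted class (0 or 1)
print(clf.predict_proba([[8, 6]]))  # class probabilities from the logistic function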
3. Decision Trees
A decision tree splits data into branches based on feature values, creating a tree-
like structure.
Each decision node represents a test on a feature; splitting continues until a
leaf node provides the final prediction.
Works for both classification and regression tasks.
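A brief sketch using scikit-learn's DecisionTreeClassifier on the built-in Iris dataset:

# Decision tree sketch with scikit-learn on the built-in Iris dataset.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

tree = DecisionTreeClassifier(max_depth=3, random_state=0)
tree.fit(X, y)  # splits on feature values until leaves give predictions

print(tree.predict(X[:5]))  # predictions for the first five samples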
For more decision tree algorithms, you can explore:
Iterative Dichotomiser 3 (ID3) Algorithm
C5.0 Algorithm
Classification and Regression Trees (CART) Algorithm
4. Support Vector Machines (SVM)
SVMs find the best boundary (called a hyperplane) that separates data points
into different classes.
Uses support vectors (critical data points) to define the hyperplane.
Can handle linear and non-linear problems using kernel functions.
Focuses on maximizing the margin between classes, making it robust for
high-dimensional data and complex patterns.
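A compact sketch: an RBF-kernel SVM on scikit-learn's make_moons data, which is not linearly separable:

# SVM sketch: an RBF-kernel classifier for a non-linear boundary.
from sklearn.datasets import make_moons
from sklearn.svm import SVC

# Two interleaving half-circles: not linearly separable.
X, y = make_moons(n_samples=200, noise=0.15, random_state=0)

svm = SVC(kernel="rbf", C=1.0)  # the kernel trick handles the non-linearity
svm.fit(X, y)

print(svm.score(X, y))            # training accuracy
print(len(svm.support_vectors_))  # the points that define the boundary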
5. k-Nearest Neighbors (k-NN)
k-NN is a simple algorithm that predicts the output for a new data point based
on its similarity (distance) to the nearest neighbors in the training dataset;
it is used for both classification and regression tasks.
Calculates the distance between the new point and the existing points in the
training dataset using a distance metric (e.g., Euclidean, Manhattan, Minkowski).
Identifies the k nearest neighbors to the new data point based on the calculated
distances.
For classification, the algorithm assigns the class label that is most common
among the k nearest neighbors.
For regression, the algorithm predicts the value as the average of the values of
its k nearest neighbors.
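A minimal classification sketch with scikit-learn's KNeighborsClassifier on the Iris dataset:

# k-NN sketch: classify a new point by majority vote of its 3 nearest neighbors.
from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

knn = KNeighborsClassifier(n_neighbors=3)  # Euclidean distance by default
knn.fit(X, y)  # "training" just stores the data

print(knn.predict([[5.0, 3.4, 1.5, 0.2]]))  # most common class among 3 neighbors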
6. Naive Bayes
Based on Bayes' theorem, and assumes all features are independent of each other
(hence "naive").
Calculates probabilities for each class and assigns the most likely class to a data
point.
The assumption of feature independence rarely holds in real-world data.
Works well for high-dimensional data.
Commonly used in text classification tasks like spam filtering. For in-depth
understanding: Naive Bayes
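A tiny spam-filtering sketch using bag-of-words counts and scikit-learn's MultinomialNB; the corpus is made up for illustration:

# Naive Bayes sketch for spam filtering: bag-of-words + MultinomialNB.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

# Tiny made-up corpus for illustration.
texts = ["win a free prize now", "meeting at noon tomorrow",
         "free offer click now", "lunch with the team"]
labels = [1, 0, 1, 0]  # 1 = spam, 0 = not spam

vec = CountVectorizer()
X = vec.fit_transform(texts)  # word-count features

nb = MultinomialNB()
nb.fit(X, labels)

print(nb.predict(vec.transform(["free prize now"])))  # likely spam (1)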
7. Random Forest
Random forest is an ensemble method that combines multiple decision trees.
Uses random sampling and feature selection for diversity among trees.
Final prediction is based on majority voting (classification) or averaging
(regression).
Advantages: reduces overfitting compared to individual decision trees; handles
large, high-dimensional datasets.
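A short sketch with scikit-learn's RandomForestClassifier on the built-in breast-cancer dataset:

# Random forest sketch: an ensemble of decision trees with majority voting.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier

X, y = load_breast_cancer(return_X_y=True)

# Each tree sees a bootstrap sample and random feature subsets.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X, y)

print(forest.predict(X[:3]))  # majority vote across the 100 trees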
For in-depth understanding: What is Ensemble Learning? - Two types of
ensemble methods in ML
8. Gradient Boosting (e.g., XGBoost, LightGBM, CatBoost)
These algorithms build models sequentially: each new model corrects the errors
made by the previous ones. Combining many weak learners (like shallow decision
trees) yields a strong predictive model, effective for both regression and
classification tasks. For in-depth understanding: Gradient Boosting in ML
XGBoost (Extreme Gradient Boosting): an advanced version of gradient boosting
that adds regularization to prevent overfitting and trains faster than
traditional gradient boosting, making it well suited to large datasets.
LightGBM (Light Gradient Boosting Machine): Uses a histogram-based
approach for faster computation and supports categorical features natively.
CatBoost: Designed specifically for categorical data, with built-in encoding
techniques. Uses symmetric trees for faster training and better generalization.
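A sketch using scikit-learn's GradientBoostingClassifier; XGBoost, LightGBM, and CatBoost expose very similar fit/predict interfaces:

# Gradient boosting sketch using scikit-learn's implementation.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier

X, y = load_breast_cancer(return_X_y=True)

# Each new shallow tree fits the errors of the ensemble built so far.
gb = GradientBoostingClassifier(n_estimators=100, learning_rate=0.1, max_depth=3)
gb.fit(X, y)

print(gb.score(X, y))  # training accuracy of the boosted ensemble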
For more ensemble learning and gradient boosting approaches, explore:
AdaBoost
Stacking - ensemble learning
9. Neural Networks (Including Multilayer Perceptron)
Neural networks, including Multilayer Perceptrons (MLPs), count as supervised
learning algorithms when trained on labeled data to learn the relationship
between inputs and desired outputs. The network minimizes its prediction error
by adjusting weights with the backpropagation algorithm during training.
Multilayer Perceptron (MLP): a neural network with multiple layers of nodes,
used for both classification and regression (examples: image classification,
spam detection, and predicting numerical values like stock prices or house
prices).
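A small sketch with scikit-learn's MLPClassifier, which trains via backpropagation, on the built-in digits dataset:

# MLP sketch: a small multilayer perceptron trained with backpropagation.
from sklearn.datasets import load_digits
from sklearn.neural_network import MLPClassifier

X, y = load_digits(return_X_y=True)

# Two hidden layers; weights are adjusted by backpropagation during fit().
mlp = MLPClassifier(hidden_layer_sizes=(64, 32), max_iter=300, random_state=0)
mlp.fit(X, y)

print(mlp.predict(X[:5]))  # predicted digit labels for the first five images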
For in-depth understanding: Supervised multi-layer perceptron model - What
is perceptron?
Unsupervised Learning Algorithms
Unsupervised learning algorithms work with unlabeled data to discover hidden
patterns or structures without predefined outputs. They fall into three main
categories based on their purpose: Clustering, Dimensionality Reduction, and
Association Rule Mining, covered below in that order.
1. Clustering
Clustering algorithms group data points into clusters based on their similarities
or differences. The goal is to identify natural groupings in the data. Clustering
algorithms are divided into multiple types based on the methods they use to
group data. These types include Centroid-based methods, Distribution-based
methods, Connectivity-based methods, and Density-based methods. For
resources and in-depth understanding, go through the links below.
Centroid-based Methods: Represent clusters using central points, such as
centroids or medoids.
K-Means clustering: Divides data into k clusters by iteratively assigning points
to the nearest centers, assuming roughly spherical clusters (see the sketch
after this list).
K-Means++ clustering
K-Mode clustering
Fuzzy C-Means (FCM) Clustering
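The K-Means sketch referenced above, on synthetic blob data from scikit-learn; no labels are used for fitting:

# K-Means sketch: partition points into k clusters around moving centroids.
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic data with three blob-shaped groups (labels discarded).
X, _ = make_blobs(n_samples=300, centers=3, random_state=0)

km = KMeans(n_clusters=3, n_init=10, random_state=0)
km.fit(X)  # alternates assigning points to centers and moving the centers

print(km.cluster_centers_)  # learned cluster centers
print(km.labels_[:10])      # cluster assignment of the first ten points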
Distribution-based Methods
Gaussian mixture models (GMMs): Models clusters as overlapping Gaussian
distributions, assigning probabilities for data points' cluster membership.
Expectation-Maximization Algorithms
Dirichlet process mixture models (DPMMs)
Connectivity-based Methods
Hierarchical clustering: Builds a tree-like structure (dendrogram) by merging or
splitting clusters, with no predefined number of clusters.
Agglomerative Clustering
Divisive clustering
Affinity propagation
Density-based Methods
DBSCAN (Density-Based Spatial Clustering of Applications with Noise):
Forms clusters based on density, allowing arbitrary shapes and detecting
outliers, controlled by a distance parameter and a minimum-points parameter.
OPTICS (Ordering Points To Identify the Clustering Structure)
2. Dimensionality Reduction
Dimensionality reduction is used to simplify datasets by reducing the number of
features while retaining the most important information.
Principal Component Analysis (PCA): Transforms data into a new set of
orthogonal features (principal components) that capture the maximum variance
(see the sketch after this list).
t-distributed Stochastic Neighbor Embedding (t-SNE): Reduces dimensions for
visualizing high-dimensional data, preserving local relationships.
Non-negative Matrix Factorization (NMF): Factorizes data into non-negative
components, useful for sparse data like text or images.
Independent Component Analysis (ICA)
Isomap: Preserves geodesic distances to capture non-linear structures in data.
Locally Linear Embedding (LLE): Preserves local relationships by
reconstructing data points from their neighbors.
Latent Semantic Analysis (LSA): Reduces the dimensionality of text data,
revealing hidden patterns.
Autoencoders: Neural networks that compress and reconstruct data, useful for
feature learning and anomaly detection.
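The PCA sketch referenced above, projecting scikit-learn's 64-dimensional digit images down to two components:

# PCA sketch: project 64-dimensional digit images onto 2 principal components.
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

X, _ = load_digits(return_X_y=True)

pca = PCA(n_components=2)     # keep the two directions of maximum variance
X_2d = pca.fit_transform(X)

print(X_2d.shape)                     # (1797, 2)
print(pca.explained_variance_ratio_)  # variance captured by each component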
3. Association Rule Mining
Association rule mining finds patterns (called association rules) between items
in large datasets, typically in market basket analysis (e.g., finding that
people who buy bread often buy butter). It identifies patterns based solely on
the frequency of item occurrences and co-occurrences in the dataset.
Apriori algorithm: Finds frequent itemsets by iterating through the data and
pruning non-frequent item combinations (a simplified sketch follows this list).
FP-Growth (Frequent Pattern-Growth): Efficiently mines frequent itemsets
using a compressed FP-tree structure without candidate generation.
ECLAT (Equivalence Class Clustering and bottom-up Lattice Traversal): Uses a
vertical data format for faster frequent-pattern discovery through efficient
intersection of itemsets.
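A simplified illustration of the idea behind Apriori on toy transactions; real Apriori prunes candidate itemsets level by level rather than enumerating them all:

# Frequent-itemset sketch: count itemset frequencies and keep only those
# above a support threshold (toy transactions, illustration only).
from itertools import combinations
from collections import Counter

transactions = [{"bread", "butter", "milk"},
                {"bread", "butter"},
                {"bread", "jam"},
                {"butter", "milk"}]
min_support = 2  # an itemset must appear in at least 2 transactions

# Count all 1- and 2-item combinations in each transaction.
counts = Counter()
for t in transactions:
    for size in (1, 2):
        for itemset in combinations(sorted(t), size):
            counts[itemset] += 1

frequent = {s: c for s, c in counts.items() if c >= min_support}
print(frequent)  # e.g. ('bread', 'butter') appears in 2 transactions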
Reinforcement Learning Algorithms
Reinforcement learning involves training agents to make a sequence of
decisions by rewarding them for good actions and penalizing them for bad ones.
Broadly categorized into Model-Based and Model-Free methods, these
approaches differ in how they interact with the environment.
1. Model-Based Methods
These methods use a model of the environment to predict outcomes and help the
agent plan actions by simulating potential results.
Markov decision processes (MDPs)
Bellman equation
Value iteration algorithm
Monte Carlo Tree Search
2. Model-Free Methods
These methods do not build or rely on an explicit model of the environment.
Instead, the agent learns directly from experience by interacting with the
environment and adjusting its actions based on feedback. Model-Free methods
can be further divided into Value-Based and Policy-Based methods:
Value-Based Methods: Focus on learning the value of different states or actions,
where the agent estimates the expected return from each action and selects the
one with the highest value.
Q-Learning
SARSA
Monte Carlo Methods
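To make the value-based idea concrete, here is a minimal tabular Q-learning sketch on a hypothetical 5-state corridor; the environment is invented for illustration:

# Tabular Q-learning sketch: move right along a corridor to reach the goal
# at state 4 (toy environment, not a real library).
import random

n_states, actions = 5, [0, 1]        # action 0 = left, 1 = right
Q = [[0.0, 0.0] for _ in range(n_states)]
alpha, gamma, epsilon = 0.5, 0.9, 0.1

for _ in range(500):                  # training episodes
    s = 0
    while s != 4:                     # until the goal is reached
        # epsilon-greedy action selection
        a = random.choice(actions) if random.random() < epsilon else Q[s].index(max(Q[s]))
        s2 = max(0, s - 1) if a == 0 else s + 1
        r = 1.0 if s2 == 4 else 0.0   # reward only at the goal
        # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a')
        Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
        s = s2

print([q.index(max(q)) for q in Q])   # learned policy: mostly "right" (1)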
Policy-based Methods: Directly learn a policy (a mapping from states to
actions) without estimating value functions; the agent continuously adjusts its
policy to maximize rewards.
REINFORCE Algorithm
Actor-Critic Algorithm
Asynchronous Advantage Actor-Critic (A3C)