Lecture Notes: Machine Learning
Fundamentals & Deep Learning
I. Learning Algorithms
1. What is a Learning Algorithm?
Definition: A method that improves performance (P) on a task (T) through
experience (E).
Components:
o Task (T): Classification, regression, clustering.
o Performance Metric (P): Accuracy, MSE, F1-score.
o Experience (E): Training data (labeled or unlabeled).
2. Model Capacity
Definition: A model’s ability to fit complex functions.
Low Capacity: Risk of underfitting (e.g., linear models).
High Capacity: Risk of overfitting (e.g., deep neural networks).
VC Dimension: Measures capacity for binary classification (the size of the largest point set the model can shatter).
II. Overfitting & Underfitting
1. Overfitting
Symptoms:
o Low training error, high test error.
o Model memorizes noise in training data.
Solutions:
o Regularization (L1/L2, dropout).
o Cross-validation.
o Simplify the model.
2. Underfitting
Symptoms:
o High training and test error.
o Model is too simple.
Solutions:
o Increase model capacity.
o Feature engineering.
III. Hyperparameters & Validation Sets
1. Hyperparameters vs. Parameters
Hyperparameters: Set before training (e.g., learning rate, batch size).
Parameters: Learned during training (e.g., weights in a neural network).
2. Validation Sets
Purpose: Tune hyperparameters without touching the test set.
Techniques:
o Holdout Validation: Split data into train/validation/test.
o k-Fold Cross-Validation: Average performance over k splits.
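A minimal NumPy sketch of k-fold cross-validation as described above; the `fit`/`score` callables and the toy regression data are illustrative assumptions, not part of the notes:

```python
import numpy as np

def k_fold_scores(X, y, k, fit, score):
    """Train on k-1 folds, evaluate on the held-out fold, average the scores."""
    idx = np.random.permutation(len(X))
    folds = np.array_split(idx, k)
    scores = []
    for i in range(k):
        val = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        model = fit(X[train], y[train])
        scores.append(score(model, X[val], y[val]))
    return np.mean(scores)

# Usage with a least-squares fit and MSE score on toy data:
X = np.random.randn(100, 3)
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * np.random.randn(100)
fit = lambda X, y: np.linalg.lstsq(X, y, rcond=None)[0]
score = lambda w, X, y: np.mean((X @ w - y) ** 2)
print(k_fold_scores(X, y, k=5, fit=fit, score=score))  # ≈ 0.01 (noise variance)
```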
IV. Bias & Variance
1. Definitions
Bias: Error due to overly simplistic assumptions.
Variance: Error due to sensitivity to small fluctuations in training data.
2. Tradeoff
High Bias → Underfitting.
High Variance → Overfitting.
Balanced Model: Complexity matched to the data, often achieved via regularization.
V. Maximum Likelihood Estimation (MLE)
Goal: Find parameters that maximize the likelihood of observed data.
Formula:
θ_MLE = argmax_θ P(X ∣ θ)
Example: MLE for Gaussian distribution → Sample mean & variance.
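A quick sketch of that example: for a univariate Gaussian, the MLE is the sample mean and the (1/N, i.e. biased) sample variance. The toy data below is assumed for illustration:

```python
import numpy as np

x = np.random.normal(loc=2.0, scale=1.5, size=10_000)  # toy data
mu_mle = x.mean()
var_mle = x.var()         # NumPy's default ddof=0 is exactly the MLE
print(mu_mle, var_mle)    # ≈ 2.0 and ≈ 2.25
```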
VI. Bayesian Statistics
Bayes’ Theorem:
P(θ ∣ X) = P(X ∣ θ) P(θ) / P(X)
Key Concepts:
o Prior: Belief before seeing data.
o Posterior: Updated belief after observing data.
o Conjugate Priors: Simplify posterior computation (e.g., Beta for Bernoulli).
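A tiny sketch of the Beta–Bernoulli conjugate pair mentioned above; the prior pseudo-counts and the coin-flip data are assumed for illustration:

```python
# Beta(a, b) prior + Bernoulli likelihood → Beta(a + heads, b + tails)
# posterior, in closed form.
a, b = 2.0, 2.0                 # prior pseudo-counts
flips = [1, 0, 1, 1, 0, 1]      # observed Bernoulli outcomes
heads = sum(flips)
tails = len(flips) - heads
a_post, b_post = a + heads, b + tails
print(a_post, b_post, a_post / (a_post + b_post))  # posterior mean = 0.6
```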
VII. Supervised Learning Algorithms
1. Linear Regression
Objective: Minimize MSE.
Closed-form solution:
w = (XᵀX)⁻¹ Xᵀy
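A minimal NumPy sketch of the closed-form solution; solving the normal equations with `np.linalg.solve` is used here rather than forming the inverse explicitly (a standard numerical choice, not something the notes prescribe):

```python
import numpy as np

# Solve the normal equations (XᵀX) w = Xᵀy directly.
X = np.random.randn(200, 3)
y = X @ np.array([2.0, -1.0, 0.5]) + 0.1 * np.random.randn(200)
w = np.linalg.solve(X.T @ X, X.T @ y)
print(w)  # ≈ [2.0, -1.0, 0.5]
```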
2. Logistic Regression
Objective: Maximize log-likelihood (binary classification).
Sigmoid function:
P(y = 1 ∣ x) = 1 / (1 + e^(−wᵀx))
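A short sketch of logistic regression trained by gradient ascent on the log-likelihood; the toy data, learning rate, and iteration count are assumptions:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Gradient ascent on the log-likelihood; its gradient is Xᵀ(y − p).
X = np.random.randn(500, 2)
y = (X @ np.array([1.5, -2.0]) > 0).astype(float)   # separable toy labels
w = np.zeros(2)
for _ in range(1000):
    p = sigmoid(X @ w)
    w += 0.1 * X.T @ (y - p) / len(y)
print(w)  # grows along a direction ∝ [1.5, -2.0] (data is separable)
```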
3. Decision Trees
Splitting Criteria: Gini impurity, information gain.
Pruning: Prevent overfitting.
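A small sketch of one splitting criterion, Gini impurity (the helper name and the example labels are illustrative):

```python
import numpy as np

def gini(labels):
    """Gini impurity: 1 − Σ_k p_k², with p_k the fraction of class k."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

print(gini([0, 0, 0, 0]))   # 0.0 — pure node
print(gini([0, 0, 1, 1]))   # 0.5 — maximally mixed binary node
```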
VIII. Unsupervised Learning
1. Clustering (k-Means)
Steps:
1. Initialize k centroids.
2. Assign points to nearest centroid.
3. Update centroids.
4. Repeat until convergence.
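A from-scratch NumPy sketch of the four steps above (no empty-cluster handling; the initialization and convergence test are simple illustrative choices):

```python
import numpy as np

def k_means(X, k, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)]   # 1. initialize
    for _ in range(n_iter):
        # 2. assign each point to its nearest centroid
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # 3. update each centroid to the mean of its assigned points
        new = np.array([X[labels == j].mean(axis=0) for j in range(k)])
        if np.allclose(new, centroids):                        # 4. converged
            break
        centroids = new
    return centroids, labels
```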
2. PCA (Dimensionality Reduction)
Goal: Project data onto lower-dimensional space.
Eigenvalue decomposition:
XᵀX = V Λ Vᵀ
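A minimal sketch of PCA via eigendecomposition of the centered covariance matrix; the toy data and the choice of two components are assumptions:

```python
import numpy as np

# PCA: center the data, eigendecompose the covariance matrix,
# then project onto the top eigenvectors (principal components).
X = np.random.randn(300, 5)
Xc = X - X.mean(axis=0)                 # centering is essential
cov = Xc.T @ Xc / len(Xc)
eigvals, V = np.linalg.eigh(cov)        # eigh: for symmetric matrices
order = np.argsort(eigvals)[::-1]       # sort by decreasing variance
Z = Xc @ V[:, order[:2]]                # project onto top-2 components
print(Z.shape)                          # (300, 2)
```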
IX. Stochastic Gradient Descent (SGD)
Update Rule:
w_{t+1} = w_t − η ∇L(w_t)
Variants:
o Momentum: Accumulate past gradients.
o Adam: Adaptive learning rates.
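A compact sketch of the update rule with the momentum variant listed above; the toy quadratic objective and the hyperparameter values are illustrative assumptions:

```python
import numpy as np

def sgd_momentum(grad, w0, lr=0.01, beta=0.9, steps=1000):
    """w_{t+1} = w_t − η v_t, where v accumulates past gradients."""
    w, v = w0.copy(), np.zeros_like(w0)
    for _ in range(steps):
        v = beta * v + grad(w)    # momentum: running accumulation of gradients
        w -= lr * v
    return w

# Usage on a toy quadratic L(w) = ||w − 3||², so ∇L(w) = 2(w − 3):
print(sgd_momentum(lambda w: 2 * (w - 3.0), w0=np.zeros(2)))  # ≈ [3.0, 3.0]
```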
X. Building a Machine Learning Algorithm
1. Define the task (e.g., classification).
2. Choose a model (e.g., neural network).
3. Select a loss function (e.g., cross-entropy).
4. Optimize (e.g., SGD).
5. Evaluate (e.g., test set accuracy).
XI. Challenges Motivating Deep Learning
Limitations of Shallow Models:
o Cannot learn hierarchical features.
o Struggle with high-dimensional data (e.g., images).
Deep Learning Solutions:
o Automatic feature extraction.
o Scalability with data.
XII. Deep Feedforward Networks
1. Learning XOR
Problem: XOR is not linearly separable, so linear models fail.
Solution: 2-layer NN with non-linear activation (e.g., ReLU).
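A runnable sketch of that solution. Note this uses tanh hidden units rather than the ReLU mentioned above, since ReLU units can "die" on a four-point dataset; the layer sizes, learning rate, and seed are illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)
W1, b1 = rng.normal(size=(2, 4)), np.zeros(4)   # hidden layer: 4 units
W2, b2 = rng.normal(size=(4, 1)), np.zeros(1)   # output layer
for _ in range(5000):
    h = np.tanh(X @ W1 + b1)                    # non-linear hidden layer
    p = 1.0 / (1.0 + np.exp(-(h @ W2 + b2)))    # sigmoid output
    dz2 = p - y                                  # ∂(cross-entropy)/∂logits
    dh = (dz2 @ W2.T) * (1.0 - h ** 2)           # backprop through tanh
    W2 -= 0.5 * h.T @ dz2;  b2 -= 0.5 * dz2.sum(axis=0)
    W1 -= 0.5 * X.T @ dh;   b1 -= 0.5 * dh.sum(axis=0)
print(p.round(2))  # should approach [[0], [1], [1], [0]]
```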
2. Gradient-Based Learning
Backpropagation: Chain rule for computing gradients.
Vanishing Gradients: Mitigated by ReLU activations and skip connections.
3. Hidden Units
ReLU: f(x) = max(0, x) (avoids vanishing gradients).
Sigmoid/Tanh: Still used in gating units (e.g., LSTMs).
4. Architecture Design
Wide vs. Deep: Tradeoff between parameters and abstraction.
Residual Networks (ResNet): Skip connections for deep networks.
XIII. Backpropagation & Optimization
1. Chain Rule
∂L/∂w = (∂L/∂y) · (∂y/∂w)
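A quick numeric check of the chain rule on a one-parameter example, L = (y − t)² with y = w·x (the values are assumed for illustration), compared against a finite-difference estimate:

```python
# Chain rule: ∂L/∂w = (∂L/∂y)(∂y/∂w) = 2(y − t) · x
w, x, t = 1.5, 2.0, 7.0
y = w * x
analytic = 2 * (y - t) * x
eps = 1e-6  # finite-difference sanity check of the analytic gradient
numeric = (((w + eps) * x - t) ** 2 - ((w - eps) * x - t) ** 2) / (2 * eps)
print(analytic, numeric)  # both ≈ -16.0
```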
2. Optimization Algorithms
SGD: Basic but noisy.
Adam: Combines momentum + adaptive learning rates.
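A sketch of the Adam update as described (first/second-moment estimates plus bias correction); the toy objective is the same quadratic as the SGD sketch, and the hyperparameter values follow common defaults but are assumptions here:

```python
import numpy as np

def adam(grad, w0, lr=0.01, b1=0.9, b2=0.999, eps=1e-8, steps=2000):
    """Adam: momentum (m) + per-coordinate adaptive learning rates (v)."""
    w = w0.copy()
    m, v = np.zeros_like(w), np.zeros_like(w)
    for t in range(1, steps + 1):
        g = grad(w)
        m = b1 * m + (1 - b1) * g        # first-moment (mean) estimate
        v = b2 * v + (1 - b2) * g ** 2   # second-moment estimate
        m_hat = m / (1 - b1 ** t)        # bias correction for zero init
        v_hat = v / (1 - b2 ** t)
        w -= lr * m_hat / (np.sqrt(v_hat) + eps)
    return w

# Toy quadratic with minimum at w = 3, ∇L(w) = 2(w − 3):
print(adam(lambda w: 2 * (w - 3.0), w0=np.zeros(2)))  # ≈ [3.0, 3.0]
```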