MODULE 1
SHORT NOTES
Module-1 (Overview of machine learning) Introduction to Machine Learning, Machine learning
paradigms-supervised, semi-supervised, unsupervised, reinforcement learning. Supervised
learning- Input representation, Hypothesis class, Version space, Vapnik–Chervonenkis (VC)
Dimension, Probably Approximately Correct Learning (PAC), Noise, Learning Multiple classes,
Model Selection and Generalization
1. Introduction to Machine Learning
Machine Learning (ML) is a subfield of Artificial Intelligence that focuses on creating algorithms
that learn patterns from data and make predictions or decisions without being explicitly
programmed.
● The ML process involves collecting data → training a model → testing on new data.
● It improves automatically with more data and better algorithms.
● Example: Spam filter learns from examples of spam and non-spam emails.
2. Machine Learning Paradigms
2.1 Supervised Learning
● Definition: Learning from labeled datasets where both input (X) and output (Y) are
known.
● Goal: Learn a mapping f: X → Y to predict Y for new X.
● Types:
○ Regression → predicts continuous values (e.g., temperature).
○ Classification → predicts categories (e.g., “pass” or “fail”).
● Example: Predicting house prices from size, location, and number of rooms.
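The house-price example can be sketched as a tiny supervised regression. This is a minimal illustration with made-up data and a single feature (size), fitting y = m·x + c by ordinary least squares:

```python
# Minimal supervised-learning sketch: fit y = m*x + c by least squares.
# The sizes and prices below are made-up illustration data, not real figures.
def fit_line(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    m = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) \
        / sum((x - mx) ** 2 for x in xs)
    c = my - m * mx
    return m, c

sizes  = [50, 70, 90, 110]        # input X (square metres)
prices = [100, 140, 180, 220]     # output Y (price, in thousands)
m, c = fit_line(sizes, prices)    # learned mapping f(x) = m*x + c
print(m, c)
print(m * 80 + c)                 # predict the price of an unseen 80 m² house
```

Training "learns the mapping" (here, m and c); prediction applies it to a new X that was never in the training set.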
2.2 Semi-Supervised Learning
● Uses a small labeled dataset + a large unlabeled dataset.
● Useful when labeling data is expensive or time-consuming.
● Labeled data guides the learning, unlabeled data helps improve accuracy.
● Example: Language translation with a few manually translated sentences and many
untranslated ones.
2.3 Unsupervised Learning
● Learns from unlabeled data (no output labels given).
● Goal: Find hidden patterns, clusters, or structures.
● Example:
○ Clustering: Group customers based on buying behavior.
○ Dimensionality Reduction: Reduce features while keeping important info (PCA).
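The clustering idea can be sketched with a toy k-means on one feature (yearly spend per customer). The data, the number of clusters (k = 2), and the starting centroids are all illustrative assumptions:

```python
# Toy k-means sketch (k = 2, one feature): group customers by yearly spend.
# Data, k, and initial centroids are illustrative assumptions.
def kmeans(xs, c1, c2, iters=10):
    for _ in range(iters):
        # assignment step: each point joins its nearest centroid
        g1 = [x for x in xs if abs(x - c1) <= abs(x - c2)]
        g2 = [x for x in xs if abs(x - c1) > abs(x - c2)]
        # update step: recompute each centroid as its group's mean
        c1, c2 = sum(g1) / len(g1), sum(g2) / len(g2)
    return c1, c2

spend = [10, 12, 11, 80, 85, 82]   # two natural spending groups, no labels
c1, c2 = kmeans(spend, 10.0, 80.0)
print(c1, c2)
```

No labels were given; the algorithm discovers the low-spend and high-spend groups from the data's structure alone.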
2.4 Reinforcement Learning
● Learn by interacting with an environment and receiving rewards or penalties.
● Key terms:
○ Agent → learner/decision-maker.
○ Environment → system where agent acts.
○ Reward → positive or negative feedback.
● Example: Game-playing AI that learns to win by trial and error.
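The agent/environment/reward loop can be sketched with a two-armed bandit and an epsilon-greedy agent. The payout probabilities below are assumed, and the exploration rate (0.1) is an arbitrary choice:

```python
import random

# Reinforcement-learning sketch: an epsilon-greedy agent learning by trial
# and error which of two slot-machine arms pays more.
# The payout probabilities and exploration rate are assumed for illustration.
random.seed(0)
true_payout = [0.2, 0.8]   # environment: hidden reward probability per arm
q = [0.0, 0.0]             # agent's value estimate per arm
n = [0, 0]                 # pulls per arm

for step in range(2000):
    # explore with probability 0.1, otherwise exploit the best-looking arm
    arm = random.randrange(2) if random.random() < 0.1 else q.index(max(q))
    reward = 1 if random.random() < true_payout[arm] else 0  # env feedback
    n[arm] += 1
    q[arm] += (reward - q[arm]) / n[arm]   # incremental average update

print(q.index(max(q)))     # the agent should come to prefer the better arm
```

The agent is never told which arm is better; it infers it from rewards alone — the same trial-and-error principle behind game-playing AIs, at a much smaller scale.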
3. Supervised Learning – Key Concepts
Prepared by: Prof Merlin Joshi
3.1 Input Representation
● How data is presented to the model (features).
● Good feature selection improves performance.
● Types:
○ Numeric → Age, height, salary.
○ Categorical → Gender, color, city (often converted to numeric using encoding).
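Converting a categorical feature to numeric form is often done with one-hot encoding: one binary column per category. A minimal sketch, with illustrative category names:

```python
# One-hot encoding sketch: turn a categorical value into binary columns,
# one per known category. Category names are illustrative.
def one_hot(value, categories):
    return [1 if value == c else 0 for c in categories]

cities = ["Delhi", "Mumbai", "Chennai"]
print(one_hot("Mumbai", cities))   # exactly one column is "hot"
```

This avoids imposing a false numeric order (e.g., coding cities as 0, 1, 2 would wrongly suggest Chennai > Mumbai > Delhi).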
3.2 Hypothesis Class
● The complete set of functions the learning algorithm can pick from.
● Example: For linear regression, the hypothesis class is the set of all straight-line
equations y = mx + c.
● Bigger hypothesis class → more flexibility but risk of overfitting.
3.3 Version Space
● The subset of hypotheses from the hypothesis class that fit all training examples.
● As we get more training data, wrong hypotheses are removed, and the version space
becomes smaller.
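The shrinking of the version space can be made concrete with a toy finite hypothesis class: thresholds t, where hypothesis h_t predicts positive when x ≥ t. The candidate thresholds and examples below are made up for illustration:

```python
# Version-space sketch: hypotheses are thresholds t meaning "positive iff x >= t".
# Thresholds and training examples are made up for illustration.
def consistent(t, examples):
    return all((x >= t) == label for x, label in examples)

hypotheses = [1, 2, 3, 4, 5, 6, 7, 8, 9]   # finite hypothesis class

data = [(2, False)]                         # one training example
vs = [t for t in hypotheses if consistent(t, data)]
print(len(vs))                              # hypotheses still consistent

data.append((7, True))                      # a second example arrives
vs = [t for t in hypotheses if consistent(t, data)]
print(len(vs))                              # version space has shrunk
```

Each new example eliminates every hypothesis that disagrees with it, so the version space can only shrink (or stay the same) as data accumulates.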
3.4 VC Dimension
● Vapnik–Chervonenkis (VC) dimension measures the capacity of a hypothesis class.
● Higher VC → can fit more complex patterns.
● Example: A straight line in 2D can shatter at most 3 points — it can realize every
possible labelling of 3 points in general position, but not of any 4 — so VC = 3.
3.5 PAC Learning
Probably Approximately Correct learning framework.
States that a learning algorithm should produce a hypothesis that:
● Has error ≤ ϵ (approximately correct).
● With probability ≥ 1 − δ (probably correct).
Example: With 95% confidence (δ = 0.05), the error is at most 5% (ϵ = 0.05).
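For a finite hypothesis class H, a standard PAC bound says that m ≥ (1/ϵ)(ln|H| + ln(1/δ)) training examples suffice (for a consistent learner). A quick computation, with |H| = 1000 as an assumed class size:

```python
import math

# PAC sample-size sketch for a finite hypothesis class, using the standard
# bound m >= (1/eps) * (ln|H| + ln(1/delta)) for a consistent learner.
# |H| = 1000 below is an assumed class size for illustration.
def pac_samples(h_size, eps, delta):
    return math.ceil((math.log(h_size) + math.log(1 / delta)) / eps)

print(pac_samples(1000, 0.05, 0.05))   # eps = delta = 0.05
```

Note how the bound grows only logarithmically in |H| and 1/δ, but linearly in 1/ϵ: demanding lower error is much more expensive than demanding higher confidence.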
3.6 Noise
● Unwanted variations in data that don’t represent the true pattern.
● Sources:
○ Wrong labels (label noise).
○ Faulty measurements (attribute noise).
● Noise can mislead the model, reducing accuracy.
3.7 Learning Multiple Classes
● Many problems have more than 2 categories.
● Approaches:
○ One-vs-All (OvA) → One classifier per class vs all others.
○ One-vs-One (OvO) → Classifier for each pair of classes.
○ Direct multi-class algorithms (e.g., decision trees, softmax regression).
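The One-vs-All idea — one scorer per class, predict the class whose scorer is most confident — can be sketched with a deliberately simple per-class scorer (negative distance to the class mean). The three-class 1-D data below is invented for illustration:

```python
# One-vs-All sketch: one scorer per class; prediction takes the argmax.
# The per-class "scorer" here is a toy nearest-mean score on made-up data.
def train_ova(samples):
    # samples: list of (x, class_label); fit one model (a mean) per class
    classes = sorted({c for _, c in samples})
    return {c: sum(x for x, k in samples if k == c) /
               sum(1 for _, k in samples if k == c) for c in classes}

def predict(centroids, x):
    scores = {c: -abs(x - m) for c, m in centroids.items()}  # one score per class
    return max(scores, key=scores.get)                       # most confident wins

data = [(1.0, "low"), (2.0, "low"), (10.0, "mid"), (11.0, "mid"),
        (20.0, "high"), (22.0, "high")]
model = train_ova(data)
print(predict(model, 9.5))
```

Real OvA systems train one binary classifier per class against all the others (e.g., three logistic regressions for three classes); the structure — per-class scores followed by an argmax — is the same as above.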
3.8 Model Selection
● Choosing the best model and hyperparameters for a task.
● Methods:
○ Cross-validation → test model on different data splits.
○ Grid search → try combinations of parameters.
● Goal: Best accuracy while avoiding overfitting.
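Cross-validation can be sketched as: split the data into k folds, hold each fold out in turn as the test set, and average the k scores. The "model" below is a trivial mean predictor on made-up numbers, just to show the splitting logic:

```python
# k-fold cross-validation sketch: score a model on k train/test splits.
# The "model" (a mean predictor) and the data are toy choices for illustration.
def k_fold_scores(data, k, fit, score):
    folds = [data[i::k] for i in range(k)]     # simple round-robin split
    results = []
    for i in range(k):
        test = folds[i]                        # hold fold i out for testing
        train = [x for j, f in enumerate(folds) if j != i for x in f]
        results.append(score(fit(train), test))
    return results

ys = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
fit = lambda train: sum(train) / len(train)                   # mean predictor
score = lambda m, test: sum((y - m) ** 2 for y in test) / len(test)  # MSE
mse = k_fold_scores(ys, 3, fit, score)
print(sum(mse) / len(mse))    # cross-validated error estimate
```

Averaging over splits gives a more reliable error estimate than a single train/test split, which is why model and hyperparameter choices are usually compared on cross-validated scores.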
3.9 Generalization
● The ability of a trained model to work well on new, unseen data.
● Overfitting: Learns noise, works badly on new data.
● Underfitting: Too simple, misses patterns.
● Good generalization comes from balanced model complexity and enough training data.