
CHAPTER VII

Semi-supervised Learning
Dr. K. Rajkumar,
Dean of International Relations,
Associate Professor,
Department of Computer Science
Bishop Heber College
Semi-supervised Learning
• Supervised methods use labeled data for training, while unsupervised
methods are applied to unlabeled data and must determine the importance
of features on their own.
• Semi-supervised methods follow a hybrid approach, using a small
number of labeled instances together with a large number of unlabeled
instances.
Why Semi-supervised Learning?
• Labeling data is a time-consuming and costly task that is often
performed by a data scientist.
• Manual labeling can also introduce human bias.



• Semi-supervised methods can perform considerably better than
unsupervised methods, because using the unlabeled data during training
improves accuracy.
• However, using semi-supervised methods is not always possible, and it
is often hard to know the distribution of the unlabeled data.
• There are two basic semi-supervised approaches: transductive and
inductive learning.
• Transductive learning tries to use the labeled data to infer the
labels of the unlabeled data.
• Inductive learning tries to deduce rules from the labeled data that
can then be applied to the unlabeled data set.



• Transductive approach steps:
➢First, a learner such as a naïve Bayes classifier is trained to learn the classes.
➢The trained learner is then applied to assign class probabilities to the unlabeled
data. This step is called the “expectation” step.
➢Next, a new learner is trained using the labels of all the data. This step is called
the “maximization” step.
➢The steps are repeated until the model no longer produces a different estimate.
➢This procedure is called the expectation maximization algorithm, or EM
algorithm. Each expectation maximization iteration generalizes the model
further.
➢The expectation maximization procedure guarantees finding model parameters
with equal or greater likelihood at each iteration. A code sketch of the loop
follows below.
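
To make the loop concrete, here is a minimal Python sketch of the expectation
maximization procedure with scikit-learn's GaussianNB as the learner. The
function name, the convergence tolerance, and the soft-weighting scheme are
illustrative assumptions, not details from the slides.

import numpy as np
from sklearn.naive_bayes import GaussianNB

def em_naive_bayes(X_lab, y_lab, X_unl, n_iter=50, tol=1e-4):
    # initial learner trained on the labeled data only
    classes = np.unique(y_lab)
    model = GaussianNB()
    model.fit(X_lab, y_lab)
    prev = np.zeros((len(X_unl), len(classes)))
    for _ in range(n_iter):
        # E-step: assign class probabilities to the unlabeled data
        post = model.predict_proba(X_unl)
        if np.abs(post - prev).max() < tol:   # estimates no longer change
            break
        prev = post
        # M-step: retrain on all the data; each unlabeled row enters once
        # per class, weighted by its posterior probability
        X_all = np.vstack([X_lab] + [X_unl] * len(classes))
        y_all = np.concatenate([y_lab] + [np.full(len(X_unl), c) for c in classes])
        w_all = np.concatenate([np.ones(len(X_lab))] + [post[:, k] for k in range(len(classes))])
        model = GaussianNB()
        model.fit(X_all, y_all, sample_weight=w_all)
    return model

Entering each unlabeled instance once per class, weighted by its posterior, is
what makes this a soft EM variant rather than plain self-training with hard
labels.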
Expectation Maximization
• Expectation maximization (EM), also called expectation
maximization cluster analysis, is a method for maximizing the
likelihood function when some of the variables in the model cannot be
directly observed, i.e., latent variables.
• Expectation maximization assumes that the data is composed of
multiple multivariate normal distributions, which is a strong
assumption.
• It iteratively tries to find an optimal model by alternately improving
the model and the assignment of objects to the model.



• Under certain conditions we might still be able to determine the means
for each group by iteratively applying expectation maximization.
• Expectation maximization is used to determine the mean and
standard deviation parameters for each group, as the sketch below
illustrates.
• To verify whether there is some structure in a class rather than just
random data, the algorithm can be applied to single classes.
• The expectation maximization algorithm tends to be very slow;
especially with high-dimensional data, the expectation step can be
expensive.
• Also, the algorithm can get stuck in a local maximum that is far from
the global maximum.
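
As a toy illustration of how expectation maximization recovers the mean and
standard deviation of each group, the following sketch fits a mixture of two
one-dimensional Gaussians. The initialization, the fixed iteration count, and
all variable names are assumptions chosen for brevity.

import numpy as np
from scipy.stats import norm

def em_two_gaussians(x, n_iter=100):
    # crude initialization from the data itself
    mu = np.array([x.min(), x.max()], dtype=float)
    sigma = np.array([x.std(), x.std()])
    pi = np.array([0.5, 0.5])                 # mixing weights
    for _ in range(n_iter):
        # E-step: responsibility of each component for each point
        dens = np.stack([pi[k] * norm.pdf(x, mu[k], sigma[k]) for k in (0, 1)])
        resp = dens / dens.sum(axis=0)
        # M-step: re-estimate parameters from the weighted points
        for k in (0, 1):
            w = resp[k]
            mu[k] = np.average(x, weights=w)
            sigma[k] = np.sqrt(np.average((x - mu[k]) ** 2, weights=w))
            pi[k] = w.mean()
    return mu, sigma, pi

Because each iteration can only raise (or keep) the likelihood, the estimates
stabilize, but where they stabilize depends on the starting point, which is
exactly the local maximum problem noted above.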



Pseudo Labeling
• Pseudo labeling is a simple and efficient technique for semi-
supervised learning.
• It is also used in deep learning; in fact, pseudo labeling can be used
with most neural networks and training methods.
• In semi-supervised learning, the features are learned from the labeled
data.
• Pseudo labeling takes advantage of the information in the unlabeled
data to gain a better understanding of the structure of the data.
• Pseudo labeling can therefore be used to learn from the unlabeled data.



• Pseudo labeling goes through the following steps (a code sketch
follows the list):
i. Train a model, or several models, using the labeled data set. The
training data set might have to be manually labeled.
ii. The model that performed best is then used on the unlabeled
data to predict the classes.
iii. Combine the training set carrying the true labels with the one
carrying the pseudo labels.
iv. Train the model as before, but with the combined data sets.
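
A minimal sketch of these four steps using scikit-learn follows; the candidate
models, the validation split, and all names are illustrative assumptions rather
than a prescribed setup.

import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

def pseudo_label(X_lab, y_lab, X_unl):
    # hold out part of the labeled data to compare candidate models
    X_tr, X_val, y_tr, y_val = train_test_split(X_lab, y_lab, test_size=0.2, random_state=0)
    # step i: train one or several models on the labeled data
    candidates = [RandomForestClassifier(random_state=0), LogisticRegression(max_iter=1000)]
    for m in candidates:
        m.fit(X_tr, y_tr)
    # step ii: the best-performing model predicts classes for the unlabeled data
    best = max(candidates, key=lambda m: m.score(X_val, y_val))
    pseudo = best.predict(X_unl)
    # step iii: combine the truly labeled set with the pseudo-labeled one
    X_all = np.vstack([X_lab, X_unl])
    y_all = np.concatenate([y_lab, pseudo])
    # step iv: train the model as before, but on the combined data
    final = RandomForestClassifier(random_state=0)
    final.fit(X_all, y_all)
    return final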



• Deep learners often use unsupervised methods for pre-training.
• The initial weights of a deep neural network are initialized by
applying layerwise unsupervised training.
• After the weights are initialized, they are fine-tuned using labeled data
and the backpropagation algorithm in a supervised fashion; a sketch of
this two-stage procedure is given below.
• This also works using semi-supervised methods.
• In many cases, even with older approaches such as the naïve Bayes
classifier, we can obtain superior performance by adding unlabeled
data and using semi-supervised learning.
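
The following compact PyTorch sketch shows layerwise unsupervised pre-training
followed by supervised fine-tuning with backpropagation; the architecture,
layer sizes, toy data, and training loop are illustrative assumptions, not a
prescribed recipe.

import torch
import torch.nn as nn

# toy placeholder data; in practice these come from the real data sets
X_unl = torch.randn(500, 64)                  # unlabeled inputs
X_lab = torch.randn(100, 64)                  # labeled inputs
y_lab = torch.randint(0, 3, (100,))           # labels for three classes
n_classes = 3

def pretrain_layer(encoder, X, epochs=20, lr=1e-3):
    # train one layer as an autoencoder that reconstructs its own input
    decoder = nn.Linear(encoder.out_features, encoder.in_features)
    opt = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=lr)
    for _ in range(epochs):
        opt.zero_grad()
        recon = decoder(torch.relu(encoder(X)))
        loss = nn.functional.mse_loss(recon, X)
        loss.backward()
        opt.step()
    return torch.relu(encoder(X)).detach()    # output feeds the next layer

# layerwise unsupervised pre-training on the unlabeled data
layers = [nn.Linear(64, 32), nn.Linear(32, 16)]
h = X_unl
for layer in layers:
    h = pretrain_layer(layer, h)

# supervised fine-tuning of the pre-trained weights with backpropagation
net = nn.Sequential(layers[0], nn.ReLU(), layers[1], nn.ReLU(), nn.Linear(16, n_classes))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)
for _ in range(100):
    opt.zero_grad()
    loss = nn.functional.cross_entropy(net(X_lab), y_lab)
    loss.backward()
    opt.step()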
