SCAN: Learning to Classify Images
without Labels
Wouter Van Gansbeke, Simon Vandenhende, Stamatios
Georgoulis, Marc Proesmans and Luc Van Gool
Unsupervised Image Classification
Task: Group a set of unlabeled images into semantically
meaningful clusters.
(Figure: unlabeled images grouped into semantically meaningful clusters, e.g. bird, cat, car, deer.)
Prior work – Two dominant paradigms
I. Representation Learning
- Idea: Use a self-supervised pretext task + off-line clustering (K-means).
- Ex 1: Predict transformations (e.g. rotations [1], colorization [2]).
- Ex 2: Instance discrimination [3].
- Problem: K-means leads to cluster degeneracy.
II. End-To-End Learning
- Idea: Leverage the architecture of CNNs as a prior (e.g. DAC, DeepCluster, DEC),
  or maximize mutual information between an image and its augmentations (e.g. IMSAT, IIC).
- Problems:
  - Cluster learning depends on initialization and is likely to latch onto low-level features.
  - Special mechanisms are required (Sobel, PCA, cluster re-assignments, etc.).
[1] Unsupervised representation learning by predicting image rotations, Gidaris et al. (2018)
[2] Colorful Image Colorization, Zhang et al. (2016)
[3] Unsupervised feature learning via non-parametric instance discrimination, Wu et al. (2018)
SCAN: Semantic Clustering by Adopting Nearest Neighbors
Approach: a two-step method in which feature learning and
clustering are decoupled.
Step 1: Solve a pretext task + mine k-NN.
Step 2: Train a clustering model by imposing consistent predictions among neighbors.
Step 1: Solve a pretext task + Mine k-NN
Question: How to select a pretext task appropriate for the
down-stream task of semantic clustering?
Problem: Pretext tasks which try to predict image
transformations result in a feature representation that is
covariant to the applied transformation.
→ Undesired for the down-stream task of semantic clustering.
→ Solution: Pretext model should minimize the distance
between an image and its augmentations.
[1] Unsupervised representation learning by predicting image rotations, Gidaris et al. (2018)
[2] Colorful Image Colorization, Zhang et al. (2016)
[3] AET vs AED, Zhang et al. (2019)
Step 1: Solve a pretext task + Mine k-NN
Question: How to select a pretext task appropriate for the
down-stream task of semantic clustering?
Instance discrimination satisfies the
invariance criterion w.r.t. augmentations
applied during training.
[1] Unsupervised feature learning via non-parametric instance discrimination, Wu et al. (2018)
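To make the invariance criterion concrete, here is a minimal sketch of a SimCLR-style instance-discrimination (NT-Xent) loss on two augmented views of the same batch; the function name, temperature value and tensor shapes are illustrative assumptions, not the released implementation.

```python
# Minimal sketch of an instance-discrimination (SimCLR-style NT-Xent) loss.
# `z1`, `z2` are L2-normalized embeddings of two augmentations of the same batch.
import torch
import torch.nn.functional as F

def nt_xent_loss(z1, z2, temperature=0.1):
    z = torch.cat([z1, z2], dim=0)                 # (2B, D)
    n = z.shape[0]
    sim = z @ z.t() / temperature                  # pairwise similarities (2B, 2B)
    sim = sim - torch.eye(n, device=z.device) * 1e9   # mask self-similarity
    # the positive for sample i is its other augmented view
    targets = (torch.arange(n, device=z.device) + n // 2) % n
    return F.cross_entropy(sim, targets)

# embeddings of two views, e.g. from the pretext model's projection head
z1 = F.normalize(torch.randn(8, 128), dim=1)
z2 = F.normalize(torch.randn(8, 128), dim=1)
print(nt_xent_loss(z1, z2))
```

Because the loss pulls the two augmented views of the same image together, the learned embedding is (approximately) invariant to the augmentations applied during training, which is exactly the property required above.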
Step 1: Solve a pretext task + Mine k-NN
The nearest neighbors tend to belong to the same semantic
class.
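A minimal sketch of the neighbor-mining step, assuming the pretext model's embeddings have already been extracted for the whole dataset; the function name and the value of k are illustrative.

```python
# Mine the k nearest neighbors of every sample in the pretext embedding space.
# `features` is an (N, D) tensor of embeddings of the full dataset.
import torch
import torch.nn.functional as F

def mine_nearest_neighbors(features, k=20):
    features = F.normalize(features, dim=1)
    similarity = features @ features.t()       # cosine similarity (N, N)
    # take top-(k+1) because every sample is its own nearest neighbor
    _, indices = similarity.topk(k + 1, dim=1)
    return indices[:, 1:]                      # (N, k) neighbor indices

features = torch.randn(1000, 128)              # e.g. SimCLR/MoCo embeddings
neighbors = mine_nearest_neighbors(features, k=20)
print(neighbors.shape)                         # torch.Size([1000, 20])
```

For large datasets such as ImageNet the full N x N similarity matrix does not fit in memory, so an (approximate) nearest-neighbor search library would be used instead.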
Step 2: Train clustering model
- SCAN loss (sketched in code below):
  (1) Enforce consistent predictions among neighbors by maximizing the dot
  product between their softmax outputs; the dot product is maximal when both
  predictions are confident (one-hot) and assign the pair to the same cluster.
  (2) Maximize the entropy of the mean cluster assignment to avoid
  all samples being assigned to the same cluster.
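A minimal sketch of the two terms above, assuming `anchor_logits` and `neighbor_logits` are the clustering head's outputs for a batch of images and one mined neighbor per image; the entropy weight is an illustrative choice.

```python
# Sketch of the SCAN loss: neighbor consistency + entropy regularizer.
import torch
import torch.nn.functional as F

def scan_loss(anchor_logits, neighbor_logits, entropy_weight=5.0):
    p_anchor = F.softmax(anchor_logits, dim=1)      # (B, C)
    p_neighbor = F.softmax(neighbor_logits, dim=1)  # (B, C)

    # (1) Consistency: maximize the dot product between an image's prediction
    # and its neighbor's prediction (largest when both are one-hot and equal).
    dot = (p_anchor * p_neighbor).sum(dim=1)
    consistency_loss = -torch.log(dot.clamp(min=1e-8)).mean()

    # (2) Entropy: maximize the entropy of the mean prediction so samples are
    # spread over all clusters instead of collapsing into one.
    p_mean = p_anchor.mean(dim=0)
    entropy = -(p_mean * torch.log(p_mean.clamp(min=1e-8))).sum()

    return consistency_loss - entropy_weight * entropy

anchor_logits = torch.randn(16, 10)
neighbor_logits = torch.randn(16, 10)
print(scan_loss(anchor_logits, neighbor_logits))
```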
Step 2b: Refinement through self-labeling
- Refine the model through self-labeling
- Apply a cross-entropy loss on
strongly augmented [1] versions of
confident samples.
- Applying strong augmentations
avoids overfitting.
[1] RandAugment, Cubuk et al. (2020)
[2] FixMatch, Sohn et al. (2020)
[3] Probability of error, Scudder H. (1965)
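A minimal sketch of the self-labeling step described above, in the spirit of FixMatch [2]: confident predictions on weakly augmented images serve as pseudo-labels for a cross-entropy loss on the strongly augmented views. The threshold value and tensor names are assumptions.

```python
# Self-labeling: keep only confident predictions as pseudo-labels and train on
# strongly augmented versions of those samples with a cross-entropy loss.
import torch
import torch.nn.functional as F

def self_labeling_loss(weak_logits, strong_logits, threshold=0.99):
    with torch.no_grad():
        probs = F.softmax(weak_logits, dim=1)
        confidence, pseudo_labels = probs.max(dim=1)
        mask = confidence > threshold              # confident samples only
    if mask.sum() == 0:
        return weak_logits.new_zeros(())           # nothing confident this batch
    # cross-entropy on the strongly augmented view, using the pseudo-labels
    return F.cross_entropy(strong_logits[mask], pseudo_labels[mask])

weak_logits = torch.randn(32, 10)     # predictions on weakly augmented images
strong_logits = torch.randn(32, 10)   # predictions on strong (RandAugment) views
print(self_labeling_loss(weak_logits, strong_logits))
```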
Experimental setup
- ResNet backbone with identical hyperparameters across datasets.
- SimCLR and MoCo implementations for the pretext task.
- Experiments on four datasets: CIFAR10, CIFAR100-20, STL10 and ImageNet.
Ablation studies - SCAN
- Ablations: pretext task and number of nearest neighbors (K).

Pretext Task               ACC (Avg ± Std)
Rotation Prediction        74.3 ± 3.9
Instance Discrimination    87.6 ± 0.4
Ablation studies - Self-label
Self-labeling (CIFAR-10):

Step             ACC (Avg ± Std)
SCAN             81.8 ± 0.3
Self-labeling    87.6 ± 0.4

(Figure: effect of the self-labeling threshold.)
Comparison with SOTA
(Figure: classification accuracy [%] of DEC (ICML16), DeepCluster (ECCV18), DAC (ICCV17), IIC (ICCV19) and SCAN (Ours) on CIFAR10, CIFAR100-20 and STL10.)
- Large performance gains w.r.t. prior work: +26.6% on CIFAR10, +25.0% on CIFAR100-20 and +21.3% on STL10.
- SCAN outperforms SimCLR + K-means.
- Close to supervised performance on CIFAR-10 and STL-10.
ImageNet Results
- Scalable: first method that scales to ImageNet (1000 classes).
- Semantic clusters: the clusters capture a large variety of different backgrounds, viewpoints, etc.
- Confusion matrix shows the ImageNet hierarchy, containing dogs, insects, primates, snakes, clothing, buildings, birds, etc.
Comparison with supervised methods
- Compared against methods trained with 1% of the labels.
- SCAN (no labels): Top-1: 39.9%, Top-5: 60.0%, NMI: 72.0%, ARI: 27.5%
Prototypical behavior
Prototype: the sample closest to the mean embedding of
the highly confident samples of a given class.
Prototypes:
- show what each cluster represents
- are often more pure
(Figure: cluster prototypes on ImageNet, STL10 and CIFAR10.)
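A minimal sketch of how a prototype could be selected per the definition above: average the embeddings of a cluster's highly confident samples and return the sample closest to that mean. The confidence threshold and names are illustrative assumptions.

```python
# Select a prototype per cluster: the sample closest to the mean embedding of
# that cluster's highly confident samples.
import torch
import torch.nn.functional as F

def find_prototype(embeddings, probs, cluster_id, threshold=0.99):
    confidence, assignments = probs.max(dim=1)
    mask = (assignments == cluster_id) & (confidence > threshold)
    if mask.sum() == 0:
        return None                                     # no confident samples
    cluster_embeddings = F.normalize(embeddings[mask], dim=1)
    mean_embedding = F.normalize(cluster_embeddings.mean(dim=0), dim=0)
    distances = 1.0 - cluster_embeddings @ mean_embedding   # cosine distance
    local_idx = distances.argmin()
    return mask.nonzero(as_tuple=True)[0][local_idx]    # index into the dataset

embeddings = torch.randn(1000, 128)           # feature embeddings of the dataset
probs = F.softmax(torch.randn(1000, 10), 1)   # clustering head's soft assignments
print(find_prototype(embeddings, probs, cluster_id=3))
```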
Conclusion
Two-step approach: decouple feature learning and clustering
Nearest neighbors capture variance in viewpoints and backgrounds
Promising results on large scale datasets
Future directions
Extension to other modalities, e.g. video, audio
Other domains, e.g. segmentation, semi-supervised, etc.
Code is available on Github
[Link]/wvangansbeke/Unsupervised-Classification