Transfer Learning with PyTorch
Where can you get help?
“If in doubt, run the code”
• Follow along with the code
• Try it for yourself
• Press SHIFT + CMD + SPACE to read the docstring
• Search for it
• Try again
• Ask
https://www.github.com/mrdbourke/pytorch-deep-learning/discussions
“What is transfer learning?”
Surely someone has spent the time crafting the right model for the job…
Example transfer learning use cases
• Computer vision (e.g. image classification)
• Natural language processing (e.g. spam detection: “Hey Daniel, This deep learning course is incredible! I can’t wait to use what I’ve learned!” → not spam, vs. “Hay daniel… C0ongratu1ations! U win $1139239230” → spam)
In both cases, a model learns patterns/weights from a similar problem space, then those patterns get used/tuned to the specific problem.
“Why use transfer learning?”
Why use transfer learning?
• Can leverage an existing neural network architecture proven to work on problems similar to our own
• Can leverage a working network architecture which has already learned patterns on similar data to our own (often achieving great results with less data)
[Diagram: a pretrained EfficientNet architecture (already works really well on computer vision tasks) learns patterns in a wide variety of images (using ImageNet); those patterns/weights get extracted/tuned to suit our own problem (FoodVision Mini), and the model performs better than one trained from scratch.]
Improving a model
Method to improve a model (reduce overfitting) — What does it do?
• More data — Gives a model more of a chance to learn patterns between samples (e.g. if a model is performing poorly on images of pizza, show it more images of pizza).
• Data augmentation — Increase the diversity of your training dataset without collecting more data (e.g. take your photos of pizza and randomly rotate them 30°). Increased diversity forces a model to learn more generalisable patterns (see the sketch below).
• Better data — Not all data samples are created equally. Removing poor samples from or adding better samples to your dataset can improve your model’s performance.
• Use transfer learning — Take an existing model’s pre-learned patterns from one problem and tweak them to suit your own problem. For example, take a model trained on pictures of cars to recognise pictures of trucks.
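As a sketch of the data augmentation method above (assuming torchvision’s transforms API; the exact transforms chosen here are illustrative):

```python
from torchvision import transforms

# Increase training data diversity without collecting more data
train_transform = transforms.Compose([
    transforms.Resize((224, 224)),          # resize images to a fixed size
    transforms.RandomRotation(degrees=30),  # randomly rotate up to ±30°
    transforms.ToTensor(),                  # convert PIL image -> tensor
])
```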
Where to find pretrained models
• PyTorch domain libraries (torchvision, torchtext, torchaudio, torchrec). Source: https://pytorch.org/vision/stable/models.html
• Torch Image Models (timm library). Source: https://github.com/rwightman/pytorch-image-models
• 🤗 HuggingFace Hub. Source: https://huggingface.co/models
• Paperswithcode SOTA. Source: https://paperswithcode.com/sota
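For example, a minimal sketch of loading one of torchvision’s pretrained models (older torchvision versions use pretrained=True, newer ones use a weights argument):

```python
import torchvision

# Load EfficientNetB0 with weights pretrained on ImageNet
# (newer torchvision: torchvision.models.efficientnet_b0(weights="DEFAULT"))
model = torchvision.models.efficientnet_b0(pretrained=True)
```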
What we’re going to cover
(broadly)
• Getting setup (importing previously written code)
• Introduce transfer learning with PyTorch
• Customise a pretrained model for our own use case (FoodVision Mini 🍕🥩🍣)
• Evaluating a transfer learning model
• Making predictions on our own custom data
👩‍🍳 👩‍🔬 (we’ll be cooking up lots of code!)
How:
Let’s code!
Original Model vs. Feature Extraction
[Diagram: side-by-side comparison.
Original Model — trained on a large dataset (e.g. ImageNet): Input Layer → Layer 2 → … → Layer 234 → Layer 235 → Output Layer (shape = 1000, since ImageNet has 1000 classes).
Feature Extraction Transfer Learning Model — trained on a different dataset (e.g. 3 classes of food 🍕🥩🍣): the working architecture (e.g. EfficientNet) stays the same (frozen — the original model’s layers don’t update during training), while the output layer(s) change (shape = 3) and get trained on the new data.]
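In code, feature extraction comes down to freezing the backbone; a minimal sketch assuming torchvision’s EfficientNetB0 (as pictured):

```python
import torchvision

# Load the pretrained model (a working architecture, e.g. EfficientNetB0)
model = torchvision.models.efficientnet_b0(pretrained=True)

# Freeze the base layers: they stay the same (no gradient updates) during training
for param in model.features.parameters():
    param.requires_grad = False
```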
Kinds of Transfer Learning
[Diagram: three-column comparison on a different dataset (e.g. 3 classes of food 🍕🥩🍣).
Original Model — trained on a large dataset (e.g. ImageNet): Input Layer → Layer 2 → … → Layer 234 → Layer 235 → Output Layer (shape = 1000).
Feature Extraction — the lower layers stay the same (frozen); only the output layer changes (shape = 3) and gets trained on the new data.
Fine-tuning — the lower layers stay the same (frozen) or might change; the top layers get unfrozen (change) and trained on the new data, along with the new output layer (shape = 3). Note: fine-tuning usually requires more data than feature extraction.]
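Fine-tuning then unfreezes some of the top layers so they can change during training; a minimal sketch (how many blocks to unfreeze is an illustrative choice):

```python
import torchvision

model = torchvision.models.efficientnet_b0(pretrained=True)

# Feature extraction freezes everything in the backbone...
for param in model.features.parameters():
    param.requires_grad = False

# ...fine-tuning then unfreezes the top block(s) so their
# patterns can be adjusted on the new data
for block in list(model.features.children())[-2:]:
    for param in block.parameters():
        param.requires_grad = True
```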
Kinds of Transfer Learning
• Original model (“As is”)
  Description: Take a pretrained model as it is and apply it to your task without any changes.
  What happens: The original model remains unchanged.
  When to use: Helpful if you have the exact same kind of data the original model was trained on.
• Feature extraction
  Description: Take the underlying patterns (also called weights) a pretrained model has learned and adjust its outputs to be more suited to your problem.
  What happens: Most of the layers in the original model remain frozen during training (only the top 1-3 layers get updated).
  When to use: Helpful if you have a small amount of custom data (similar to what the original model was trained on) and want to utilise a pretrained model to get better results on your specific problem.
• Fine-tuning
  Description: Take the weights of a pretrained model and adjust (fine-tune) them to your own problem.
  What happens: Some, many or all of the layers in the pretrained model are updated during training.
  When to use: Helpful if you have a large amount of custom data and want to utilise a pretrained model and improve its underlying patterns to your specific problem.
EfficientNet feature extractor
[Diagram: input data (Pizza, Steak, Sushi 🍕🥩🍣) flows into the EfficientNetB0 Backbone (torchvision.models.efficientnet_b0), which stays the same (frozen, pretrained on ImageNet); the linear classifier layer (torch.nn.Linear) on top changes (same shape as the number of classes, e.g. 3).]
EfficientNetB0 architecture. Source: https://ai.googleblog.com/2019/05/efficientnet-improving-accuracy-and.html
EfficientNet feature extractor
EfficientNetB0 Backbone (torchvision.models.efficientnet_b0(pretrained=True)):
• Extracts features from the image
• Turns the features into a feature vector (by taking the average)
• Turns the feature vector into prediction logits (can adjust depending on the number of classes you have)
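Those three stages map onto the model’s submodules; a rough sketch of a forward pass on a dummy batch (shapes assume EfficientNetB0 with a 224×224 input):

```python
import torch
import torchvision

model = torchvision.models.efficientnet_b0(pretrained=True)
model.eval()  # evaluation mode for inference

x = torch.randn(1, 3, 224, 224)       # dummy image batch (batch, channels, H, W)
features = model.features(x)           # extracts features -> [1, 1280, 7, 7]
vector = model.avgpool(features)       # takes the average -> feature vector [1, 1280, 1, 1]
logits = model.classifier(torch.flatten(vector, 1))  # -> prediction logits [1, 1000]
```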
EfficientNet feature extractor — changing the classifier head
EfficientNetB0 Backbone (torchvision.models.efficientnet_b0(pretrained=True)): the backbone stays the same, the classifier head gets changed.
• Original Model (1000 output classes for ImageNet)
• Original Model + Changed Classifier Head (3 output classes for 🍕, 🥩, 🍣)
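Changing the classifier head is a short swap; a minimal sketch mirroring EfficientNetB0’s original head (the dropout value and in_features match torchvision’s implementation):

```python
import torchvision
from torch import nn

model = torchvision.models.efficientnet_b0(pretrained=True)

# Swap the 1000-class ImageNet head for a 3-class head (🍕🥩🍣),
# keeping the (frozen) backbone the same
model.classifier = nn.Sequential(
    nn.Dropout(p=0.2, inplace=True),  # same dropout as the original head
    nn.Linear(in_features=1280,       # 1280 = size of EfficientNetB0's feature vector
              out_features=3),        # 3 = pizza, steak, sushi
)
```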
torchinfo.summary(model, input_size=(32, 3, 224, 224))
[Summary output shows: whether the layers are trainable (unfrozen), the input shape of the data per layer, the output shape of the data per layer, and the total number of parameters and trainable parameters.]
torchinfo.summary(model, input_size=(32, 3, 224, 224))
[Summary output after freezing: many layers are untrainable (frozen), only the last layers are trainable, the final layer output matches the number of classes (🍕🥩🍣), and there are fewer trainable parameters because many layers are frozen.]
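To produce a summary like the ones above (a sketch; col_names selects which columns torchinfo prints):

```python
import torchinfo
import torchvision

model = torchvision.models.efficientnet_b0(pretrained=True)

# Per-layer summary: input/output shapes, parameter counts and
# whether each layer is trainable (unfrozen)
torchinfo.summary(
    model,
    input_size=(32, 3, 224, 224),  # (batch_size, colour_channels, height, width)
    col_names=["input_size", "output_size", "num_params", "trainable"],
)
```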