Lecture 25: Transfer Learning Overview (Part 1)
ENGR:3110
Introduction to AI and Machine Learning in Engineering
Today’s topics
• Reminder of steps of “recommended” project 2
• Getting more insight into skorch’s transfer learning tutorial (Part 1)
• Today: basic concepts of ResNet18 and ImageNet
• Work with project groups
Reminder of steps of “recommended” project 2
Build upon skorch’s transfer learning tutorial to develop your own image-based classifier
Reminder on how to get started on “recommended” project 2
1. Get the skorch transfer learning tutorial working on a development system of your choice (see lecture 23 notes) and save the model. Example options:
• Google Colab
• Your own environment (note: requires installing pytorch/skorch in your own Python environment)
2. In a separate notebook, be able to classify a bee or ant image (downloaded from the internet) using the saved model (see lecture 24 notes for an example notebook) on the environment of your choice. (A minimal sketch of this step appears after the notes below.)
STEP 1: Also see Additional Notes under Lecture 23 Introductory Notes.
https://skorch.readthedocs.io/en/stable/user/tutorials.html
Note: the link on the tutorial page is broken; use this one instead:
https://nbviewer.org/github/skorch-dev/skorch/blob/master/notebooks/Transfer_Learning.ipynb
STEP 2: An example notebook can be found in Additional Notes under Lecture 24: Introductory Notes. You will need to download your own bee/ant image.
(You will need to have already run the tutorial from Step 1 to have the saved model file.)
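A minimal sketch of Step 2, assuming the PretrainedModel class from the tutorial (shown later in these notes), the best_model.pt file saved in Step 1, and a hypothetical image file name:

import torch
from PIL import Image
from torchvision import transforms

model = PretrainedModel(output_features=2)
model.load_state_dict(torch.load('best_model.pt', map_location='cpu'))
model.eval()  # switch to inference mode

# Same validation-style preprocessing as the tutorial (ImageNet mean/std)
preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
])

img = Image.open('my_bee_or_ant.jpg').convert('RGB')  # hypothetical file name
x = preprocess(img).unsqueeze(0)  # add a batch dimension
with torch.no_grad():
    probs = torch.softmax(model(x), dim=1)
print(probs)  # class 0 = ants, class 1 = bees (ImageFolder sorts class folders alphabetically)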
Getting more insight into skorch’s transfer learning tutorial
Recall linear regression with multiple features (Lecture 11)

$y = m_0 f_0 + m_1 f_1 + \cdots + m_{n-1} f_{n-1} + b$

where $y$ is the predicted target value, $m_i$ is the coefficient for feature $i$, $f_i$ is the value of feature $i$, and $b$ is the intercept.
Training goal: find [m0, m1, …, mn-1] and b such that the sum of the squared errors between the actual (training) target values and the predicted (training) target values (obtained by applying the equation to the training samples) is as small as possible.
# train the model (scikit-learn; assumes train_ftrs/train_tgt arrays)
from sklearn.linear_model import LinearRegression
model = LinearRegression()
fit = model.fit(train_ftrs, train_tgt)
# determined coefficients in model.coef_
# determined intercept in model.intercept_
Recall linear regression with multiple features (Lecture 11)

[Diagram: features f0, f1, …, fn-1 feed into the linear regression model, which produces the output y]
Q: How many (trainable) parameters does the linear regression model have?
A: n + 1 (one coefficient per feature, plus the intercept b)
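As a quick check (a sketch assuming the fitted scikit-learn model from above):

# parameter count: one coefficient per feature, plus the intercept
n_params = model.coef_.size + 1
print(n_params)  # n + 1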
Installations
Imports
Transforms
RandomResizedCrop helps the model learn
different scales and crops.
RandomHorizontalFlip adds variation
(augmentation).
ToTensor converts PIL images to 3D tensors with
pixel values in [0,1].
Normalize ensures your input distribution
matches what pre-trained ImageNet models
expect (mean & std of ImageNet dataset).
tensor: a multi-dimensional array (generalizing scalars, vectors, and matrices); the core data structure used by PyTorch
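Put together, the tutorial’s training transform looks roughly like this (the mean/std values are the standard ImageNet channel statistics):

from torchvision import transforms

# Training transform, roughly as in the skorch tutorial
train_transforms = transforms.Compose([
    transforms.RandomResizedCrop(224),  # random scale/crop to 224x224
    transforms.RandomHorizontalFlip(),  # augmentation: random mirror
    transforms.ToTensor(),              # PIL image -> tensor with values in [0, 1]
    transforms.Normalize([0.485, 0.456, 0.406],   # ImageNet channel means
                         [0.229, 0.224, 0.225]),  # ImageNet channel stds
])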
The skorch tutorial is based on a ResNet18 model (a deep-learning-based model)
Updated (use this on Colab): we replace model.fc (the final fully connected layer) with a new linear layer that outputs output_features values, so we can fine-tune it for our own task (e.g., 2 classes for ants/bees). We supply this value (2) later.
import torch.nn as nn
from torchvision import models

class PretrainedModel(nn.Module):
    def __init__(self, output_features):
        super().__init__()
        # model = models.resnet18(pretrained=True)  # deprecated API
        model = models.resnet18(weights='IMAGENET1K_V1')  # or use weights='DEFAULT'
        num_ftrs = model.fc.in_features  # 512 for ResNet18
        model.fc = nn.Linear(num_ftrs, output_features)  # replace the final layer
        self.model = model

    def forward(self, x):
        return self.model(x)
Writes the model to a file called "best_model.pt"
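The tutorial sets up its callbacks roughly along these lines (the Checkpoint callback is what writes best_model.pt; the exact scheduler values are an assumption):

from skorch.callbacks import LRScheduler, Checkpoint, Freezer

# Decay the learning rate on a fixed schedule (step_size/gamma values assumed)
lrscheduler = LRScheduler(policy='StepLR', step_size=7, gamma=0.1)
# Save the parameters of the best model (by validation accuracy) to best_model.pt
checkpoint = Checkpoint(f_params='best_model.pt', monitor='valid_acc_best')
# Freeze every parameter except those of the new final layer (model.fc)
freezer = Freezer(lambda x: not x.startswith('model.fc'))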
Use this (on Colab)
You may get a better model with more epochs, at the cost of time.
We set the "output_features" argument via module__output_features=2 (the module__ prefix tells skorch to pass it to the module's __init__).
Stochastic gradient descent (SGD) is an optimization approach.
Modest changes here
Tutorial's Description:
• lr: how quickly to adjust the weights (small is slow, big can be unstable)
• batch_size: update the model after this number of records
• optimizer__momentum: how much of the previous direction (of the gradient) to keep (think of momentum going up/down a hill)
net = NeuralNetClassifier(
    module=PretrainedModel,
    criterion=nn.CrossEntropyLoss(),
    lr=0.001,
    batch_size=4,
    max_epochs=5,  # tutorial uses 25
    module__output_features=2,
    optimizer=optim.SGD,
    optimizer__momentum=0.9,
    iterator_train__shuffle=True,
    iterator_train__num_workers=2,
    iterator_valid__num_workers=2,
    train_split=predefined_split(val_ds),
    callbacks=[lrscheduler, checkpoint, freezer],
    classes=[0, 1],  # <- explicitly set classes here (e.g., [0, 1] for binary)
    device='cuda' if torch.cuda.is_available() else 'cpu',
    # On Apple silicon, use instead:
    # device='mps' if torch.backends.mps.is_available() else 'cpu',
)
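Training is then a single fit call. A sketch, assuming the tutorial's hymenoptera_data ants/bees folders; note that val_ds must already exist when net is constructed, since train_split references it:

from torchvision import datasets

# As in the tutorial: folder names ('ants', 'bees') become the class labels
train_ds = datasets.ImageFolder('hymenoptera_data/train', train_transforms)
val_ds = datasets.ImageFolder('hymenoptera_data/val', val_transforms)

net.fit(train_ds, y=None)  # y=None: skorch reads labels from the dataset itself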
Black-box view of ResNet18 model

[Diagram: an input image feeds into the ResNet18 model, which produces 1000 outputs y0, y1, …, y999; intuitively, the likelihood of each of 1000 categories (e.g., goldfish)]

Historical note: the original ResNet network was published in 2015/2016.

There are ~11 million parameters in the ResNet18 model (and it is one of the smaller ResNet models)!

Image from: https://github.com/EliSchwartz/imagenet-sample-images/blob/master/gallery.md
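You can check the parameter count directly (a quick sketch):

import torch
from torchvision import models

model = models.resnet18(weights='IMAGENET1K_V1')
n_params = sum(p.numel() for p in model.parameters())
print(f'{n_params:,}')  # about 11.7 million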
ResNet18
• Input: (224 × 224) pixels in 3 color channels = 150,528 input values
• Convolution and downsampling in the early layers reduce the spatial resolution
• 4 residual layers (stages) downsample further
• Global average pooling reduces each final 7×7 feature map to a single number, leaving 512 features for the last layer
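A sketch that confirms the 512-feature bottleneck by running a dummy image through everything except the final layer:

import torch
from torchvision import models

model = models.resnet18(weights='IMAGENET1K_V1').eval()
backbone = torch.nn.Sequential(*list(model.children())[:-1])  # drop the fc layer
x = torch.randn(1, 3, 224, 224)  # dummy 224x224 RGB image (batch of 1)
print(backbone(x).shape)  # torch.Size([1, 512, 1, 1]) -> 512 features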
ResNet18 was originally trained on a subset of the ImageNet dataset
• Full dataset: ~15 million images, ~20,000 categories
• Challenge dataset: ~1 million images, 1000 categories

Historical note: ImageNet was originally presented at CVPR 2009. Key idea: focus on the importance of data in building models.

Aside: TED talk by Dr. Fei-Fei Li (a key creator of ImageNet):
https://www.youtube.com/watch?v=40riCqvRoMs
ImageNet
• 2009 Release
• 14 million images grouped into categories with labels (cat, dog, truck, etc.)
• Used to train/evaluate computer vision models
• 2012: a Toronto team (AlexNet) used a deep convolutional neural network and significantly outperformed other teams
• Deep learning models are often pre-trained on ImageNet, then fine-tuned on other, smaller datasets
The last “layer” of ResNet18 takes model.fc.in_features (i.e., 512) inputs and effectively uses a linear-type model to predict each of the output classes

[Diagram: the 512 features f0, f1, …, f511 from the earlier layers feed into the final linear layer, which produces the 1000 outputs y0, y1, …, y999; intuitively, the likelihood of each of 1000 categories (e.g., goldfish)]

Note: there is an additional “softmax” step to scale the outputs to probabilities.
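Since the raw outputs (logits) can be any real numbers, softmax rescales them into a probability distribution. A quick illustration:

import torch

logits = torch.tensor([2.0, -1.0, 0.5])  # example raw outputs
probs = torch.softmax(logits, dim=0)     # exponentiate and normalize
print(probs, probs.sum())                # non-negative values summing to 1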
In transfer learning, we replace the last layer of the pre-trained network to output our desired number of outputs (e.g., 2).

[Diagram: the 512 features f0, f1, …, f511 now feed into a new linear layer producing 2 outputs y0 and y1; intuitively, the likelihood of each of 2 categories (e.g., ant/bee). Black-box view: an image feeds into the (slightly modified) ResNet18 model, which produces the 2 outputs. As before, an additional “softmax” step scales the outputs to probabilities.]

Keeping the other parameters/weights “fixed/frozen” results in only ~1000 parameters to train rather than ~11 million. Using smaller datasets for training becomes feasible!
Last Layer
• 512 features → 2 classes
• Weights: 512 × 2 = 1024
• Biases: 2 (one additive constant per class)
• Total: 1024 + 2 = 1026 trainable parameters
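A one-line check of that count (a sketch using the replacement layer on its own):

import torch.nn as nn

fc = nn.Linear(512, 2)  # the replacement final layer
print(sum(p.numel() for p in fc.parameters()))  # 1026 = 512*2 weights + 2 biases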
Validation
[Screenshots: validation predictions vs. truth; e.g., the prediction and truth for image 5, and one it got wrong (index 6)]
1000 categories
from torchvision.models import ResNet18_Weights
from pprint import pprint

# Load the class labels from ImageNet
weights = ResNet18_Weights.DEFAULT
categories = weights.meta["categories"]

# Show the first 10 categories
print(categories[:10])

# Join all categories with commas and pretty-print for readability
pprint(",".join(categories), width=100)
('tench,goldfish,great white shark,tiger shark,hammerhead,electric '
'ray,stingray,cock,hen,ostrich,brambling,goldfinch,house finch,junco,indigo '
'bunting,robin,bulbul,jay,magpie,chickadee,water ouzel,kite,bald eagle,vulture,great grey '
'owl,European fire salamander,common newt,eft,spotted salamander,axolotl,bullfrog,tree '
'frog,tailed frog,loggerhead,leatherback turtle,mud turtle,terrapin,box turtle,banded '
'gecko,common iguana,American chameleon,whiptail,agama,frilled lizard,alligator lizard,Gila '
'monster,green lizard,African chameleon,Komodo dragon,African crocodile,American '
'alligator,triceratops,thunder snake,ringneck snake,hognose snake,green snake,king snake,garter '
'snake,water snake,vine snake,night snake,boa constrictor,rock python,Indian cobra,green '
'mamba,sea snake,horned viper,diamondback,sidewinder,trilobite,harvestman,scorpion,black and gold '
'garden spider,barn spider,garden spider,black widow,tarantula,wolf spider,tick,centipede,black '
etc...
Summary of key concepts
• The skorch transfer learning tutorial is based on a pre-trained
deep-learning model called ResNet18 with ~11 million
parameters (requiring lots of data for training)
• ResNet18 was originally trained using a subset of the ImageNet
dataset with ~1 million images (and 1000 categories)
• In the transfer learning tutorial, we replace the last “layer” of the deep-learning network to predict only 2 categories.
• Only the weights of the last layer are updated (only ~1000 parameters rather than ~11 million), making training feasible with small datasets
End