Building an ML Application and Transfer Learning
Applied Machine Learning
Derek Hoiem
[Title image: DALL·E]
Today’s lecture
• Review a few exam questions
• Example of building an ML application
• Transfer learning
Exam
• Well done!
False: It’s possible (and common) for a method to achieve low/zero training error, but still perform badly in
testing, especially if the training examples are few compared to the model size
(a): The parameters optimize the objective for the training data, so evaluation on the training data is a
strongly biased optimistic estimate of performance, and is not a good indicator of expected performance
for future examples
(c): The trees are independently trained
(a): All features are used to train each tree
(b): x = 3 gives y ≈ 3 for both regression and nearest neighbor
False: The weight update is not sampled randomly from a uniform distribution, but computed
from a random sample of data. Also, SGD does not proceed by checking whether an update
decreases the loss -- it just takes a step according to the loss gradient for that mini-batch.
False: Sigmoid activations are very non-linear. The problem is that the gradient
is always less than 1 and often very small, so with many layers, the gradient
becomes negligible.
We’ve covered a lot of ground in deep networks
• ReLU activations, residual connections, and improved
optimization techniques enabled training arbitrarily large
and deep models
• Transformers provide a general and scalable way to process
many kinds of data
• Training on large annotated datasets or even larger
unannotated datasets yields impressive models that are
useful for many applications
How do you make your own ML application?
Example: Safety inspector wants to know what fraction of
workers are wearing helmets, gloves, and boots on each job site
• PPE use is low (e.g., 60% use in a study in Egypt; frequent lack of use in US and other countries too)
• 1,008 fatal and 174,100 non-fatal injuries in US construction in 2020
• Consistently using PPE would significantly reduce injuries and deaths
Step 1: Propose a solution in more technical terms
Proposed solution: Process images from the job site to detect the
workers and count what fraction of detected workers are
wearing each item
[Example output for one detected worker: Left glove: No; Right glove: No; Hard hat: Yes; Vest: Yes; Boots: Yes]
Step 1: Propose a solution in more technical terms
Main ML problem: Given an image, detect each worker and
whether each detected worker is wearing: (a) glove on left hand;
(b) glove on right hand; (c) boots; (d) hard hat; (e) vest
Note: There are lots of other aspects to the problem that we won’t consider in this example
• How to get images onto a server where we can process them
• How to avoid duplicate counts when the same person is in more than one image on the same day
• How to summarize results and report them to the safety inspector
Step 2: Decide how to measure success
• What matters?
– We want the overall estimate of
fraction of workers wearing each
item to be accurate
– We want to report specific
instances of workers not wearing
an item, so that they can be
checked as problematic or not
Step 2: Decide how to measure success
• Key aspects of performance
– Human detection performance
• Do we care about “small” or heavily occluded
workers?
• What counts as correct? (maybe high overlap
in bounding boxes)
• Measure precision (fraction of detections that are correct) and recall (fraction of workers that are detected)
• Can measure Precision and Recall for each
level of confidence and generate a P-R curve
• Common overall performance measure is
average precision
• We may care about recall at a high precision
value because we don’t care about counting
the number of workers, just knowing how
likely a worker is to wear PPE
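To make these metrics concrete, here is a minimal sketch of computing a P-R curve and average precision from scored detections. It assumes the matching of detections to ground-truth workers (the `is_correct` flags, e.g., via an IoU threshold) has already been done; the function name is illustrative.

```python
import numpy as np

def pr_curve_and_ap(scores, is_correct, num_gt):
    """scores: confidence of each detection across the dataset.
    is_correct: 1 if the detection matches a ground-truth worker
    (e.g., bounding-box IoU above a threshold), else 0.
    num_gt: total number of annotated workers."""
    order = np.argsort(-np.asarray(scores))              # descending confidence
    correct = np.asarray(is_correct, dtype=float)[order]
    tp = np.cumsum(correct)                              # true positives so far
    fp = np.cumsum(1.0 - correct)                        # false positives so far
    precision = tp / (tp + fp)       # fraction of detections that are correct
    recall = tp / num_gt             # fraction of workers that are detected
    # Average precision: area under the P-R curve (rectangular approximation)
    ap = float(np.sum(precision * np.diff(np.concatenate([[0.0], recall]))))
    return precision, recall, ap
```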
Step 2: Decide how to measure success
• Key aspects of performance
– Human detection performance
– Apparel classification performance,
for correctly detected humans and
each item: EER
• TP rate: fraction of actual items that are detected
• FP rate: fraction of item detections that are false
• Summarize with the equal error rate (EER): performance at the confidence threshold where the FP rate equals (1 - TP rate)
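A small sketch of finding the equal-error operating point by sweeping the confidence threshold. Here the FP rate is taken in the ROC sense (fraction of true non-wearers scored above threshold), which is an assumption about how the metric would be instantiated; the function name is illustrative.

```python
import numpy as np

def equal_error_rate(pos_scores, neg_scores):
    """pos_scores: confidences for workers actually wearing the item;
    neg_scores: confidences for workers not wearing it.
    Returns the error rate at the threshold where FP rate ~= 1 - TP rate."""
    best_gap, eer = np.inf, None
    for t in np.sort(np.concatenate([pos_scores, neg_scores])):
        tpr = float(np.mean(pos_scores >= t))   # detection rate
        fpr = float(np.mean(neg_scores >= t))   # false positive rate
        gap = abs(fpr - (1.0 - tpr))
        if gap < best_gap:
            best_gap, eer = gap, (fpr + (1.0 - tpr)) / 2.0
    return eer
```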
Step 2: Decide how to measure success
• Key aspects of performance
– Human detection performance
– Apparel detection performance, for correctly detected humans
and each item
– Overall: Deviation of the estimated fraction of workers wearing equipment from the true fraction over a set of images
• Difference in fractions
• Bias: tends to overcount or undercount
• Variance: how much could the difference be expected to vary, given a
particular number of images
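One way to estimate the bias and variance terms is to bootstrap over images. A minimal sketch, where `per_image_counts` and the function name are illustrative assumptions:

```python
import numpy as np

def fraction_estimate_spread(per_image_counts, true_fraction, num_boot=1000, seed=0):
    """per_image_counts: array of (num_wearing, num_detected) per image.
    Resamples images with replacement to see how the estimated fraction
    behaves for a given number of images."""
    rng = np.random.default_rng(seed)
    counts = np.asarray(per_image_counts, dtype=float)
    est = []
    for _ in range(num_boot):
        sample = counts[rng.integers(0, len(counts), len(counts))]
        est.append(sample[:, 0].sum() / sample[:, 1].sum())
    est = np.asarray(est)
    bias = est.mean() - true_fraction   # tendency to overcount or undercount
    variance = est.var()                # expected spread of the estimate
    return bias, variance
```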
Step 3: Collect and annotate validation/test images
1. Collect images
– Should be the same kind of images that will be processed in deployment
– Collect from a variety of sites and different dates. Try to get representative diversity
2. Annotate
– Draw boxes around each worker, even very small and hard to detect ones
– For each PPE item, label “present”, “absent”, or “not visible”
– How to get annotations
• In house:
– Use an open-source tool, such as VGG Image Annotator, or a commercial tool like Labelbox
– Develop a custom tool (e.g., to process 360° images or fully integrate into an existing application)
• Outsource:
– Amazon Mechanical Turk or other crowdsourcing tool
– Commercial service
• In this case, creating a small initial development validation set in-house and a larger set by outsourcing could make sense
3. Split into a validation set and test set
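For concreteness, one annotation for one image might look like the record below; the field names and file name are illustrative assumptions, not a standard schema.

```python
# One illustrative annotation record (schema and names are assumptions)
annotation = {
    "image": "site12_2024-03-05_cam3_0042.jpg",
    "workers": [
        {
            "box": [412, 188, 520, 455],   # [x1, y1, x2, y2] in pixels
            "left_glove": "absent",
            "right_glove": "absent",
            "hard_hat": "present",
            "vest": "present",
            "boots": "not visible",        # occluded, so not labeled present/absent
        },
    ],
}
```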
Step 4: Determine technical details of approach
• For this example, we’ll base the approach on Mask R-CNN
[Figure: Mask R-CNN detects objects and includes an additional branch to detect person keypoints]
Modifications
• Remove bounding box detections and masks for non-
person objects
• Add classification layers to the keypoint branch (see the sketch after this list) to classify whether each person is:
• Wearing left glove
• Wearing right glove
• Wearing hard hat
• Wearing boots
• Wearing safety vest
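A minimal sketch of the modification, using torchvision's Keypoint R-CNN (a Mask R-CNN-style detector with a keypoint branch) as the base. The `PPEHead` module, its input dimension, and how per-person features are pooled are illustrative assumptions, not a prescribed design.

```python
import torch
import torch.nn as nn
import torchvision

# Base model: person detection plus a keypoint branch
model = torchvision.models.detection.keypointrcnn_resnet50_fpn(weights="DEFAULT")

class PPEHead(nn.Module):
    """Hypothetical added head: one binary logit per PPE item for each
    detected person (left glove, right glove, hard hat, boots, vest)."""
    def __init__(self, in_dim=1024, num_items=5):
        super().__init__()
        self.fc = nn.Linear(in_dim, num_items)

    def forward(self, person_features):
        # person_features: (num_persons, in_dim) pooled ROI features
        return self.fc(person_features)  # logits; sigmoid gives per-item probabilities

ppe_head = PPEHead()
```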
Step 5: Collect training data
• Consider a combination of existing data (with applicable licenses) and new data
• Existing
– Papers With Code
– Search for existing papers/datasets
• Collect own data
– Similar to collecting test/validation data, but with less concern about being representative or reflecting actual use cases
– E.g., could ask job sites to send photos of workers wearing and not
wearing PPE (on purpose, briefly) while in natural poses
Step 6: Develop model
(from ChatGPT)
• Whenever possible, start
with a pretrained model
• Alternatively, you could
use unsupervised
pretraining to initialize
your model (e.g. Masked
Autoencoder)
https://huggingface.co/models
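For example, a pretrained Masked Autoencoder backbone can be pulled from the Hugging Face Hub in a few lines; the checkpoint name below is one published MAE model, used here only as an illustration.

```python
# Minimal sketch of loading a pretrained MAE backbone from the Hub
from transformers import AutoImageProcessor, ViTMAEModel

processor = AutoImageProcessor.from_pretrained("facebook/vit-mae-base")
backbone = ViTMAEModel.from_pretrained("facebook/vit-mae-base")
```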
Step 6a: Develop model: establish baselines
• Run the model as-is on your validation data and
measure human detection performance
• Train a linear probe for classifying PPE item
presence and measure all performance metrics
• Manually validate your evaluation code by
displaying images and detections and checking
against metrics
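A minimal linear-probe sketch: freeze the pretrained model and train only one linear layer on its features. Here `backbone`, `feature_dim`, `extract_features`, and `loader` are placeholder names for the pieces described above, not a fixed API.

```python
import torch
import torch.nn as nn

for p in backbone.parameters():
    p.requires_grad = False              # keep pretrained features fixed

probe = nn.Linear(feature_dim, 5)        # one logit per PPE item
optimizer = torch.optim.Adam(probe.parameters(), lr=1e-3)
loss_fn = nn.BCEWithLogitsLoss()         # multi-label: items are independent

for images, labels in loader:            # labels: (batch, 5) in {0, 1}
    with torch.no_grad():
        feats = extract_features(backbone, images)   # hypothetical helper
    loss = loss_fn(probe(feats), labels.float())
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```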
Step 6b: Develop model: refine model
• Fine-tune the model on your
data
• Train using mix of existing and
application-specific data
– Apply only the losses that are applicable (e.g., detection or pose only for some datasets)
• Use tools like TensorBoard or
Weights and Biases to
monitor training and compare
results
– Always plot validation and
training loss, and measure
validation performance at
training milestones
https://huggingface.co/autotrain
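A sketch of the fine-tuning loop with a smaller learning rate for the pretrained backbone than for the new heads, logging to TensorBoard. `model.backbone`, `model.heads`, `loader`, and `compute_loss` stand in for the detector, data, and applicable losses; the attribute names are assumptions.

```python
import torch
from torch.utils.tensorboard import SummaryWriter

optimizer = torch.optim.AdamW([
    {"params": model.backbone.parameters(), "lr": 1e-5},   # gentle updates
    {"params": model.heads.parameters(),    "lr": 1e-4},   # new layers learn faster
])
writer = SummaryWriter("runs/ppe_finetune")

for step, batch in enumerate(loader):
    loss = compute_loss(model, batch)    # only losses applicable to this dataset
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    writer.add_scalar("train/loss", loss.item(), step)
    # At milestones, also log validation loss and metrics for comparison
```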
Step 7: Evaluate on test set
• Measure performance metrics and characterize when it works
and doesn’t
– As a function of occlusion, person size, camera viewpoint, etc.
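A sketch of slicing a metric by person size; the bucket edges follow the common COCO convention, and `ap_for` is a hypothetical helper that computes AP on a subset.

```python
import numpy as np

def ap_by_person_size(detections, ground_truth):
    """Report AP separately for small/medium/large workers (COCO-style areas)."""
    buckets = {"small": (0, 32**2), "medium": (32**2, 96**2), "large": (96**2, np.inf)}
    results = {}
    for name, (lo, hi) in buckets.items():
        gt = [g for g in ground_truth if lo <= g["box_area"] < hi]
        results[name] = ap_for(detections, gt)   # hypothetical AP helper
    return results
```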
Step 8: Integrate into application
• Beta test in complete workflows
• Write guides for when it works and doesn’t
• Improve efficiency, refine approach
Summary of how to build a new ML application
1. Identify problem and general approach to solution
– This also involves thinking ahead to metrics, available models, data, and more, to ensure viability
2. Specify success metrics
– Check with product managers and/or users to ensure these metrics reflect important performance
characteristics
– Often, the metrics can’t be optimized directly
3. Create evaluation sets
– Achieving targets for success metrics on these sets should indicate high likelihood of application success
4. Select model, objectives, and other design details
– Usually this involves finding an analogous approach that has been successful
5. Collect data for training
– Custom data and labeling is expensive and time-consuming, so exploit existing data sources where possible, as allowed by license terms
6. Develop model, starting with baselines and simple approaches
– Starting simple is critical so that it is easier to debug and validate changes
7. Evaluate on your test set
– It’s not just about the performance number, but about predictability and effectiveness within the
application
8. Integrate into the application
– This requires a lot of work and testing
2 minute break
Thank you to Yuxiong Wang for following slides on
domain adaptation and transfer learning!
Challenge for Machine Learning Models
• Development and real-world deployment may face different scenarios
• This mismatch limits model performance and reliability
[Diagram: Curated Dataset for Development → Trained ML Model → Real-world Setting → Questionable Performance]
Types of Shifts
• Mainly two types of shifts from one scenario to another:
– Task shift
– Domain shift
Task Shift: Changed Model Objectives
Source (old) task: classifying dogs and cats → Target (new) task: classifying squirrels and birds
Domain Shift: Changed Input Data Distributions
Source (old) domain: classifying dogs and cats in a studio → Target (new) domain: classifying dogs and cats on grass
Types of Shifts: Task or Domain?
• Task shift
– Objective of model is changed
– But data distributions are usually assumed similar or related
• Domain shift
– Input data come from changed distributions
– But model task usually remains the same
Overcoming Task/Domain Shift
[Diagram: without adaptation, Curated Dataset for Development → Trained ML Model → Real-world Setting → Questionable Performance; with adaptation, Adapted ML Model → Real-world Setting → Improved Performance]
Overcoming Task/Domain Shift
• Task shift (changed task objective) → task adaptation
– Transfer learning
– Meta-learning
• Domain shift (changed data distribution) → domain adaptation
– Instance translation
– Domain adversarial training
• Some adaptation ideas may be applicable to both (e.g., meta-learning)
Application: Autonomous Driving
• Adapt to different weather conditions, lighting conditions, or
driving environments
[Images: normal vs. foggy weather conditions, from Sakaridis et al. IJCV '18]
Application: Robotics
• Adapt from simulated environment to real-world robotic
systems, or adapt from one learned task to another
[Images from Google Research, 2020]
Application: Speech recognition
• Adapt to different accents, speaking styles, or environmental
conditions
• Example: A model trained on American English could be adapted to British English by fine-tuning on the new domain
Methods for Task Adaptation
• Transfer learning: Pre-training and fine-tuning
• Meta-learning: Model-Agnostic Meta-Learning (MAML) and
variants
Transfer Learning
• Goal: Reuse knowledge learned from one task (which usually has abundant supervisory information) on another related task
• Implementation is simple
– "Pre-train" model on source task
– Copy the learned weights to the new model
– "Fine-tune" new model on target task
Transfer Learning
[Diagram: Model 1: Task 1 Data → Backbone → Head → Task 1 Outputs. Model 1's backbone weights initialize Model 2: Task 2 Data → Backbone → New Head → Task 2 Outputs]
Transfer Learning
• Step 1: Pre-train Model 1 on Task 1
Transfer Learning
• Step 2: Initialize weights using learned Model 1
Transfer Learning
• Step 3: Fine-tune Model 2 on Task 2
– Backbone may use a smaller learning rate or even be "frozen"
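A minimal sketch of steps 2–3 in code; `model1`, `hidden_dim`, and `num_task2_classes` are illustrative names, and the backbone is assumed to output a flat feature vector.

```python
import copy
import torch
import torch.nn as nn

backbone = copy.deepcopy(model1.backbone)           # step 2: copy learned weights
new_head = nn.Linear(hidden_dim, num_task2_classes)
model2 = nn.Sequential(backbone, new_head)

# Step 3, option A: "freeze" the backbone and train only the new head
for p in backbone.parameters():
    p.requires_grad = False
optimizer = torch.optim.SGD(new_head.parameters(), lr=1e-2)

# Step 3, option B: fine-tune everything, backbone with a smaller learning rate
# optimizer = torch.optim.SGD([
#     {"params": backbone.parameters(), "lr": 1e-4},
#     {"params": new_head.parameters(), "lr": 1e-2},
# ])
```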
Model-Agnostic Meta-Learning (MAML)
• Proposed by Finn et al. ICML '17
• Goal: To learn a good parameter initialization that can be
quickly adapted to new tasks
• Model-agnostic: Can be applied to any differentiable model
– Flexible, can be used in a wide range of applications
– Including computer vision, natural language processing, and robotics
Model-Agnostic Meta-Learning (MAML)
• Assumption and setting
– Have a pool of various tasks
– Each task contains a set of training/validation samples
• An example of task pool
– Classify Dogs into Shepherd, Labrador, Golden, Husky ...
– Classify Cats into Siamese, Maine, Persian, Shorthair ...
– Classify Birds into Canary, Parrot, Dove, Sparrow ...
Model-Agnostic Meta-Learning (MAML)
• Meta-learning phase
– Use pool of tasks to obtain a good
parameter initialization
– Learn from the "experience of learning"
• Adaptation phase
– Use few samples and optimization steps to
adapt to new task
– New task can be outside the task pool used
in meta-learning
• Inner loop: find gradient step(s) that improve the parameters for each few-shot task
• Outer loop: update the shared parameters so that those update steps reduce the loss as much as possible for all tasks
MAML is “learning to learn” – it learns parameters that are close to good parameters for many classification tasks, so that new tasks can be learned from a few examples and optimization steps
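A minimal MAML sketch with one inner gradient step per task, written in a functional style. `tasks`, `loss_on`, and `init_params` are illustrative, and MAML typically averages the outer update over a batch of tasks rather than stepping after each one.

```python
import torch

alpha, beta = 0.01, 0.001                     # inner / outer learning rates
theta = [p.clone().requires_grad_(True) for p in init_params]
meta_opt = torch.optim.Adam(theta, lr=beta)

for task in tasks:                            # meta-learning phase
    support, query = task.sample()            # few-shot train / validation data
    inner_loss = loss_on(theta, support)
    grads = torch.autograd.grad(inner_loss, theta, create_graph=True)
    theta_prime = [p - alpha * g for p, g in zip(theta, grads)]  # adapted params
    outer_loss = loss_on(theta_prime, query)  # how well the adapted params do
    meta_opt.zero_grad()
    outer_loss.backward()                     # backprop through the inner step
    meta_opt.step()
```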
Methods for Domain Adaptation
• Instance translation
– Transform target-domain data to look like the source domain
• Domain adversarial training
– Align source-domain and target-domain feature spaces
Instance Translation
• Use generative models (e.g., CycleGAN by Zhu et al. ICCV '17) to create instances that look like the source domain but preserve the same target-domain content
• Then feed the source-like instances into the source-domain model
Instance Translation
[Figure: CycleGAN image-to-image translation examples, Zhu et al. ICCV '17]
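At test time, instance translation amounts to two lines; `G_t2s` (a trained target-to-source generator) and `source_model` are hypothetical names for the pieces above.

```python
import torch

with torch.no_grad():
    source_like = G_t2s(target_images)        # make target data look like source
    predictions = source_model(source_like)   # reuse the source-domain model as-is
```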
Domain Adversarial Training
• Proposed by Ganin et al. JMLR '16
• Goal: Learn a domain-invariant model
– The model produces features that do not change with domain shift
– Features reflect label-relevant content, not domain characteristics
Domain Adversarial Training
• Attach a domain classifier network and apply adversarial training
• Aim of domain classifier: To distinguish source vs. target domains
Domain Adversarial Training
• Aim of main network: 1) correctly predict the labels of source-domain data; 2) use features from which source and target domains cannot be distinguished
Domain Adversarial Training
• Adversarial training: the domain classifier (parameters θ_d) minimizes the domain-discrimination loss L_d, while the main network's feature extractor (parameters θ_f) maximizes L_d
– In practice, the adversarial training is implemented by reversing the gradient of L_d where the domain classifier connects to the feature extractor
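The gradient-reversal trick can be written as a tiny autograd function: identity on the forward pass, negated (optionally scaled) gradient on the backward pass. This sketch follows the standard formulation; `lam` is the scaling coefficient and the usage line is illustrative.

```python
import torch

class GradReverse(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x, lam=1.0):
        ctx.lam = lam
        return x.view_as(x)                   # identity going forward

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lam * grad_output, None   # reversed, scaled gradient

# Usage: domain_logits = domain_classifier(GradReverse.apply(features, 1.0))
# Minimizing the domain loss on the classifier side then maximizes it
# for the feature extractor.
```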
Domain Adversarial Training
• A mainstream approach to domain adaptation
– Various follow-up methods study how to better learn domain-
invariant models or feature representations
• Other ideas (may be combined with domain adversarial
training)
– Instance translation
– Pseudo-labeling and self-training
– Domain randomization
Summary
• Task adaptation for changed task objective
– Transfer learning
– Meta-learning
• Domain adaptation for changed data distribution
– Instance translation
– Domain adversarial training
Coming up
• Thursday: Ethics and Impact of AI