Computer Vision
Chapter 7 (part 1): Object detection
Course Content
• Chapter 1. Introduction
• Chapter 2. Image formation, acquisition and digitization
• Chapter 3. Image Processing
• Chapter 4. Feature detection and matching
• Chapter 5. Segmentation
• Chapter 6. Moving object detection and tracking
• Chapter 7. Object recognition and deep learning
‒ Object Detection
‒ Object Recognition
‒ Deep Learning
Contents
• Window-based generic object detection: basic
pipeline
• Boosting classifiers
• Face detection as case study
• SVM + HOG for human detection as case study
• Object proposals
• [DPM]
• Evaluation
3
Object Detection
• Problem: Detecting and localizing generic objects
from various categories, such as cars, people, etc.
• Challenges:
‒ illumination
‒ viewpoint
‒ deformations
‒ intra-class variability
4
Window-based generic
object detection
Basic pipeline
5
Generic category recognition:
basic framework
• Build/train object model
‒ Choose a representation
‒ Learn or fit parameters of model / classifier
• Generate candidates in new image
• Score the candidates
6
Window-based models
Building an object model
Given the representation, train a binary classifier
Car/non-car
Classifier
“Yes, a car.” / “No, not a car.”
Slide: Kristen Grauman
7
Window-based models
Generating and scoring candidates
Car/non-car
Classifier
Slide: Kristen Grauman
8
Window-based models
Generating and scoring candidates
• Slide a window through the image and check
whether there is an object at every location
YES!! Person match found
9
Window-based models
Generating and scoring candidates
• But what if we were looking for buses?
‒ With a person-sized window: no bus found!
• We will never find the object if we don’t
choose our window size wisely!
‒ With a suitably sized window: bus found
10
Multi-scale sliding window
• Work with windows of multiple sizes
• Create a feature pyramid
11
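A minimal sketch of this search, not from the slides: keep the window size fixed and repeatedly shrink the image, so the same window covers ever larger objects. `score_window` is a hypothetical stand-in for whatever classifier gets trained below.

```python
import cv2

def pyramid_windows(image, win_w=64, win_h=128, step=16, scale=1.25):
    """Yield (x, y, factor, patch) over a multi-scale sliding window."""
    factor = 1.0
    while image.shape[0] >= win_h and image.shape[1] >= win_w:
        for y in range(0, image.shape[0] - win_h + 1, step):
            for x in range(0, image.shape[1] - win_w + 1, step):
                yield x, y, factor, image[y:y + win_h, x:x + win_w]
        # shrink the image: the fixed window now covers larger objects
        image = cv2.resize(image, (int(image.shape[1] / scale),
                                   int(image.shape[0] / scale)))
        factor *= scale

# usage (score_window is a hypothetical trained classifier):
# for x, y, f, patch in pyramid_windows(img):
#     if score_window(patch) > threshold:
#         # map back: box at (x * f, y * f), size (64 * f, 128 * f)
#         ...
```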
Window-based object detection: recap
Training:
1. Obtain training data
2. Define features
3. Define classifier
Given new image:
1. Slide window
2. Score by classifier
Car/non-car
Classifier
Feature
extraction
Slide: Kristen Grauman
12
13
Features
• HOG
• Bags of visual words
• Haar features, …
Discriminative classifier construction:
• Nearest neighbor (e.g., with 10⁶ examples)
• Neural networks
• Support Vector Machines
• Boosting
• Conditional Random Fields
14
Boosting classifiers
15
Boosting intuition
Weak
Classifier 1
Slide credit: Paul Viola
16
Boosting illustration
Weights
Increased
17
Boosting illustration
Weak
Classifier 2
18
Boosting illustration
Weights
Increased
19
Boosting illustration
Weak
Classifier 3
20
Boosting illustration
Final classifier is
a combination of weak
classifiers
21
Boosting: training
• Initially, weight each training example equally
• In each boosting round:
‒ Find the weak learner that achieves the lowest weighted training error
‒ Raise weights of training examples misclassified by current weak
learner
• Compute final classifier as linear combination of all weak
learners
‒ (the weight of each learner increases with its accuracy)
• Exact formulas for re-weighting and combining weak
learners depend on the particular boosting scheme
(e.g., AdaBoost)
Slide credit: Lana Lazebnik
22
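A compact sketch of this recipe (discrete AdaBoost with labels in {−1, +1}; not from the slides). `best_stump` is any weak-learner fitter, e.g. the feature/threshold search sketched after the Viola-Jones AdaBoost slide below.

```python
import numpy as np

def adaboost_train(X, y, T, best_stump):
    """X: (n, d) features, y: (n,) labels in {-1, +1}.
    best_stump(X, y, w) -> (predict_fn, weighted_error) is an assumed
    weak-learner fitter returning the lowest weighted-error stump."""
    n = len(y)
    w = np.full(n, 1.0 / n)              # start with uniform weights
    learners, alphas = [], []
    for _ in range(T):
        h, err = best_stump(X, y, w)     # lowest weighted training error
        err = np.clip(err, 1e-10, 1 - 1e-10)
        alpha = 0.5 * np.log((1 - err) / err)  # learner weight grows
        pred = h(X)                            # with its accuracy
        w *= np.exp(-alpha * y * pred)   # raise weights of mistakes
        w /= w.sum()
        learners.append(h)
        alphas.append(alpha)
    # final classifier: sign of the weighted vote of the weak learners
    return lambda Xnew: np.sign(
        sum(a * h(Xnew) for a, h in zip(alphas, learners)))
```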
Face detection
as case study
23
Viola-Jones face detector
24
Viola-Jones face detector
Main idea:
‒ Represent local texture with efficiently
computable “rectangular” features within window
of interest
‒ Select discriminative features to be weak classifiers
‒ Use boosted combination of them as final classifier
‒ Form a cascade of such classifiers, rejecting clear
negatives quickly
25
Viola-Jones detector: features
• “Rectangular” filters: feature output is the
difference between adjacent regions
• Efficiently computable with the integral image:
any sum can be computed in constant time
• Integral image: the value at (x,y) is the sum of
pixels above and to the left of (x,y)
Slide: Kristen Grauman 26
Computing the integral image
Lana Lazebnik
27
Computing the integral image
• Cumulative row sum: s(x, y) = s(x−1, y) + i(x, y)
• Integral image: ii(x, y) = ii(x, y−1) + s(x, y)
Lana Lazebnik
28
Computing sum within a rectangle
• Let A, B, C, D be the values of the integral
image at the corners of a rectangle
(D: top-left, B: top-right, C: bottom-left,
A: bottom-right)
• Then the sum of original image values within
the rectangle can be computed as:
sum = A − B − C + D
• Only 3 additions are required
for any size of rectangle!
Lana Lazebnik
29
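In code, both recurrences collapse into two cumulative sums, and any rectangle sum then costs four lookups. A sketch with numpy; OpenCV provides the same via cv2.integral.

```python
import numpy as np

def integral_image(img):
    """ii[y, x] = sum of img[:y, :x]; zero-padded for simple indexing."""
    ii = np.zeros((img.shape[0] + 1, img.shape[1] + 1), dtype=np.int64)
    ii[1:, 1:] = img.cumsum(axis=0).cumsum(axis=1)
    return ii

def rect_sum(ii, y0, x0, y1, x1):
    """Sum of img[y0:y1, x0:x1] in constant time: A - B - C + D."""
    return ii[y1, x1] - ii[y0, x1] - ii[y1, x0] + ii[y0, x0]

img = np.arange(16).reshape(4, 4)
ii = integral_image(img)
assert rect_sum(ii, 1, 1, 3, 3) == img[1:3, 1:3].sum()
```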
Viola-Jones detector: features
• “Rectangular” filters: feature output is the
difference between adjacent regions
• Efficiently computable with the integral image
(value at (x,y) = sum of pixels above and to the
left of (x,y)): any sum can be computed in
constant time
• Avoid scaling images → scale the features
directly, for the same cost
30
Viola-Jones detector: features
Considering all possible filter parameters
(position, scale, and type): 180,000+ possible
features associated with each 24 x 24 window
Which subset of these features should we use
to determine if a window has a face?
Use AdaBoost both to select the informative features
and to form the classifier
31
Viola-Jones detector: AdaBoost
• Want to select the single rectangle feature and threshold
that best separates positive (faces) and negative (non-
faces) training examples, in terms of weighted error.
Resulting weak classifier: threshold the feature
output, h(x) = +1 if f(x) > θ, −1 otherwise.
(Figure: outputs of a possible rectangle feature
on faces and non-faces.)
For the next round, reweight the examples
according to errors, then choose another
filter/threshold combo.
Slide: Kristen Grauman
32
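A sketch of that threshold search for a single feature, assuming the example weights w sum to 1. This is the `best_stump` building block assumed in the AdaBoost sketch above: run it once per candidate feature and keep only the best one.

```python
import numpy as np

def best_threshold(f, y, w):
    """f: one feature's outputs (n,), y: labels in {-1, +1}, w: weights
    summing to 1. Returns (theta, polarity, weighted_error) for the
    weak classifier h(x) = polarity * sign(f(x) - theta)."""
    order = np.argsort(f)
    f, y, w = f[order], y[order], w[order]
    # error with polarity +1 if we cut after sorted index i:
    # positives at or below the cut + negatives above it
    pos_below = np.cumsum(w * (y == 1))
    neg_above = (w * (y == -1)).sum() - np.cumsum(w * (y == -1))
    err_plus = pos_below + neg_above
    err = np.minimum(err_plus, 1.0 - err_plus)   # flipped polarity
    i = int(np.argmin(err))
    polarity = 1 if err_plus[i] <= 1.0 - err_plus[i] else -1
    return f[i], polarity, err[i]
```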
AdaBoost Algorithm
• Start with uniform weights on training
examples {x1,…,xn}
• For T rounds:
‒ Evaluate the weighted error for each feature,
pick the best.
‒ Re-weight the examples:
incorrectly classified → more weight,
correctly classified → less weight
• Final classifier is a combination of the weak
ones, weighted according to the error they had.
33
Viola-Jones Face Detector: Results
First two features
selected
34
• Even if the filters are fast to compute, each
new image has a lot of possible windows to
search.
• How to make the detection more efficient?
35
Cascading classifiers for detection
• Form a cascade with low false negative rates early on
• Apply less accurate but faster classifiers first to immediately
discard windows that clearly appear to be negative
Slide: Kristen Grauman
36
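The cascade logic itself is tiny (a sketch, not the original code): each stage is a boosted classifier with its own threshold, and a window is reported only if every stage accepts it, so most windows exit after the first cheap stages.

```python
def cascade_classify(window, stages):
    """stages: list of (boosted_score_fn, threshold), cheapest first.
    Reject as soon as any stage's score falls below its threshold."""
    for score_fn, threshold in stages:
        if score_fn(window) < threshold:
            return False          # clear negative: stop paying for it
    return True                   # survived every stage: report a face
```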
Training the cascade
• Set target detection and false positive rates for each
stage
• Keep adding features to the current stage until its target
rates have been met
‒ Need to lower AdaBoost threshold to maximize detection (as
opposed to minimizing total classification error)
‒ Test on a validation set
• If the overall false positive rate is not low enough, then
add another stage
• Use false positives from current stage as the
negative training examples for the next stage
37
Viola-Jones detector: summary
Train a cascade of classifiers with AdaBoost:
faces and non-faces go in; selected features,
thresholds, and weights come out. The cascade
is then applied to every window of a new image.
• Train with 5K positives, 350M negatives
• Real-time detector using a 38-layer cascade
• 6061 features in all layers
[Implementation available in OpenCV] Slide: Kristen Grauman
38
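The OpenCV implementation mentioned on the slide is typically invoked like this (with the opencv-python package; the image file names are placeholders):

```python
import cv2

# Load the pretrained Viola-Jones cascade that ships with OpenCV
cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")

img = cv2.imread("group_photo.jpg")          # any test image
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

# scaleFactor: pyramid step; minNeighbors: overlapping detections
# required before a window is reported as a face
faces = cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
for (x, y, w, h) in faces:
    cv2.rectangle(img, (x, y), (x + w, y + h), (0, 255, 0), 2)
cv2.imwrite("faces.jpg", img)
```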
Viola-Jones detector: summary
• A seminal approach to real-time object detection
‒ 26,949 citations
• Training is slow, but detection is very fast
• Key ideas
‒ Integral images for fast feature evaluation
‒ Boosting for feature selection
‒ Attentional cascade of classifiers for fast rejection of non-face
windows
P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features.
CVPR 2001.
P. Viola and M. Jones. Robust real-time face detection. IJCV 57(2), 2004.
39
Viola-Jones Face Detector: Results
40
Viola-Jones Face Detector: Results
41
Viola-Jones Face Detector: Results
42
Detecting profile faces?
Can we use the same detector?
43
Viola-Jones Face Detector: Results
Paul Viola, ICCV tutorial 44
Example using Viola-Jones detector
Frontal faces detected and then tracked, character names
inferred with alignment of script and subtitles.
Everingham, M., Sivic, J. and Zisserman, A.
"Hello! My name is... Buffy" - Automatic naming of characters in TV video,
BMVC 2006. http://www.robots.ox.ac.uk/~vgg/research/nface/index.html
45
46
Slide: Kristen Grauman
47
Consumer application: iPhoto
http://www.apple.com/ilife/iphoto/
Slide credit: Lana Lazebnik
48
Consumer application: iPhoto
Things iPhoto thinks are faces
Slide credit: Lana Lazebnik
49
Consumer application: iPhoto
• Can be trained to recognize pets!
http://www.maclife.com/article/news/iphotos_faces_recognizes_cats
Slide credit: Lana Lazebnik
50
Privacy Gift Shop – CV Dazzle
• http://www.wired.com/2015/06/facebook-can-recognize-even-dont-show-face/
• Wired, June 15, 2015
Slide: Kristen Grauman
51
Boosting: pros and cons
• Advantages of boosting
‒ Integrates classification with feature selection
‒ Complexity of training is linear in the number of training examples
‒ Flexibility in the choice of weak learners, boosting scheme
‒ Testing is fast
‒ Easy to implement
• Disadvantages
‒ Needs many training examples
‒ Other discriminative models may outperform in practice (SVMs,
CNNs,…)
• especially for many-class problems
Slide credit: Lana Lazebnik
52
Window-based models:
Two case studies
• Boosting + face detection (Viola & Jones)
• SVM + person detection (e.g., Dalal & Triggs)
53
SVM + HOG for human detection
as case study
54
Linear classifiers
55
Linear classifiers
• Find linear function to separate positive and negative
examples
xᵢ positive: xᵢ · w + b ≥ 0
xᵢ negative: xᵢ · w + b < 0
Which line
is best?
56
Support Vector Machines (SVMs)
• Discriminative
classifier based on
optimal separating
line (for 2d case)
• Maximize the margin
between the positive
and negative training
examples
57
Support vector machines
• Want line that maximizes the margin
xᵢ positive (yᵢ = 1): xᵢ · w + b ≥ 1
xᵢ negative (yᵢ = −1): xᵢ · w + b ≤ −1
For support vectors, xᵢ · w + b = ±1
(Figure: support vectors and the margin)
C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and
Knowledge Discovery, 1998
58
Support vector machines
• Want line that maximizes the margin
xᵢ positive (yᵢ = 1): xᵢ · w + b ≥ 1
xᵢ negative (yᵢ = −1): xᵢ · w + b ≤ −1
For support vectors, xᵢ · w + b = ±1
Distance between point and line: |xᵢ · w + b| / ||w||
For support vectors: (wᵀx + b) / ||w|| = ±1 / ||w||,
so the margin is M = 1/||w|| − (−1/||w||) = 2/||w||
59
Support vector machines
• Want line that maximizes the margin
xᵢ positive (yᵢ = 1): xᵢ · w + b ≥ 1
xᵢ negative (yᵢ = −1): xᵢ · w + b ≤ −1
For support vectors, xᵢ · w + b = ±1
Distance between point and line: |xᵢ · w + b| / ||w||
Therefore, the margin is M = 2 / ||w||
60
Finding the maximum margin line
1. Maximize margin 2/||w||
2. Correctly classify all training data points:
xᵢ positive (yᵢ = 1): xᵢ · w + b ≥ 1
xᵢ negative (yᵢ = −1): xᵢ · w + b ≤ −1
Quadratic optimization problem:
Minimize (1/2) wᵀw
subject to yᵢ(w · xᵢ + b) ≥ 1
C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and
Knowledge Discovery, 1998
61
Finding the maximum margin line
• Solution: w = Σᵢ αᵢ yᵢ xᵢ
(αᵢ: learned weights; xᵢ: support vectors)
C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and
Knowledge Discovery, 1998
62
Finding the maximum margin line
• Solution: w = Σᵢ αᵢ yᵢ xᵢ
b = yᵢ − w · xᵢ (for any support vector)
w · x + b = Σᵢ αᵢ yᵢ (xᵢ · x) + b
• Classification function:
f(x) = sign(w · x + b)
= sign(Σᵢ αᵢ yᵢ (xᵢ · x) + b)
If f(x) < 0, classify as negative,
if f(x) > 0, classify as positive
C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and
Knowledge Discovery, 1998
63
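None of this code appears in the slides, but scikit-learn makes the quantities above tangible: after fitting a (near) hard-margin linear SVM on toy data, w, b, the support vectors, and the margin 2/||w|| can all be read off the model.

```python
import numpy as np
from sklearn import svm

# Toy 2-D data: two linearly separable classes, labels +1 / -1
X = np.array([[2.0, 2.0], [3.0, 3.0], [2.5, 1.5],
              [0.0, 0.0], [-1.0, 0.5], [0.5, -1.0]])
y = np.array([1, 1, 1, -1, -1, -1])

clf = svm.SVC(kernel="linear", C=1e6)   # very large C ~ hard margin
clf.fit(X, y)

w, b = clf.coef_[0], clf.intercept_[0]
print("support vectors:", clf.support_vectors_)  # points with x.w+b = +/-1
print("margin 2/||w|| =", 2 / np.linalg.norm(w))
print("f([2, 0]) =", np.sign(np.array([2.0, 0.0]) @ w + b))
```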
Person detection
with HoGs & linear SVMs
• Histogram of oriented gradients (HoG):
‒ Map each grid cell in the input window to a histogram
counting the gradients per orientation.
• Train a linear SVM
‒ using training set of pedestrian vs. non-pedestrian
windows.
Dalal & Triggs, CVPR 2005
64
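OpenCV ships exactly this detector pair: a 64×128 HOG descriptor plus a pretrained linear SVM for pedestrians, searched over a multi-scale sliding window (image file names are placeholders):

```python
import cv2

hog = cv2.HOGDescriptor()   # default 64x128 window, 9 orientation bins
hog.setSVMDetector(cv2.HOGDescriptor_getDefaultPeopleDetector())

img = cv2.imread("street.jpg")               # any test image
# multi-scale sliding-window search, the pipeline described above
rects, weights = hog.detectMultiScale(img, winStride=(8, 8), scale=1.05)
for (x, y, w, h) in rects:
    cv2.rectangle(img, (x, y), (x + w, y + h), (0, 0, 255), 2)
cv2.imwrite("people.jpg", img)
```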
Person detection
with HoGs & linear SVMs
• For more detail about HoG:
‒ Histograms of Oriented Gradients for Human Detection, Navneet Dalal,
Bill Triggs, International Conference on Computer Vision & Pattern
Recognition - June 2005
‒ http://lear.inrialpes.fr/pubs/2005/DT05/
65
Window-based detection: strengths
• Sliding window detection and global appearance
descriptors:
‒ Simple detection protocol to implement
‒ Good feature choices critical
‒ Past successes for certain classes
Slide: Kristen Grauman
66
Window-based detection: Limitations
• High computational complexity
‒ For example: 250,000 locations x 30 orientations x 4
scales = 30,000,000 evaluations!
‒ If binary detectors are trained independently, the
cost increases linearly with the number of classes
• With so many windows, false positive rate better
be low
Slide: Kristen Grauman
67
Limitations (continued)
• Not all objects are “box” shaped
Slide: Kristen Grauman
68
Limitations (continued)
• Non-rigid, deformable objects not captured well with
representations assuming a fixed 2d structure; or must assume
fixed viewpoint
• Objects with less-regular textures not captured well with holistic
appearance-based descriptions
Slide: Kristen Grauman
69
Limitations (continued)
(Figure panels: sliding window vs. detector’s view)
If considering windows in isolation,
context is lost
Figure credit: Derek Hoiem
Slide: Kristen Grauman
70
Limitations (continued)
• In practice, often entails large, cropped training set
(expensive)
• Requiring good match to a global appearance
description can lead to sensitivity to partial occlusions
Slide: Kristen Grauman
71
Object proposals
72
Object proposals
Main idea:
• Learn to generate category-independent regions/boxes
that have object-like properties.
• Let object detector search over “proposals”, not
exhaustive sliding windows
Alexe et al. Measuring the objectness of image windows, PAMI 2012
73
Object proposals
Multi-scale
saliency
Color
contrast
Alexe et al. Measuring the objectness of image windows, PAMI 2012
74
Object proposals
Edge density
Superpixel straddling
Alexe et al. Measuring the objectness of image windows, PAMI 2012
75
Object proposals
Yellow box: object detected; cyan box: ground truth
More proposals
Alexe et al. Measuring the objectness of image windows, PAMI 2012
76
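For a feel of the interface, here is Selective Search from opencv-contrib, a different but widely used proposal generator (not the objectness measure of Alexe et al.; the file name is a placeholder):

```python
import cv2

# Requires the opencv-contrib-python package
ss = cv2.ximgproc.segmentation.createSelectiveSearchSegmentation()
img = cv2.imread("scene.jpg")                 # any test image
ss.setBaseImage(img)
ss.switchToSelectiveSearchFast()
boxes = ss.process()                          # array of (x, y, w, h)
print(len(boxes), "proposals")                # typically thousands
# a detector then scores only these boxes instead of every window
```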
Deformable Part Model (DPM)
• Represents an object as a
collection of parts arranged in a
deformable configuration
• Each part represents local
appearances
• Spring-like connections between
certain pairs of parts
Fischler and Elschlager, Pictorial Structures,
1973
Felzenszwalb et al., PAMI 2010
78
Deformable Part Model (DPM)
79
Deformable Part Model (DPM)
• References
‒ Pedro F. Felzenszwalb & Daniel P. Huttenlocher, Pictorial Structures for Object
Recognition, IJCV 2005
• https://www.cs.cornell.edu/~dph/papers/pict-struct-ijcv.pdf
‒ P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan. Object
detection with discriminatively trained part based models. IEEE Transactions
on Pattern Analysis and Machine Intelligence, 32(9):1627–1645, 2010
80
Object detection: Evaluation
81
Object Detection Benchmarks
• PASCAL VOC Challenge
• ImageNet Large Scale Visual Recognition Challenge
(ILSVRC)
‒ 200 Categories for detection
• Common Objects in Context (COCO)
‒ 80 Object categories
82
How do we evaluate object detection?
predictions
ground truth
True positive:
- The overlap (IoU) of the prediction
with the ground truth is MORE
than a threshold value (e.g., 0.5)
83
How do we evaluate object detection?
predictions
ground truth
True positive:
False positive:
- The overlap (IoU) of the prediction
with the ground truth is LESS
than a threshold value (e.g., 0.5)
84
How do we evaluate object detection?
predictions
ground truth
True positive:
False positive:
False negative:
- The objects that our model
doesn’t find
85
How do we evaluate object detection?
predictions
ground truth
True positive:
False positive:
False negative:
- The objects that our model
doesn’t find
What is a True Negative?
86
precision = TP / (TP + FP)
recall = TP / (TP + FN)
87
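The overlap test behind the TP/FP decisions above is Intersection over Union; a minimal sketch with boxes given as (x0, y0, x1, y1) corner pairs:

```python
def iou(a, b):
    """Intersection over union of two boxes (x0, y0, x1, y1)."""
    ix0, iy0 = max(a[0], b[0]), max(a[1], b[1])
    ix1, iy1 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix1 - ix0) * max(0, iy1 - iy0)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

# a prediction is a true positive if it overlaps some ground-truth
# box with IoU above the threshold (e.g., 0.5); then
# precision = TP / (TP + FP) and recall = TP / (TP + FN)
```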
How do we evaluate object detection?
predictions
ground truth
True positive: 1
False positive: 2
False negative: 1
So what is the
- precision?
- recall?
88
Precision versus recall
• Precision: how many of the object detections
are correct?
precision = TP / (TP + FP)
• Recall: how many of the ground truth objects
can the model detect?
recall = TP / (TP + FN)
‒ recall is also called the True Positive Rate (TPR)
89
• In reality, our model makes a lot of predictions with varying scores
between 0 and 1
predictions
ground truth
Here are all the boxes that
are predicted with score > 0.
This means that our
- Recall is perfect!
- But our precision is BAD!
90
How do we evaluate object detection?
predictions
ground truth
Here are all the boxes that
are predicted with score > 0.5
We are setting a threshold of
0.5
91
Precision – recall curve (PR curve)
92
Which model is the best?
93
Which model is the best?
• Area under curve (AUC), average precision (AP)
• F1-score (highest value at the optimal confidence threshold)
94
Which model is the best?
AP: the average precision for each class
individually, computed across all of the IoU
thresholds
mAP: the average of AP over all classes
95
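A sketch of VOC-style AP as the area under the PR curve; the inputs come from sweeping the detector's confidence threshold as on the previous slides (COCO additionally averages this over IoU thresholds 0.5:0.05:0.95, and mAP averages over classes):

```python
import numpy as np

def average_precision(recall, precision):
    """Area under the PR curve (all-points interpolation).
    recall must be sorted ascending, one point per threshold."""
    r = np.concatenate(([0.0], recall, [1.0]))
    p = np.concatenate(([0.0], precision, [0.0]))
    # make precision monotonically non-increasing (interpolation step)
    for i in range(len(p) - 2, -1, -1):
        p[i] = max(p[i], p[i + 1])
    idx = np.where(r[1:] != r[:-1])[0]     # points where recall changes
    return float(np.sum((r[idx + 1] - r[idx]) * p[idx + 1]))
```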
Summary
• Object recognition as classification task
‒ Boosting (face detection ex)
‒ Support vector machines and HOG (human detection
ex)
‒ Sliding window search paradigm
• Pros and cons
• Speed up with attentional cascade
• Object proposals, proposal regions as alternative
96
References
Most of these slides were adapted from:
1. Kristen Grauman (CS 376: Computer Vision, Spring 2018, The
University of Texas at Austin)
97
Thank
you!
98