0% found this document useful (0 votes)

38 views31 pages

Faster R-CNN - Deep Dive Into Object Detection

Faster R-CNN is a groundbreaking object detection framework developed in 2015 that integrates a Region Proposal Network for efficient region proposals, allowing for end-to-end training. It has significant applications in fields such as autonomous vehicles, medical imaging, and surveillance, enhancing both accuracy and speed in detection tasks. Despite its advantages, Faster R-CNN has limitations in speed compared to single-stage detectors and requires substantial computational resources.

Uploaded by

221210088

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views31 pages

Faster R-CNN - Deep Dive Into Object Detection

Uploaded by

221210088

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 31

Faster R-CNN: Deep Dive

into Object Detection

Faster R-CNN is a revolutionary approach to computer vision. It was
developed by Shaoqing Ren, Kaiming He, and their team in 2015. It
represented a breakthrough in real-time object detection technology.
Introduction to Object Detection
Definition Key Tasks Critical Applications
Object detection pinpoints and It performs precise localization and Essential for self-driving cars and
categorizes objects within images, accurate classification. advanced surveillance systems.
grappling with size variations and
intricate backgrounds
Evolution of Object Detection
Models
1 R-CNN (2014)
First deep learning approach; slow due to selective search.

2 Fast R-CNN (2015)

Improved computation efficiency but still dependent on
external region proposals.

3 Faster R-CNN
End-to-end trainable with Region Proposal Network (RPN).
R-CNN Family Overview
Region Proposal Network Shared Convolutional Anchor Boxes
(RPN) Features Handles multi-scale object
Core innovation for efficient Features shared across detection detection.
region proposals. stages.
Real-World Applications
Autonomous Vehicles Medical Imaging
Detects pedestrians, signs, Identifies anomalies and
and other vehicles for safe structures to assist in
navigation. diagnoses.

Retail
Manages inventory and tracks products for streamlined operations.

Faster R-CNN has future potential in robotics, security, and various AI

systems.
R-CNN: A Brief Recap
Selective Search
Identifies potential object regions within an image.

Feature Extraction
Extracts CNN features from each proposed region.

Classification
Classifies objects within extracted regions.

R-CNN is slow due to per-region CNN processing.

Fast R-CNN: A Recap

1 Single CNN Pass 2 RoI Pooling 3 Classification

The entire image is processed Region of Interest pooling Classifies objects and refines
once to extract features. extracts fixed-size feature maps. bounding box predictions.
Why Faster R-CNN?

1 Speed Bottleneck 2 Integrated Mechanism 3 End-to-End Training

Region proposals were slowing Faster R-CNN uses an integrated, The entire detection process can
down the entire pipeline. learnable proposal mechanism. be trained end-to-end,
optimizing performance.
Faster R-CNN Architecture
Backbone CNN
Extracts feature maps from input images.

Region Proposal Network (RPN)

Generates region proposals using anchor boxes.

RoI Pooling
Pools features from each region proposal.

Detector
Classifies objects and refines bounding boxes.
Feature Extraction

1 Backbone CNNs 2 Feature Maps 3 Deep Features

VGG16, ResNet, and MobileNet Backbone CNNs produce feature Deeper networks extract more
are common choices for feature maps from the input image. complex features for object
extraction. detection.
Region Proposal Network (RPN) Introduction

Object-Like Regions Anchor Boxes Fully Convolutional

The RPN quickly identifies RPN uses anchor boxes to RPN is a fully convolutional
regions that likely contain propose regions of various network for efficient processing.
objects. scales and ratios.
How RPN Works

Anchor Boxes Sliding Window Bounding Box Regression

RPN uses anchor boxes at each The RPN employs a sliding window RPN refines anchor boxes to better fit
location to propose regions of different approach on the feature map. the objects.
sizes.
Anchors in RPN

1 Fixed-Size Reference 2 Multiple Scales and Ratios 3 Location Specific

Boxes Anchors are generated at each
Anchors serve as the foundation They enable detection of objects location in the feature map.
for region proposals. with varying dimensions.
RPN Outputs
Objectness Score Bounding Box Offsets
Assigns a probability to each region proposal. Predicts adjustments to refine the anchor boxes.

Indicates likelihood of containing an object (foreground or Offsets are relative to the original anchor's location and
background). size.
RPN Process
Feature Map 1
The RPN takes a feature map as input.

2 Sliding Window
A sliding window scans across the feature map.

Anchor Boxes 3
At each location, anchor boxes propose regions.

4 Classification
Classify regions as object or background.

Regression 5
Refine bounding box coordinates for accuracy.
Anchors in RPN
Fixed Reference Boxes
Anchors are fixed-size reference boxes.

Multiple Scales
Anchors have multiple scales to capture objects of
various sizes.
Aspect Ratios
Multiple aspect ratios allows detection of different
shapes.
Location Specific
Anchors are generated at each location.
RPN Outputs

1 Objectness Score 2 Bounding Box Regression 3 Refined Proposals

This measures how likely a box RPN outputs refined region
contains an object. Offsets refine anchor boxes to proposals for detection.
precisely fit objects.
Loss Function in RPN

1 Classification Loss 2 Regression Loss 3 Combined Loss

Evaluates the accuracy in Calculates the error between RPN optimizes a combined loss
classifying region proposals as predicted and ground truth function for objectness and box
objects or background. bounding box coordinates. refinement.
Non-Maximum Suppression (NMS)
NMS removes duplicate proposals, refining object detection
results.

It keeps only high-confidence, non-overlapping bounding

boxes.

NMS enhances detection accuracy by eliminating

redundant detections.
Non-Maximum Suppression (NMS)

1 Duplicate Removal 2 Confidence Threshold 3 Accuracy

NMS eliminates redundant Keeps high-scoring, non- Enhances detection accuracy for
detections. overlapping boxes. clear results.
Sharing Convolutional Layers

1 Feature Sharing 2 Computational Efficiency 3 Improved Speed

The RPN and object detector The shared backbone enhances
share convolutional layers. Feature sharing avoids the speed.
redundant computation.
Region of Interest (RoI) Pooling

Fixed-Size Feature Maps Batch Processing Region of Interest

Converts variable-size proposals into RoI Pooling enables efficient batch Focuses processing on relevant regions
fixed-size feature maps. processing in object detection. to improve speed and reduce
computation.
Object Classification and Bounding Box Regression

Object Classification
Assign a category to each region proposal.

Bounding Box Regression

Refine the coordinates for accurate localization.

Output
The result is accurate object detection.
Multi-task Loss in Faster R-CNN

1 Combined Loss 2 End-to-End Optimization 3 Improved Accuracy

Faster R-CNN employs a multi- By unifying classification and
task loss function for It allows for end-to-end training, regression, accuracy is
classification and localization. optimizing object detection significantly enhanced.
performance.
Training Pipeline of Faster R-CNN
Faster R-CNN employs an alternating training process. It
refines both RPN and object detector.

1. Train the Region Proposal Network (RPN) initially.

1. Fix RPN proposals to train the detector.
1. Train the object detector using fixed RPN proposals.
1. Fine-tune RPN and detector jointly to optimize
performance.
Inference Pipeline of Faster R-CNN
Single Forward Pass
Faster R-CNN uses a streamlined inference process.

Feature extraction, RPN, RoI pooling, and prediction

happen.

This single pass ensures efficient object detection.

Real-World Applications
Assistive Technology
Apps for the visually impaired enhance object
recognition.
Self-Driving Cars
Object detection is critical for autonomous
navigation.
Surveillance
Surveillance systems use Faster R-CNN for security
monitoring.
Advantages of Faster R-CNN
State-of-the-Art Accuracy
It achieves high object detection accuracy.

End-to-End Trainable
It optimizes performance.

Flexible Backbones
It supports different convolutional networks.
Limitations of Faster R-CNN
Speed
Slower than some single-stage detectors. YOLO
and SSD can be faster.
Resources
Higher memory and compute requirements. This
can be a disadvantage.
Real-Time
Not always ideal for ultra real-time needs. Other
models may be preferred.
Variants and Improvements

Mask R-CNN Cascade R-CNN Faster R-CNN with FPN

Adds a mask branch for pixel-level Employs a cascade of detectors for Utilizes a Feature Pyramid Network for
segmentation. It performs object higher quality. Achieves better multi-scale detection. Improves
detection and segmentation. precision in object detection. detection of objects at different scales.
Thank You
We appreciate your time and attention.

Faster R-CNN represents a significant advancement. It has enabled

more accurate and efficient object detection.

Sahil Dhillon (221210092)

Riya (221210088)
Priya pandey (221210082)

Fast Methods For Deep Learning Based Object Detection
No ratings yet
Fast Methods For Deep Learning Based Object Detection
43 pages
BTP Report Faster R CNN Compressed
No ratings yet
BTP Report Faster R CNN Compressed
32 pages
Deep Learning Algorithms For Object Detection
No ratings yet
Deep Learning Algorithms For Object Detection
43 pages
Lecture Paola Object Detection
No ratings yet
Lecture Paola Object Detection
29 pages
L7 Detection
No ratings yet
L7 Detection
54 pages
Object Detection Using CNN-RCNN.-1
No ratings yet
Object Detection Using CNN-RCNN.-1
14 pages
MV cs4243 2024 Amir 6 p2
No ratings yet
MV cs4243 2024 Amir 6 p2
95 pages
Object Detection1
No ratings yet
Object Detection1
29 pages
R CNN Regions With Convolutional Neural Network Features
No ratings yet
R CNN Regions With Convolutional Neural Network Features
8 pages
R-CNN: Overview of Object Detection Models
No ratings yet
R-CNN: Overview of Object Detection Models
28 pages
Face Detection With The Faster R-CNN
No ratings yet
Face Detection With The Faster R-CNN
6 pages
Beginner's Guide to R-CNN Basics
No ratings yet
Beginner's Guide to R-CNN Basics
6 pages
Unit 3
No ratings yet
Unit 3
45 pages
RCNN
No ratings yet
RCNN
25 pages
A Comprehensive Survey of The R-CNN Family For Object Detection
No ratings yet
A Comprehensive Survey of The R-CNN Family For Object Detection
6 pages
Object Detection
No ratings yet
Object Detection
57 pages
Object Detection
No ratings yet
Object Detection
76 pages
CVR FDP
No ratings yet
CVR FDP
37 pages
Fast R-CNN: Enhancing Object Detection
No ratings yet
Fast R-CNN: Enhancing Object Detection
4 pages
R-CNN and Selective Search Overview
No ratings yet
R-CNN and Selective Search Overview
6 pages
Li 2021 J. Phys.: Conf. Ser. 1827 012085
No ratings yet
Li 2021 J. Phys.: Conf. Ser. 1827 012085
11 pages
Object Recognition with Deep Learning
No ratings yet
Object Recognition with Deep Learning
47 pages
An Improved Faster R-CNN For Same Object
No ratings yet
An Improved Faster R-CNN For Same Object
12 pages
cv2021 Lec6 Object Detection - 1600 - PDF - Gdrive.vip
No ratings yet
cv2021 Lec6 Object Detection - 1600 - PDF - Gdrive.vip
60 pages
Najibi G-CNN An Iterative CVPR 2016 Paper
No ratings yet
Najibi G-CNN An Iterative CVPR 2016 Paper
9 pages
Yolo Family
No ratings yet
Yolo Family
40 pages
R-CNN Minus R: Karel Lenc Andrea Vedaldi
No ratings yet
R-CNN Minus R: Karel Lenc Andrea Vedaldi
9 pages
139 Pretrained Networks Object Detection
No ratings yet
139 Pretrained Networks Object Detection
22 pages
R-CNN vs Fast R-CNN Analysis
No ratings yet
R-CNN vs Fast R-CNN Analysis
4 pages
Understanding Object Detection Techniques
No ratings yet
Understanding Object Detection Techniques
46 pages
Understanding and Implementing Faster R-CNN - by Rishabh Singh - Medium
No ratings yet
Understanding and Implementing Faster R-CNN - by Rishabh Singh - Medium
14 pages
Obstacle Detection and Classification Using Deep Learning For Tracking in High-Speed Autonomous Driving
No ratings yet
Obstacle Detection and Classification Using Deep Learning For Tracking in High-Speed Autonomous Driving
6 pages
BTP PPT Phase1
No ratings yet
BTP PPT Phase1
14 pages
Oriented R-CNN: Efficient Object Detection
No ratings yet
Oriented R-CNN: Efficient Object Detection
10 pages
R-CNN, Fast R-CNN, Faster R-CNN, YOLO - Object Detection Algorithms
No ratings yet
R-CNN, Fast R-CNN, Faster R-CNN, YOLO - Object Detection Algorithms
11 pages
Ref 16
No ratings yet
Ref 16
14 pages
Advanced Object Detection Guide
No ratings yet
Advanced Object Detection Guide
90 pages
Real Time Object Detection System
No ratings yet
Real Time Object Detection System
31 pages
Generalized R-CNN for Researchers
No ratings yet
Generalized R-CNN for Researchers
127 pages
Object Detection Techniques Overview
No ratings yet
Object Detection Techniques Overview
22 pages
Faster R-CNN: Real-Time Object Detection
No ratings yet
Faster R-CNN: Real-Time Object Detection
13 pages
Fast R-CNN
No ratings yet
Fast R-CNN
9 pages
IMINT Target Acquisition Using Deep Learning
No ratings yet
IMINT Target Acquisition Using Deep Learning
5 pages
Faster R-CNN with Region Proposal Networks
No ratings yet
Faster R-CNN with Region Proposal Networks
9 pages
Fast R-CNN
No ratings yet
Fast R-CNN
9 pages
Deep Learning for Daily Object Detection
No ratings yet
Deep Learning for Daily Object Detection
6 pages
Yolo
No ratings yet
Yolo
24 pages
Fast R-CNN (R Girshick 2015) PDF
No ratings yet
Fast R-CNN (R Girshick 2015) PDF
9 pages
Region-Based Object Detection and Classification Using Faster R-CNN
No ratings yet
Region-Based Object Detection and Classification Using Faster R-CNN
6 pages
Object Detection & Segmentation Guide
No ratings yet
Object Detection & Segmentation Guide
38 pages
Lenc 15 RCNN
No ratings yet
Lenc 15 RCNN
12 pages
Object Detection Report
No ratings yet
Object Detection Report
27 pages
Object Detection and Identification
67% (3)
Object Detection and Identification
20 pages
Multilateral OCC with CNN Models
No ratings yet
Multilateral OCC with CNN Models
9 pages
Faster RCNN with PyTorch Guide
No ratings yet
Faster RCNN with PyTorch Guide
1 page
L10 Lecture Detection - Segmentation v2.5
No ratings yet
L10 Lecture Detection - Segmentation v2.5
35 pages
Compiler
No ratings yet
Compiler
9 pages
DL Assignment 1
No ratings yet
DL Assignment 1
6 pages
HealthBotX - Voice-Based Multilingual Health Assistant For Rural India
No ratings yet
HealthBotX - Voice-Based Multilingual Health Assistant For Rural India
13 pages
Data Communication
No ratings yet
Data Communication
9 pages
Computer Vision
No ratings yet
Computer Vision
33 pages
Efflorescence FS Feb 11
No ratings yet
Efflorescence FS Feb 11
2 pages
Bender Koppitz2-Chap. 2
No ratings yet
Bender Koppitz2-Chap. 2
22 pages
Bohne 1984
100% (1)
Bohne 1984
4 pages
John Crane Type 5610 and 5610Q Single O-Ring Cartridge Seal Assembly and Installation Instructions
No ratings yet
John Crane Type 5610 and 5610Q Single O-Ring Cartridge Seal Assembly and Installation Instructions
6 pages
The Enjoyment of Math 1st Edition Hans Rademacher PDF Available
0% (1)
The Enjoyment of Math 1st Edition Hans Rademacher PDF Available
85 pages
Master American Accent: 9 Tips
No ratings yet
Master American Accent: 9 Tips
20 pages
UNIT III Design and Architecture Part II
No ratings yet
UNIT III Design and Architecture Part II
94 pages
Mickey Mouse Steamboat Willie Black-and-White Nen - ToyShnip
No ratings yet
Mickey Mouse Steamboat Willie Black-and-White Nen - ToyShnip
1 page
Simplified Piled Raft Design
No ratings yet
Simplified Piled Raft Design
7 pages
NVS Recruitment 2019: Principal & Teachers
No ratings yet
NVS Recruitment 2019: Principal & Teachers
14 pages
USF-50 Series Technical Training: Glory - LTD Ver. 3.0
100% (2)
USF-50 Series Technical Training: Glory - LTD Ver. 3.0
372 pages
BRTF14: Tetra Optical Macro Slave Repeater
No ratings yet
BRTF14: Tetra Optical Macro Slave Repeater
2 pages
FLR1600
No ratings yet
FLR1600
3 pages
Power BI Architecture
100% (2)
Power BI Architecture
47 pages
AC2A110350
No ratings yet
AC2A110350
1 page
Tybms Sem - Vi (Apr 2023)
No ratings yet
Tybms Sem - Vi (Apr 2023)
39 pages
Multiple-Choice Questions On Pressure in Fluids
100% (4)
Multiple-Choice Questions On Pressure in Fluids
3 pages
Quarter 3 Ppt1
No ratings yet
Quarter 3 Ppt1
30 pages
Evacuated Tube Collector
No ratings yet
Evacuated Tube Collector
5 pages
Online Examination
100% (1)
Online Examination
63 pages
Phys Exp 4
No ratings yet
Phys Exp 4
3 pages
Sakai SV521 - Spec 2019
No ratings yet
Sakai SV521 - Spec 2019
4 pages
Crisc D1 Qa
No ratings yet
Crisc D1 Qa
280 pages
Hitachi Thermal Power Equipment Guide
100% (1)
Hitachi Thermal Power Equipment Guide
16 pages
Iso 19036 - 2019 - Estimation of Measurement Uncertainty For Quantitative Determinations
No ratings yet
Iso 19036 - 2019 - Estimation of Measurement Uncertainty For Quantitative Determinations
46 pages
Essential AI Tools for Journalists
No ratings yet
Essential AI Tools for Journalists
20 pages
87 NURS FPX 6112 Assessment 3
No ratings yet
87 NURS FPX 6112 Assessment 3
3 pages
The Compliment as Social Strategy
No ratings yet
The Compliment as Social Strategy
12 pages
High Voltage Insulating Materials
No ratings yet
High Voltage Insulating Materials
8 pages
50 THE Effect of - Thiamine (Vitamin B1) ON OF Yeast: Fermentation
No ratings yet
50 THE Effect of - Thiamine (Vitamin B1) ON OF Yeast: Fermentation
7 pages