Fall Detection:
At a high level, fall detection works in three modules: Object Detection, Pose Estimation and
Action Recognition.
The first module is Object Detection, which employs the TinyYOLOv3 object detection model.
TinyYOLOv3 is a lightweight version of the YOLO (You Only Look Once) object detection model,
designed for real-time object detection. It follows the same basic structure as the original
YOLOv3 model, but with fewer convolutional layers and parameters, making it more
computationally efficient.
The structure of TinyYOLOv3 begins with an input layer that receives the image. Next comes the
backbone network, which is responsible for extracting features from the input image.
TinyYOLOv3's backbone is a simplified version of the Darknet-53 network, consisting of
convolutional layers and residual blocks. The following stage is the Feature Pyramid Network
(FPN), which combines features from different layers of the backbone and allows the model to
detect objects at multiple scales. Finally, the detection head is responsible for predicting
bounding boxes and class probabilities for the objects in the image. It consists of two YOLO
layers, each detecting objects at a different scale: one operates on a coarser 13x13 grid and is
suited to larger objects, while the other operates on a finer 26x26 grid and is suited to smaller
objects.
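As a rough illustration of this layout, the snippet below sketches the raw output of one detection head, assuming a 416x416 input, 3 anchors per scale and 80 classes; these numbers are illustrative assumptions, not values taken from the project code.

```python
import torch

# Rough illustration of a TinyYOLOv3 detection head's raw output layout,
# assuming a 416x416 input, 3 anchors per scale and 80 classes.
num_anchors, num_classes = 3, 80
channels = num_anchors * (5 + num_classes)   # 4 box offsets + objectness + class scores

coarse = torch.zeros(1, channels, 13, 13)    # 13x13 grid: larger objects
fine = torch.zeros(1, channels, 26, 26)      # 26x26 grid: smaller objects

# Reshape so the last dimension holds (tx, ty, tw, th, objectness, class scores)
coarse = coarse.view(1, num_anchors, 5 + num_classes, 13, 13).permute(0, 1, 3, 4, 2)
print(coarse.shape)                          # torch.Size([1, 3, 13, 13, 85])
```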
These bounding boxes are generated using the concept of anchor boxes. TinyYOLOv3 uses a set of
predefined anchor boxes, which are bounding boxes of different aspect ratios and scales. These
anchor boxes are chosen based on the distribution of object sizes and aspect ratios in the
training data. Each input image fed into the model is then divided into a grid of cells;
TinyYOLOv3 uses two such grids, 13x13 and 26x26, one for each detection scale.
For each cell in the grid, the model predicts a fixed number of bounding boxes. The model
predicts four values for each bounding box: x, y, width and height. These values represent
offsets from the corresponding anchor box coordinates and dimensions. The model also predicts a
confidence score for each bounding box, which represents the confidence that an object is
present in that box. For each bounding box prediction, the model additionally predicts a
probability for each object class (person, table, cat, etc.); these class probabilities represent
the model's confidence about which class the detected object belongs to.
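A minimal sketch of the standard YOLO decoding of these offsets is shown below. The anchor values listed are the defaults shipped with yolov3-tiny and are an assumption here, since the project may use its own anchors.

```python
import numpy as np

def decode_box(tx, ty, tw, th, cx, cy, anchor_w, anchor_h, grid_size=13, img_size=416):
    """Convert raw offsets (tx, ty, tw, th) predicted for grid cell (cx, cy)
    and one anchor into absolute pixel coordinates (box centre, width, height)."""
    stride = img_size / grid_size
    bx = (1 / (1 + np.exp(-tx)) + cx) * stride   # sigmoid keeps the centre inside its cell
    by = (1 / (1 + np.exp(-ty)) + cy) * stride
    bw = anchor_w * np.exp(tw)                   # width/height scale the anchor dimensions
    bh = anchor_h * np.exp(th)
    return bx, by, bw, bh

# Default yolov3-tiny anchors (width, height) in pixels used by the 13x13 head
anchors_13 = [(81, 82), (135, 169), (344, 319)]
print(decode_box(0.2, -0.1, 0.05, 0.1, cx=6, cy=7, anchor_w=81, anchor_h=82))
```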
The final stage is Non-Maximum Suppression (NMS). Once the bounding box predictions and class
probabilities have been obtained, the NMS algorithm is applied to filter out overlapping
bounding boxes and keep only the most confident predictions. It first sorts the bounding boxes
by their confidence scores and then iterates through them, removing any box that overlaps too
strongly with a higher-confidence box of the same class. The bounding boxes remaining after NMS
are the final object detections, with their corresponding class labels and confidence scores.
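A compact sketch of this procedure (not the project's implementation) could look as follows, with boxes given as (x1, y1, x2, y2) corners and an assumed IoU threshold of 0.45.

```python
import numpy as np

def iou(a, b):
    # Intersection-over-union of two (x1, y1, x2, y2) boxes
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter + 1e-9)

def nms(boxes, scores, iou_threshold=0.45):
    order = np.argsort(scores)[::-1]          # highest confidence first
    keep = []
    while len(order) > 0:
        best = order[0]
        keep.append(best)
        # Drop every remaining box that overlaps the kept box too strongly
        order = np.array([i for i in order[1:] if iou(boxes[best], boxes[i]) < iou_threshold])
    return keep
```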
For pose estimation, we use the FastPose model. It is a neural network architecture that takes
an input image and outputs heatmaps for each human body key point. These heatmaps represent the
likelihood, or confidence, of each key point being present at different locations in the image.
A wrapper class handles tasks such as model loading: it initializes the appropriate FastPose
network, namely InferenNet_fastRes50, based on the specified backbone, which is ResNet-50. It
also handles the cropping and resizing of the input image based on the detected bounding boxes,
preparing the data for the FastPose model. The preprocessed input is then fed into the FastPose
model, and heatmap outputs are obtained for each key point. The heatmaps are processed to obtain
the final key point coordinates. Finally, a non-maximum suppression step is applied to the
predicted poses, removing redundant or overlapping predictions. The FastPose model itself is a
deep convolutional network designed to be efficient and scalable, allowing real-time
multi-person pose estimation on various computing platforms.
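The core of the heatmap post-processing can be sketched as follows. This is an illustrative simplification (FastPose's actual code also refines the peak location), and mapping the peak back through the bounding box is an assumption about the crop geometry rather than the project's exact code.

```python
import numpy as np

def heatmaps_to_keypoints(heatmaps, bbox):
    """heatmaps: (num_keypoints, H, W); bbox: (x1, y1, x2, y2) of the person crop."""
    x1, y1, x2, y2 = bbox
    num_kp, h, w = heatmaps.shape
    keypoints = []
    for k in range(num_kp):
        idx = np.argmax(heatmaps[k])               # location of the strongest response
        py, px = np.unravel_index(idx, (h, w))
        conf = heatmaps[k, py, px]
        # Map heatmap coordinates back into the original image via the bounding box
        x = x1 + (px / w) * (x2 - x1)
        y = y1 + (py / h) * (y2 - y1)
        keypoints.append((x, y, conf))
    return np.array(keypoints)
```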
The action recognition component implements the Two-Stream Spatial Temporal Graph model and is
responsible for recognizing human actions based on the temporal sequence of estimated poses.
The class defines seven actions that it can recognize, namely 'Standing', 'Walking', 'Sitting',
'Lying Down', 'Stand Up', 'Sit Down' and 'Fall Down'. Actions are predicted from a sequence of
pose key points. The pose key points are first normalized and scaled, then converted into a
PyTorch tensor and permuted to match the expected input format of the model.
Initially, the input is a NumPy array of shape (t, v, c), where t is the number of time steps
(frames), v is the number of key points (body parts) and c is the number of channels (typically
2 for the x and y coordinates, sometimes with an additional channel for the key point confidence
score). The model also requires motion information, which is calculated by taking the difference
between consecutive frames of the pose key points. PyTorch models expect input tensors in a
specific format, so the NumPy array of shape (t, v, c) is permuted into shape (c, t, v), where c
is the number of channels, t is the number of time steps and v is the number of key points. This
permutation rearranges the dimensions to match the expected input format of the Two-Stream
Spatial Temporal Graph model. One more dimension is required to match the model's expected
input: the batch dimension. Even though the input is a single sample, a batch dimension of size
1 is added at the front of the tensor while keeping the other dimensions unchanged, giving a
final shape of (1, c, t, v). Permuting the dimensions and adding the batch dimension ensures
that the pose key point data is properly formatted for input to the model.
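The preprocessing chain described above might be sketched as follows. The 30-frame window, 13 key points, and normalization by the frame size are assumptions used for illustration, not values taken from the project.

```python
import numpy as np
import torch

def prepare_pose_tensor(pts, frame_w, frame_h):
    """pts: NumPy array of shape (t, v, c) with pixel coordinates and scores."""
    pts = pts.astype(np.float32).copy()
    pts[:, :, 0] /= frame_w                      # scale x into [0, 1]
    pts[:, :, 1] /= frame_h                      # scale y into [0, 1]

    # Motion stream: frame-to-frame differences of the (x, y) coordinates
    motion = pts[1:, :, :2] - pts[:-1, :, :2]    # shape (t-1, v, 2)

    # (t, v, c) -> (c, t, v), then add a batch dimension of size 1
    pose = torch.from_numpy(pts).permute(2, 0, 1).unsqueeze(0)       # (1, c, t, v)
    motion = torch.from_numpy(motion).permute(2, 0, 1).unsqueeze(0)  # (1, 2, t-1, v)
    return pose, motion

pose, motion = prepare_pose_tensor(np.random.rand(30, 13, 3), 640, 480)
print(pose.shape, motion.shape)   # torch.Size([1, 3, 30, 13]) torch.Size([1, 2, 29, 13])
```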
The next step is the action recognition itself. The action recognition component leverages the
temporal information encoded in the sequence of pose key points to classify the observed human
actions. For each confirmed track (tracked person), the code checks whether the length of the
key points list is equal to 30. If the list has 30 elements, the model has enough temporal
information to predict the action. The key points list is converted to a NumPy array and passed
to the predict method of the Two-Stream Spatial Temporal Graph model. The predicted action
probabilities are obtained from the model's output, and the name of the action with the highest
probability is retrieved. The action name and its corresponding probability are formatted into a
string, which is then drawn on the output frame along with a color code indicating the type of
action.
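A hypothetical sketch of this per-track decision logic is shown below; track, keypoints_list, action_model and the colour choices are illustrative placeholders rather than the project's actual identifiers.

```python
import numpy as np

# The seven action classes listed earlier in this section
ACTIONS = ['Standing', 'Walking', 'Sitting', 'Lying Down',
           'Stand Up', 'Sit Down', 'Fall Down']

def classify_track(track, action_model, frame_size):
    # Only predict once 30 frames of key points have been accumulated for this track
    if len(track.keypoints_list) == 30:
        pts = np.array(track.keypoints_list, dtype=np.float32)  # shape (30, v, c)
        probs = action_model.predict(pts, frame_size)           # one probability per action
        best = int(np.argmax(probs))
        label = f"{ACTIONS[best]}: {probs[best] * 100:.2f}%"
        # Highlight falls in red, everything else in green (illustrative colour code)
        color = (0, 0, 255) if ACTIONS[best] == 'Fall Down' else (0, 255, 0)
        return label, color
    return None, None
```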