Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2009, Advanced Concepts for Intelligent Vision Systems
AI
This paper addresses the challenge of pedestrian tracking in high-density crowd situations, particularly under conditions where traditional full-body tracking methods fail due to occlusions and motion. A new method focusing on detecting and tracking human heads is proposed, utilizing a Viola and Jones Haar-like AdaBoost cascade for detection and a particle filter for tracking. The system demonstrates promising results with a hit rate of 76.8% while processing an average of 35.35 individuals per frame on standard hardware, indicating potential for real-time application in crowded environments.
18th International Conference on Pattern Recognition (ICPR'06), 2006
This paper introduces a multiple human objects tracking system to detect and track multiple objects in the crowded scene in which occlusions occur. Our method assign each pixel to different human object based on its relative distance to that object and the corresponding color model. If no occlusion, we easily track each object independently based on each segmented object region and optical flow. With occlusion, we analyze the color distribution of the occlusion group to differentiate each object in the group. By calculating the distances between objects, we can determine whether an object is separated from the occlusion group and to be tracked individually afterwards.
Human MotionUnderstanding, Modeling, Capture …, 2007
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004.
Tracking of humans in dynamic scenes has been an important topic of research. Most techniques, however, are limited to situations where humans appear isolated and occlusion is small. Typical methods rely on appearance models that must be acquired when the humans enter the scene and are not occluded. We present a method that can track humans in crowded environments, with significant and persistent occlusion by making use of human shape models in addition to camera models, the assumption that humans walk on a plane and acquired appearance models. Experimental results and a quantitative evaluation are included.
7'th International Symposium on Telecommunications (IST'2014), 2014
Multiple people detection and tracking is a challenging task in real-world crowded scenes. In this paper, we have presented an online multiple people tracking-by-detection approach with a single camera. We have detected objects with deformable part models and a visual background extractor. In the tracking phase we have used a combination of support vector machine (SVM) person-specific classifiers, similarity scores, the Hungarian algorithm and inter-object occlusion handling. Detections have been used for training person-specific classifiers and to help guide the trackers by computing a similarity score based on them and spatial information and assigning them to the trackers with the Hungarian algorithm. To handle inter-object occlusion we have used explicit occlusion reasoning. The proposed method does not require prior training and does not impose any constraints on environmental conditions. Our evaluation showed that the proposed method outperformed the state of the art approaches by 10% and 15% or achieved comparable performance
Visual Information Processing XXI, 2012
Video cameras are widely used for monitoring public areas, such as train stations, airports and shopping centers. When crowds are dense, automatically tracking individuals becomes a challenging task. We propose a new tracker which employs a particle filter tracking framework, where the state transition model is estimated by an optical-flow algorithm. In this way, the state transition model directly uses the motion dynamics across the scene, which is better than the traditional way of a pre-defined dynamic model. Our result shows that the proposed tracker performs better on different tracking challenges compared with the state-of-the-art trackers, while also improving on the quality of the result.
2001
This work addresses the problem of automatic tracking of pedestrians observed by a fixed camera in outdoor scenes. Tracking isolated pedestrians is not a difficult task. The challenge arises when the tracking system has to deal with temporary occlusions and groups of pedestrians. In both cases it is not possible to track each pedestrian during the whole video sequence. However, the system should be able to recognize each pedestrian as soon as he/she becomes visible and isolated from the group. This paper presents methods to tackle these difficulties. The proposed system is based on a hierarchical approach which allows the application of the same methods for tracking isolated pedestrians and groups.
2019
Recently, attention has been focused on monitoring crowded areas using video surveillance systems to provide security, safety, monitor human activity, etc. In video surveillance systems human detection and tracking is a very important obligation. Although it is a well-known subject, many challenges are yet to be resolved for real world applications. These include, changes in illumination, camera motion, independent and random human movements, etc. On the other hand, interesting information such as interaction between people or between them and vehicles can also be obtained. Crowded scenes include public places such as airports, bus station, concerts, subway, religious festivals, football matches, railway stations, shopping malls, etc., where many people gather is a challenge for those interested in safety and security systems. In such cases, for effective video surveillance, we may need to monitor a specific place or area. One application of such a system is in crowd control to maintain the general security in public places. It is my privilege to express my sincere sense of gratitude and wholehearted indebtedness to my guide, my supervisor
2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
Single camera-based multiple-person tracking is often hindered by difficulties such as occlusion and changes in appearance. In this paper, we address such problems by proposing a robust part-based tracking-by-detection framework. Human detection using part models has become quite popular, yet its extension in tracking has not been fully explored. Our approach learns part-based person-specific SVM classifiers which capture the articulations of the human bodies in dynamically changing appearance and background. With the part-based model, our approach is able to handle partial occlusions in both the detection and the tracking stages. In the detection stage, we select the subset of parts which maximizes the probability of detection, which significantly improves the detection performance in crowded scenes. In the tracking stage, we dynamically handle occlusions by distributing the score of the learned person classifier among its corresponding parts, which allows us to detect and predict partial occlusions, and prevent the performance of the classifiers from being degraded. Extensive experiments using the proposed method on several challenging sequences demonstrate state-of-the-art performance in multiple-people tracking.
Computer Vision and Image Understanding, 2000
A computer vision system for tracking multiple people in relatively unconstrained environments is described. Tracking is performed at three levels of abstraction: regions, people and groups. A novel, adaptive background subtraction method that combines color and gradient information is used to cope with shadows and unreliable color cues. People are tracked through mutual occlusions as they form groups and separate from one another. Strong use is made of color information to disambiguate occlusions and to provide qualitative estimates of depth ordering and position during occlusion. Simple interactions with objects can also be detected. The system is tested using both indoor and outdoor sequences. It is robust and should provide a useful mechanism for bootstrapping and reinitialization of tracking using more specific but less robust human models.
Multi-target tracking in crowd scenes is a highly challenging problem due to appearance ambiguity and frequent occlusions between different targets. While many impressive works have been done on complex appearance models and data association framework, we address the importance of social behaviour knowledge to overcome these challenges. The proposed model, termed Crowd Context Model (CCM), offers a general framework which jointly models the appearance features and behaviour rules together, with cooperation methods to achieve model-driven multi-target tracking. We use behaviour modelling approach to make reasonable prediction on pedestrian's location. A Multi-template Appearance Model (MAM) using simple appearance features is adopted for target localization. Experiments on real video sequences show that the proposed model-driven method improves the performance of multi-target tracking successfully, especially during occlusions.
2015 IEEE 18th International Conference on Intelligent Transportation Systems, 2015
This paper focuses on tracking in typical traffic monitoring scenarios with emphasis on handling occlusions caused by trees, lampposts and cables. We extend the existing TRacking with Occlusion handling and Drift correction (TROD) algorithm with a novel occlusion detection algorithm, based on measuring the changes in the object motion pattern. The motion information is extracted via frame differencing and described a the HOG descriptor. Occlusions are handled by preventing the model update and predicting the object location based on prior observations. Our proposed system clearly outperforms state-of-the-art tracking algorithms for larger occlusions in the specific pedestrian surveillance scenario, that is, the percentage of successfully tracked objects grows with 10-15%. At the same time, for non-specific public datasets, the performance is similar to existing state-of-the-art tracking algorithms.
arXiv (Cornell University), 2016
Studies on microscopic pedestrian requires large amounts of trajectory data from real-world pedestrian crowds. Such data collection, if done manually, needs tremendous effort and is very time consuming. Though many studies have asserted the possibility of automating this task using video cameras, we found that only a few have demonstrated good performance in very crowded situations or from a top-angled view scene. This paper deals with tracking pedestrian crowd under heavy occlusions from an angular scene. Our automated tracking system consists of two modules that perform sequentially. The first module detects moving objects as blobs. The second module is a tracking system. We employ probability distribution from the detection of each pedestrian and use Bayesian update to track the next position. The result of such tracking is a database of pedestrian trajectories over time and space. With certain prior information, we showed that the system can track a large number of people under occlusion and clutter scene.
Signal Processing: Image Communication, 2015
Recently significant progress has been made in the field of person detection and tracking. However, crowded scenes remain particularly challenging and can deeply affect the results due to overlapping detections and dynamic occlusions. In this paper, we present a method to enhance human detection and tracking in crowded scenes. It is based on introducing additional information about crowds
International Symposium on Visual Computing, 2010
This paper addresses the problem of tracking moving objects of variable appearance in challenging scenes rich with features and texture. Reliable tracking is of pivotal importance in surveillance applications. It is made particularly difficult by the nature of objects encountered in such scenes: these too change in appearance and scale, and are often articulated (e.g. humans). We propose a method which uses fast motion detection and segmentation as a constraint for both building appearance models and their robust propagation (matching) in time. The appearance model is based on sets of local appearances automatically clustered using spatio-kinetic similarity, and is updated with each new appearance seen. This integration of all seen appearances of a tracked object makes it extremely resilient to errors caused by occlusion and the lack of permanence of due to low data quality, appearance change or background clutter. These theoretical strengths of our algorithm are empirically demonstrated on two hour long video footage of a busy city marketplace.
2008
Segmentation and tracking of multiple humans in crowded situations is made difficult by interobject occlusion. We propose a model-based approach to interpret the image observations by multiple partially occluded human hypotheses in a Bayesian framework. We define a joint image likelihood for multiple humans based on the appearance of the humans, the visibility of the body obtained by occlusion reasoning, and foreground/background separation. The optimal solution is obtained by using an efficient sampling method, data-driven Markov chain Monte Carlo (DDMCMC), which uses image observations for proposal probabilities. Knowledge of various aspects, including human shape, camera model, and image cues, are integrated in one theoretically sound framework. We present experimental results and quantitative evaluation, demonstrating that the resulting approach is effective for very challenging data.
2004
We present a novel approach for continuous detection and tracking of moving objects observed by multiple stationary cameras. We address the tracking problem by simultaneously modeling motion and appearance of the moving objects. The object's appearance is represented using color distribution model invariant to 2D rigid and scale transformation. It provides an efficient blob similarity measure for tracking. The motion models are obtained using a Kalman Filter process, which predicts the position of the moving object in both 2D and 3D. The tracking is performed by the maximization of a joint probability model reflecting objects' motion and appearance. The novelty of our approach consists in integrating multiple cues and multiple views in a Joint Probability Data Association Filter for tracking a large number of moving people with partial and total occlusions. We demonstrate the performance of the proposed method on a soccer game captured by two stationary cameras.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000
In this paper, we address the problem of automatically detecting and tracking a variable number of persons in complex scenes using a monocular, potentially moving, uncalibrated camera. We propose a novel approach for multi-person tracking-bydetection in a particle filtering framework. In addition to final high-confidence detections, our algorithm uses the continuous confidence of pedestrian detectors and online trained, instance-specific classifiers as a graded observation model. Thus, generic object category knowledge is complemented by instance-specific information. The main contribution of this paper is to explore how these unreliable information sources can be used for robust multi-person tracking. The algorithm detects and tracks a large number of dynamically moving persons in complex scenes with occlusions, does not rely on background modeling, requires no camera or ground plane calibration, and only makes use of information from the past. Hence, it imposes very few restrictions and is suitable for online applications. Our experiments show that the method yields good tracking performance in a large variety of highly dynamic scenarios, such as typical surveillance videos, webcam footage, or sports sequences. We demonstrate that our algorithm outperforms other methods that rely on additional information. Furthermore, we analyze the influence of different algorithm components on the robustness.
2023
With the rapid development of deep learning, object detection and tracking play a vital role in today's society. Being able to identify and track all the pedestrians in the dense crowd scene with computer vision approaches is a typical challenge in this field, also known as the Multiple Object Tracking (MOT) challenge. Modern trackers are required to operate on more and more complicated scenes. According to the MOT20 challenge result, the pedestrian is 4 times denser than the MOT17 challenge. Hence, improving the ability to detect and track in extremely crowded scenes is the aim of this work. In light of the occlusion issue with the human body, the heads are usually easier to identify. In this work, we have designed a joint head and body detector in an anchorfree style to boost the detection recall and precision performance of pedestrians in both small and medium sizes. Innovatively, our model does not require information on the statistical head-body ratio for common pedestrians detection for training. Instead, the proposed model learns the ratio dynamically. To verify the effectiveness of the proposed model, we evaluate the model with extensive experiments on different datasets, including MOT20, Crowdhuman, and HT21 datasets. As a result, our proposed method significantly improves both the recall and precision rate on small&medium sized pedestrians and achieves state-of-the-art results in these challenging datasets.
1998
A combined 2D, 3D approach is presented that allows for robust tracking of moving bodies in a given environment as observed via a single, uncalibrated video camera. Lowlevel features are often insufficient for detection, segmentation, and tracking of non-rigid moving objects. Therefore, an improved mechanism is proposed that combines lowlevel (image processing) and mid-level (recursive trajectory estimation) information obtained during the tracking process. The resulting system can segment and maintain the tracking of moving objects before, during, and after occlusion. At each frame, the system also extracts a stabilized coordinate frame of the moving objects. This stabilized frame can be used as input to motion recognition modules. The approach enables robust tracking without constraining the system to know the shape of the objects being tracked beforehand; although, some assumptions are made about the characteristics of the shape of the objects, and how they evolve with time. Experiments in tracking moving people are described.
Mantech Publications , 2021
Multiple people detection in real-time is still a challenging task despite having different techniques. It is challenging because partially occluded people are still often not recognized in a heavily populated area, and also due to Non-Maximum suppression, correct bounding boxes are also discarded, which leads to imprecision in the detections. This paper presents the various modifications done to multiple people detection and tracking algorithms, which improves the efficiency and accuracy of the previously used cases.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.