Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2004, Lecture Notes in Computer Science
This paper describes a real-time system for multi-target tracking and classification in image sequences from a single stationary camera. Several targets can be tracked simultaneously in spite of splits and merges amongst the foreground objects and presence of clutter in the segmentation results. In results we show tracking of upto 17 targets simultaneously. The algorithm combines Kalman filter-based motion and shape tracking with an efficient pattern matching algorithm. The latter facilitates the use of a dynamic programming strategy to efficiently solve the data association problem in presence of multiple splits and merges. The system is fully automatic and requires no manual input of any kind for initialization of tracking. The initialization for tracking is done using attributed graphs. The algorithm gives stable and noise free track initialization. The image based tracking results are used as inputs to a Bayesian network based classifier to classify the targets into different categories. After classification a simple 3D model for each class is used along with camera calibration to obtain 3D tracking results for the targets. We present results on a large number of real world image sequences, and accurate 3D tracking results compared with the readings from the speedometer of the vehicle. The complete tracking system including segmentation of moving targets works at about 25Hz for 352×288 resolution color images on a 2.8 GHz pentium-4 desktop.
It is important to maintain the identity of multiple targets while tracking them in some applications such as behavior understanding. However, unsatisfying tracking results may be produced due to different real-time conditions. These conditions include: inter-object occlusion, occlusion of the ocjects by background obstacles, splits and merges, which are observed when objects are being tracked in real-time. In this paper, an algorithm of feature-based using Kalman filter motion to handle multiple objects tracking is proposed. The system is fully automatic and requires no manual input of any kind for initialization of tracking. Through establishing Kalman filter motion model with the features centroid and area of moving objects in a single fixed camera monitoring scene, using information obtained by detection to judge whether merge or split occurred, the calculation of the cost function can be used to solve the problems of correspondence after split happened. The algorithm proposed is validated on human and vehicle image sequence algorithm proposed achieve efficient tracking of multiple moving objects under the confusing situations.
2003
The study of collaborative, distributed, real-time sensor networks is an emerging research area. Such networks are expected to play an essential role in a number of applications such as, surveillance and tracking of vehicles in the battlefield of the future. This paper proposes an approach to detect and classify multiple targets, and collaboratively track their position and velocity utilizing video cameras. Arbitrarily placed cameras collaboratively perform self-calibration and provide complete battlefield coverage.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019
In this work, we propose a tracker that differs from most existing multi-target trackers in two major ways. Firstly, our tracker does not rely on a pre-trained object detector to get the initial object hypotheses. Secondly, our tracker's final output is the fine contours of the targets rather than traditional bounding boxes. Therefore, our tracker simultaneously solves three main problems: detection, data association and segmentation. This is especially important because the output of each of those three problems are highly correlated and the solution of one can greatly help improve the others. The proposed algorithm consists of two main components: structured learning and Lagrange dual decomposition. Our structured learning based tracker learns a model for each target and infers the best locations of all targets simultaneously in a video clip. The inference of our structured learning is achieved through a new Target Identity-aware Network Flow (TINF), where each node in the network encodes the probability of each target identity belonging to that node. The probabilities are obtained by training target specific models using a global structured learning technique. This is followed by proposed Lagrangian relaxation optimization to find the high quality solution to the network. This forms the first component of our tracker. The second component is Lagrange dual decomposition, which combines the structured learning tracker with a segmentation algorithm. For segmentation, multi-label Conditional Random Field (CRF) is applied to a superpixel based spatio-temporal graph in a segment of video, in order to assign background or target labels to every superpixel. We show how the multi-label CRF is combined with the structured learning tracker through our dual decomposition formulation. This leads to more accurate segmentation results and also helps better resolve typical difficulties in multiple target tracking, such as occlusion handling, ID-switch and track drifting. The experiments on diverse and challenging sequences show that our method achieves superior results compared to competitive approaches for detection, multiple target tracking as well as segmentation.
2011
This paper describes a framework for simultaneous identification and tracking of moving targets in random media. Video and IR thermal sensors are used to obtain the target signature. Classical Kalman filtering methods are implemented on targets with unknown trajectories. Computer vision methodologies are proposed to design a smart interceptor which identifies the targets based on shape and thermal signatures. The paper also describes a platform for basic studies in tracking of targets using vision-guided robotics. The system enables multiple object tracking and recognition.
IEEE Transactions on Circuits and Systems for Video Technology, 2006
For applications such as behavior recognition it is important to maintain the identity of multiple targets, while tracking them in the presence of splits and merges, or occlusion of the targets by background obstacles. Here we propose an algorithm to handle multiple splits and merges of objects based on dynamic programming and a new geometric shape matching measure. We then cooperatively combine Kalman filter-based motion and shape tracking with the efficient and novel geometric shape matching algorithm. The system is fully automatic and requires no manual input of any kind for initialization of tracking. The target track initialization problem is formulated as computation of shortest paths in a directed and attributed graph using Dijkstra's shortest path algorithm. This scheme correctly initializes multiple target tracks for tracking even in the presence of clutter and segmentation errors which may occur in detecting a target. We present results on a large number of real world image sequences, where upto 17 objects have been tracked simultaneously in real-time, despite clutter, splits, and merges in measurements of objects. The complete tracking system including segmentation of moving objects works at 25 Hz on 352 288 pixel color image sequences on a 2.8-GHz Pentium-4 workstation.
2008
Non-intrusive video-detection for traffic flow observation and surveillance is the primary alternative to conventional inductive loop detectors. Video Image Detection Systems (VIDS) can derive traffic parameters by means of image processing and pattern recognition methods. Existing VIDS emulate the inductive loops. We propose a trajectory based recognition algorithm to expand the common approach and to obtain new types of information (e.g. queue length or erratic movements).
Data association is an essential component of the human detection and tracking system. The majority of the existing methods, such as Bi-partite matching and GMCP methods are incorporated the limited-temporal-locality of the sequence into data association problem. GMCP tracker is considered as an important complete representation of the tracking problem, where all pair wise relationships between the detections in temporal span of a video is considered and makes the input to the data association as a complete Bi-partite graph. In Bi-partite graph a track of a person will form a clique (a subgraph in which all the nodes are connected to each other). A cost is assigned to each clique and it maximizes the score function, which is selected as the best clique (track), but it is sub-optimal. GMCP tracker does not follow the joint optimization for all the tracks simultaneously and finds the tracks one by one which makes difficulties caused by cluttered background, and crowded scenes to detect and tracking Tracking-by-detection methods are used to track multiple targets with unified handling of complex scenarios, where current detection responses are linked to the previous trajectories. By adding the standard Hungarian algorithm, dummy nodes to each trajectory to allow nodes to temporally disappear and solve the data association implicitly in a global manner even though it is formulated between two consecutive frames. If a trajectory fails to find its matching detection, it is linked to its corresponding dummy nodes until its emergence of matching detection. The source nodes are also incorporated into the account of new targets. The dummy nodes tend to accumulate in fake or disappeared trajectories while they occasionally appear in real trajectories and improve detection inevitable failures, which include the miss detection, the false detection and the occlusion, where an object is partially or fully invisible because of the limited camera view. Extended hybrid Hungarian algorithm is relatively better when compared with GMCP and Hybrid Hungarian algorithm in accuracy. Experiments show that the proposed method makes significant improvement in tracking and detection of different length of videos, specifically with short length videos.
2007 Ieee International Symposium on Industrial Electronics, Proceedings, Vols 1-8, 2007
In this paper one of the most important solutions in position estimation is used in conjunction with a data association algorithm in order to achieve a multi-tracking application. A Kalman Filter is extended and adapted in order to track the position and speed of a variable number of objects in an unstructured and complex environment. Both the developed algorithms and the results obtained with their real-time execution implementation in the mentioned application are described, and interesting conclusions extracted from these experiments are remarked in the paper. Finally, tracking results of the proposed algorithm are compared with another multi-object estimator based on a Particle Filter previously developed by the authors. I.
In this paper, we study the problem of joint tracking and classification of several targets at the same time. Targets are considered to be known and sufficiently separated so that they cannot be confused. Our goal is to propose a full methodology that is robust to missing information. The classical probabilistic approach with Bayesian tools is improved with belief functions. A simulation concerning the identification of go fast boats in a piracy problem shows that our approach improves previous results.
IEE Seminar on Target Tracking: Algorithms and Applications, 2006
This paper presents a particle filtering algorithm for multiple object tracking. The proposed particle filter (PF) embeds a data association technique based on the joint probabilistic data association (JPDA) which handles the uncertainty of the measurement origin. I. Introduction Tracking a group of targets in video sequences has many surveillance applications. It has been used for security monitoring (1), (2),
2012
Multiple target tracking that integrates target model estimation and data association steps is described. The integration allows successive refinement of the models while reducing the uncertainty in data association. Each target is described by "weak" models of kinematics, shape and appearance. The target models are refined in a two-stage process: imagebased tracklets of high purity and accuracy are generated, and geospatial tracks are extended from these tracklets. During each stage of tracking, observation data of reduced uncertainties are associated with the refined tracks in a probabilistic manner. We describe our approach in the context of a real time system that has been tested and evaluated for vehicle and human tracking in sparse, medium, and dense clutter using aerial EO/IR video.
2017
In this paper, we consider multi-object target tracking using video reference datasets. Our objective is detection of the target using a novel adaboost and Gentle Boost method in order to track the subjects from reference data sets. Multi-target tracking is still challenging topic which is used to find the same object across different camera views and also used to find the location and sizes of different object at different places. Furthermore extensive performance analysis of the three main parts demonstrates usefulness of multi object tracking. We carried out experiment to analyze discriminative power of nine features (HSV, LBP, HOG extracted on body, torso and legs) used in the appearance model for multicam dataset. For each features the RMSE and PSNR obtained.
Lecture Notes in Computer Science, 2006
In this paper, we propose a maximum a posteriori formulation to the multiple target tracking problem. We adopt a graph representation for storing the detected regions as well as their association over time. The multiple target tracking problem is formulated as a multiple paths search in the graph. Due to the noisy foreground segmentation, an object may be represented by several foreground regions and one foreground region may corresponds to multiple objects. We introduce merge, split and mean shift operations that add new hypothesis to the measurement graph in order to be able to aggregate, split detected blobs or re-acquire objects that have not been detected during stop-and-gomotion. To make full use of the visual observations, we consider both motion and appearance likelihood. Experiments have been conducted on both indoor and outdoor data sets, and a comparison has been carried to assess the contribution of the new tracker.
2010 13th International Conference on Information Fusion, 2010
With the evolution and fusion of technologies from sensor networks and embedded cameras, smart camera networks are emerging as useful and powerful systems. Wireless networks, however, introduce new constraints of limited bandwidth, computation, and power. Existing camera network approaches for target tracking either utilize target handover mechanisms between cameras, or combine results from 2D trackers into 3D target state for continuous tracking. Such approaches suffer from the drawbacks associated with 2D tracking, such as scale selection, target rotation, and occlusion. In this paper, we present an approach for tracking multiple targets in 3D space using a wireless network of smart cameras. In our approach, we use multiview histograms in different feature-spaces to characterize targets in 3D space. We employ color and texture as the visual features to model targets. The visual features from each camera, along with the target models are used in a probabilistic tracker to estimate the target state. We demonstrate the effectiveness of our proposed tracker with results tracking people using a camera network deployed in a building.
16th IPPR Conference on CVGIP, 2003
Moving object tracking has been an important research topic in numerous applications. Most papers adopted estimation-based approach to achieve moving object tracking. In this paper, an effective tracking approach using contour and texture features is proposed. Detail analysis of the proposed features for tracking is elaborated. A tracking scheme by classifying the features with backpropagation neural network is invented. The proposed approach is verified through a set of experimental image sequences. 1310 moving objects, including 486 cars and 824 humans, can be effectively classified with 99.8% recognition rate.
ICTACT Journal on Image and Video Processing, 2017
2018
Most multiple object tracking algorithms relying on a single view have failed to follow the trajectories of targets when they have been completely hidden by obstacles. In this paper, we introduce a novel method of collaborative tracking in a synchronized overlapping cameras network. We propose an efficient target association method between cameras based on the tracking results of each target on each view. Our framework naturally handles obstacle occlusions and mutual target occlusions. We implemented our multiple object tracking algorithm by Decision Making algorithm [30] on each view. The tracking outcomes on each camera are collected and associated into targets. The feedback from the central association helps the individual cameras in tracking hidden targets, even in the case of complete occlusion. We use the standard MOT metric to validate our method. The experimental results on each view show that the multiple view tracking system outperforms the single view ones. The source cod...
Pattern Recognition Letters, 2010
Visual detection and tracking are interdisciplinary tasks which are oriented at estimating the state of one or multiple moving objects in a video sequence. This is one of the first tasks in processing video systems which try to describe human behaviour in different contexts, such as video-surveillance, sport technique analysis. This work presents a multiple object tracking system which properly hybridizes particle filters and memetic algorithms to produce a more reliable and efficient tracking algorithm. The system has been tested on synthetic and real image sequences, with the aim of describing their performance for different levels of noise, occlusions, a variable number of objects, etc. Experimental results demonstrate that the proposed system accurately tracks multiple objects in the scene, by grouping and ungrouping them when necessary, while keeping their identities during the sequence of images. Moreover, the performance of the proposed system is not strongly affected by the increase in the number of objects, maintaining computational load and precision in proper balance.
IEEE Signal Processing Magazine, 2002
We outline a framework for collaborative signal processing in distributed sensor networks. The ideas are presented in the context of tracking multiple moving objects in a sensor field. The key steps involved in the tracking procedure include event detection, target classification, and estimation and prediction of target location. Algorithms for various tasks are discussed with an emphasis on classification. Results based on experiments with real data are reported which provide useful insights into the essential nature of the problems. Issues, challenges and directions for future research are identified.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.