2010 IEEE International Symposium on Mixed and Augmented Reality, 2010
We propose a real-time solution for modeling and tracking multiple 3D objects in unknown environments. Our contribution is twofold: First, we show how to scale with the number of objects. This is done by combining recent techniques for image retrieval and online Structure from Motion, which can be run in parallel. As a result, tracking 40 objects in 3D can be done within 6 to 25 milliseconds per frame, even under difficult conditions for tracking. Second, we propose a method to let the user add new objects very quickly. The user simply selects, in an image, a 2D region lying on the object. A 3D primitive is then fitted to the features within this region and adjusted to create the object's 3D model. In practice, this procedure takes less than a minute.
Computers & Graphics, 2012
We propose a real-time solution for modeling and tracking multiple 3D objects in unknown environments for Augmented Reality. The proposed solution consists of both scalable tracking and interactive modeling. Our contribution is twofold: First, we show how to scale with the number of objects using keyframes. This is done by combining recent techniques for image retrieval and online Structure from Motion, which can be run in parallel. As a result, tracking 50 objects in 3D can be done within 6-35 ms per frame, even under difficult conditions for tracking. Second, we propose a method to let the user add new objects very quickly. The user simply selects, in an image, a 2D region lying on the object. A 3D primitive is then fitted to the features within this region and adjusted to create the object's 3D model. We demonstrate the modeling of polygonal and circular-based objects. In practice, this procedure takes less than a minute.
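The primitive-fitting step described above lends itself to a compact illustration. The following is a minimal sketch, with our own function names and thresholds, of one ingredient of such a pipeline: robustly fitting a plane to the reconstructed 3D feature points that fall inside the user-selected region. The paper's actual fitting and adjustment procedure, which also handles circular-based primitives, is more involved.

```python
import numpy as np

def fit_plane_ransac(points, n_iters=200, inlier_thresh=0.01, seed=None):
    """Robustly fit a plane n.x + d = 0 to an (N, 3) array of 3D points."""
    rng = np.random.default_rng(seed)
    best_plane, best_count = None, -1
    for _ in range(n_iters):
        # Sample three distinct points and form a candidate plane.
        p0, p1, p2 = points[rng.choice(len(points), size=3, replace=False)]
        normal = np.cross(p1 - p0, p2 - p0)
        norm = np.linalg.norm(normal)
        if norm < 1e-9:                      # degenerate (collinear) sample
            continue
        normal /= norm
        d = -normal.dot(p0)
        # Count points within the distance threshold of the candidate plane.
        inliers = np.abs(points @ normal + d) < inlier_thresh
        if inliers.sum() > best_count:
            best_count, best_plane = inliers.sum(), (normal, d)
    return best_plane
```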
2012 IEEE International Conference on Multimedia and Expo, 2012
This paper presents a flexible and easy-to-use tracking method for 3D interaction. The method reconstructs points of a user-specified object from a video sequence and recovers the 6-degree-of-freedom (DOF) camera pose relative to the reconstructed points in each video frame. As opposed to most existing 3D object tracking methods, the proposed method does not need any offline modeling or training process. Instead, it first segments the object from the background, then reconstructs and tracks the object using Visual Simultaneous Localization And Mapping (VSLAM) techniques. To our knowledge, no existing works investigate this kind of online reconstruction and tracking of moving objects. The proposed method employs an adapted pyramidal Lucas-Kanade tracker to increase the stability and robustness of the tracking when dealing with a lightly textured or fast-moving object. Experiments show that fast, accurate, stable, and robust tracking can be achieved in everyday environments. Moreover, a simple stereo initialization approach is adopted to minimize user intervention. All these attributes combine to make the method an adequate tool for interaction applications. As a concrete example, an interactive 3D scene displaying system is demonstrated.
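As a concrete reference point for the tracking component, the snippet below shows standard pyramidal Lucas-Kanade tracking with OpenCV, plus a forward-backward consistency check, which is one common way to improve stability on lightly textured or fast-moving objects. This is a generic sketch, not the paper's specific adaptation; the window size, pyramid depth, and error threshold are our own choices.

```python
import cv2
import numpy as np

# Pyramidal LK parameters: a deeper pyramid tolerates faster motion, and the
# termination criterion bounds the per-level iteration cost.
lk_params = dict(winSize=(21, 21),
                 maxLevel=3,
                 criteria=(cv2.TERM_CRITERIA_EPS | cv2.TERM_CRITERIA_COUNT, 30, 0.01))

def track_points(prev_gray, next_gray, prev_pts):
    """Track (N, 1, 2) float32 points between frames; keep only points that
    survive a forward-backward consistency check."""
    next_pts, status, _ = cv2.calcOpticalFlowPyrLK(
        prev_gray, next_gray, prev_pts, None, **lk_params)
    back_pts, back_status, _ = cv2.calcOpticalFlowPyrLK(
        next_gray, prev_gray, next_pts, None, **lk_params)
    fb_err = np.linalg.norm(prev_pts - back_pts, axis=2).ravel()
    good = (status.ravel() == 1) & (back_status.ravel() == 1) & (fb_err < 1.0)
    return next_pts[good], good
```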
IEEE Computer Graphics and Applications, 2002
In research on 3D image communications and virtual reality, developing techniques for synthesizing arbitrary views has become an important technical issue. Given an object's structural model (such as a polygon or volume model), it's relatively easy to synthesize arbitrary views. Generating a structural model of an object, however, isn't necessarily easy. For this reason, research has been progressing on a technique called image-based modeling and rendering (IBMR) that avoids this problem. To date, researchers have performed studies on various IBMR techniques. (See the "Related Work" sidebar for more specific information.) Our work targets 3D scenes in motion. In this article, we propose a method for view-dependent layered representation of 3D dynamic scenes. Using densely arranged cameras, we've developed a system that can perform processing in real time from image pickup to interactive display, using video sequences instead of static images, at 10 frames per second. In our system, images on layers are view dependent, and we update both the shape and image of each layer in real time. This lets us use the dynamic layers as the coarse structure of the dynamic 3D scenes, which improves the quality of the synthesized images. In this sense, our prototype system may be one of the first full real-time IBMR systems. Our experimental results show that this method is useful for interactive 3D rendering of real scenes.
Journal of Real-Time Image Processing, 2007
This work presents a system for the generation of a free-form surface model from video sequences. Although any single-centered camera can be used with the proposed system, the approach is demonstrated using fish-eye lenses because of their good properties for tracking. The system is designed to function automatically and to be flexible with respect to the size and shape of the reconstructed scene. To minimize geometric assumptions, a statistical fusion of dense depth maps is utilized. Special attention is paid to the necessary rectification of the spherical images and the resulting iso-disparity surfaces, which can be exploited in the fusion approach. Before dense depth estimation can be performed, the cameras' pose parameters are extracted by means of a Structure-from-Motion (SfM) scheme. In this respect, automation of the system is achieved by a thorough decision model based on robust statistics and error propagation of projective measurement uncertainties. This leads to a scene-independent set of only a few parameters. All system components are formulated in a general way, making it possible to cope with any single-centered projection model, in particular with spherical cameras. By using wide field-of-view cameras, the presented system is able to reliably retrieve poses and consistently reconstruct large scenes. A textured triangle mesh, constructed on the basis of the scene's reconstructed depth, makes the system's results suitable to serve as reference models in a GPU-driven analysis-by-synthesis framework for real-time tracking.
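To make the fusion idea concrete, here is a minimal sketch, under our own simplifying assumptions, of per-pixel inverse-variance fusion of registered depth maps. The system described above goes further, exploiting the iso-disparity surfaces of the rectified spherical images.

```python
import numpy as np

def fuse_depth_maps(depths, variances):
    """Per-pixel inverse-variance weighted fusion of registered depth maps.

    depths, variances: arrays of shape (K, H, W); NaN marks missing estimates.
    Returns the fused depth map and its per-pixel variance.
    """
    valid = ~np.isnan(depths)
    w = np.where(valid, 1.0 / variances, 0.0)        # inverse-variance weights
    w_sum = w.sum(axis=0)
    safe = np.maximum(w_sum, 1e-12)                  # avoid division by zero
    fused = np.where(w_sum > 0, np.nansum(w * depths, axis=0) / safe, np.nan)
    fused_var = np.where(w_sum > 0, 1.0 / safe, np.nan)
    return fused, fused_var
```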
Image and Vision Computing, 2010
This paper adds to the abundant visual tracking literature with two main contributions. First, we demonstrate the benefit of using graphics processing units (GPUs) to support efficient implementations of computer vision algorithms; second, we introduce the use of point-based 3D models as a shape prior for real-time 3D tracking with a monocular camera.
IEEE Transactions on Visualization and Computer Graphics, 2011
We present a method that is able to track several 3D objects simultaneously, robustly, and accurately in real time. While many applications need to consider more than one object in practice, the existing methods for single object tracking do not scale well with the number of objects, and a proper way to deal with several objects is required. Our method combines object detection and tracking: frame-to-frame tracking is less computationally demanding but is prone to fail, while detection is more robust but slower. We show how to combine them to take advantage of both approaches, and we demonstrate our method on several real sequences.
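The detection/tracking combination admits a simple control-flow sketch. The loop below is a generic illustration, not the paper's exact scheme; the tracker and detector interfaces and the inlier threshold are hypothetical.

```python
MIN_INLIERS = 20   # hypothetical threshold on surviving correspondences

def process_frame(frame, pose, tracker, detector):
    """Return the new pose estimate, or None if the object is lost.

    `tracker` and `detector` are hypothetical interfaces: tracker.track does
    fast frame-to-frame tracking that may drift or fail; detector.detect does
    slower keypoint-based detection that needs no pose prior.
    """
    if pose is not None:
        new_pose, n_inliers = tracker.track(frame, pose)
        if n_inliers >= MIN_INLIERS:        # tracking still healthy: keep it
            return new_pose
    detection = detector.detect(frame)      # robust fallback / (re)initialization
    return detection.pose if detection is not None else None
```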
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
We consider the problem of tracking multiple interacting objects in 3D, using RGBD input and by considering a hypothesize-and-test approach. Due to their interaction, objects to be tracked are expected to occlude each other in the field of view of the camera observing them. A naive approach would be to employ a Set of Independent Trackers (SIT) and to assign one tracker to each object. This approach scales well with the number of objects but fails as occlusions become stronger due to their disjoint consideration. The solution representing the current state of the art employs a single Joint Tracker (JT) that accounts for all objects simultaneously. This directly resolves ambiguities due to occlusions but has a computational complexity that grows geometrically with the number of tracked objects. We propose a middle ground, namely an Ensemble of Collaborative Trackers (ECT), that combines best traits from both worlds to deliver a practical and accurate solution to the multi-object 3D tracking problem. We present quantitative and qualitative experiments with several synthetic and real world sequences of diverse complexity. Experiments demonstrate that ECT manages to track far more complex scenes than JT at a computational time that is only slightly larger than that of SIT.
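The scaling argument can be made concrete with a back-of-the-envelope count. Under the simplifying assumption, ours rather than the paper's, that each tracker evaluates a fixed budget of H pose hypotheses per object, the evaluated-hypothesis counts diverge quickly:

```python
# Toy illustration (our own simplification) of how the number of evaluated
# hypotheses scales with the number of tracked objects N.
H = 64                       # hypothetical per-object hypothesis budget
for N in (1, 2, 3, 4):
    sit = N * H              # independent trackers: linear in N
    jt = H ** N              # joint tracker over the joint pose space: geometric in N
    print(f"N={N}: SIT evaluates {sit:>10,} hypotheses, JT {jt:>12,}")
```

ECT aims at roughly the first curve's cost while keeping much of the joint tracker's robustness, by letting per-object trackers exchange information instead of searching the joint space.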
Lecture Notes in Computer Science, 1998
We present some recent progress in designing and implementing two interactive image-based 3D modeling systems. The first system constructs 3D models from a collection of panoramic image mosaics. A panoramic mosaic consists of a set of images taken around the same viewpoint, and a camera matrix associated with each input image. The user first interactively specifies features such as points, lines, and planes. Our system recovers the camera pose for each mosaic from known line directions and reference points. It then constructs the 3D model using all available geometrical constraints. The second system extracts structure from stereo by representing the scene as a collection of approximately planar layers. The user first interactively segments the images into corresponding planar regions. Our system recovers a composite mosaic for each layer, estimates the plane equation for the layer, and optionally recovers the camera locations as well as out-of-plane displacements. By taking advantage of known scene regularities, our interactive systems avoid difficult feature correspondence problems that occur in traditional automatic modeling systems. They also shift the interactive high-level structural model specification stage to precede (or intermix with) the 3D geometry recovery. They are thus able to extract accurate wire frame and texture-mapped 3D models from multiple image sequences.
A system for the automatic reconstruction of real world objects from multiple uncalibrated camera views is presented. The camera position and orientation for all views, the 3-D shape of the rigid object as well as associated color information are recovered from the image sequence. The system proceeds in four steps. First, the internal camera parameters describing the imaging geometry of the camera are calibrated using a reference object. Second, an initial 3-D description of the object is computed from two views. This model information is then used in a third step to estimate the camera positions for all available views using a novel linear 3-D motion and shape estimation algorithm. The main feature of this third step is the simultaneous estimation of 3-D camera motion parameters and object shape refinement with respect to the initial 3-D model. The initial 3-D shape model exhibits only a few degrees of freedom and the object shape refinement is defined as flexible deformation of the initial shape model. Our formulation of the shape deformation allows the object texture to slide on the surface, which differs from traditional flexible body modeling. This novel combined shape and motion estimation using sliding texture considerably improves the calibration data of the individual views in comparison to fixed-shape model-based camera motion estimation. Since the shape model used for model-based camera motion estimation is approximate only, a volumetric 3-D reconstruction process is initiated in the fourth step that combines the information from all views simultaneously. The recovered object consists of a set of voxels with associated color information that describe even fine structures and details of the object. New views of the object can be rendered from the recovered 3-D model, which has potential applications in virtual reality or multimedia systems and the emerging field of video coding using 3-D scene models.
Pattern Recognition Letters, 1998
A method of tracking multiple objects of known geometry using multiple cameras is proposed. Our approach differs from the previous approaches in that the object geometry is tightly integrated into the tracking process. The major contribution is threefold: Firstly, multiple cameras are used to improve the accuracy of the estimated posture parameters. Additional formalism required by considering multiple images is nicely integrated into the tracking model, and is handled effectively. Secondly, the feature tracking is facilitated by integrating the measurement and dynamic models into the matching process, thereby improving the accuracy and robustness of the feature correspondence. Thirdly, ambiguities that may arise in the course of the feature matching are resolved by the statistical analysis and the visibility test. The entire process from the image sequence to the posture parameters has been completely automated into a single, seamless process, and has been extensively tested on synthetic and real images.
In this paper we present a framework for the estimation of the pose of an object in 3D space: from the detection and subsequent recognition from a 3D point-cloud, to tracking in the 2D camera plane. The detection process proposes a way to remove redundant features, which leads to significant computational savings without affecting identification performance. The tracking process introduces a method that is less sensitive to outliers and is able to perform in soft real-time. We present preliminary results that illustrate the effectiveness of the approach both in terms of accuracy and computational speed.
International Journal of Advance Research, Ideas and Innovations in Technology
The dimensional analysis of an object from an image removes much of the burden on the user compared with the traditional measuring-tape method. The recovered dimensions also make it easier to reconstruct a 3D model of the real object, although that step is not used in the current implementation. Dimensional analysis can also be helpful in online shopping, where the user is not available for a fitting; a 3D model replaces the fitting stage. Once the dimensions of an object's surface are found, it is easy to calculate surface areas, and from surface areas, volume, although the calculation of volume requires more than one dimension of the object. In this paper, an approach is used that relies on a reference object whose real-world dimensions are already known. The whole process is divided into three tasks: object detection using the SURF algorithm, dimensional analysis of the 2D object using a pixels-per-metric ratio, given a reference object in the same plane, and 3D reconstruction using the Structure-from-Motion algorithm.
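The pixels-per-metric idea in the second task can be sketched in a few lines. The snippet below is a simplified, hypothetical version: it assumes a binarized image in which the leftmost contour is the reference object of known width, and it uses plain OpenCV contour calls rather than the paper's full SURF-based detection.

```python
import cv2

REFERENCE_WIDTH_MM = 24.0        # hypothetical reference object, e.g. a coin

def measure_widths(binary_image):
    """Return the widths, in millimetres, of all objects in a binary image,
    scaled by a reference object assumed to be the leftmost contour."""
    contours, _ = cv2.findContours(binary_image, cv2.RETR_EXTERNAL,
                                   cv2.CHAIN_APPROX_SIMPLE)
    # Sort left to right so the reference object comes first.
    contours = sorted(contours, key=lambda c: cv2.boundingRect(c)[0])
    # Use the longer side of each rotated bounding rectangle as the width.
    widths_px = [max(cv2.minAreaRect(c)[1]) for c in contours]
    pixels_per_mm = widths_px[0] / REFERENCE_WIDTH_MM
    return [w / pixels_per_mm for w in widths_px]
```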
IEEJ Transactions on Electronics, Information and Systems, 2013
In this paper we propose a method for detecting 3D keypoints in a 3D point cloud for robust real-time camera tracking. Assuming that there are a number of images corresponding to the 3D point cloud, we define a 3D keypoint as a point that has corresponding 2D keypoints in many images. These 3D keypoints are expected to appear with high probability as 2D keypoints in newly taken query images. For 3D-2D matching, we embed 2D feature descriptors into the 3D keypoints. Experimental results with 3D point clouds of indoor and outdoor scenes show that the extracted 3D keypoints can be used for matching with 2D keypoints in query images.
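The selection criterion has a very small core, sketched below with our own naming: count, for each 3D point, in how many registered images it appears as a 2D keypoint, and keep the points seen often enough.

```python
from collections import Counter

def select_3d_keypoints(observations, min_views=10):
    """observations: iterable of (point3d_id, image_id) correspondences.
    A 3D point qualifies as a keypoint if it is observed as a 2D keypoint
    in at least `min_views` images (threshold is our own choice)."""
    views_per_point = Counter(pid for pid, _ in observations)
    return [pid for pid, n in views_per_point.items() if n >= min_views]
```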
EURASIP Journal on Image and Video Processing, 2017
Accurate 3D measuring systems have thrived in the past few years. Most are based on laser scanners, because laser scanners can acquire 3D information directly and precisely in real time. However, compared to conventional cameras, such equipment is usually expensive and not commonly available to consumers; moreover, laser scanners easily interfere with other sensors of the same type. Computer-vision-based 3D measuring techniques, on the other hand, use stereo matching to recover the cameras' relative position and then estimate the 3D location of points in the image. Because such systems need this additional estimation of 3D information, real-time implementations often rely on heavy parallelism, which prevents deployment on mobile devices. Inspired by structure-from-motion systems, we propose a system that reconstructs sparse feature points into a 3D point cloud from a monocular video sequence so as to achieve higher computational efficiency. The system keeps tracking all detected feature points and computes both the number of these feature points and their moving distances. We use only the keyframes to estimate the current position of the camera, in order to reduce the computational load and the noise interference on the system. Furthermore, to avoid duplicate 3D points, the system reconstructs a 2D point only when the point shifts out of the camera's view boundary. In our experiments, we show that our system can be implemented on tablets and achieves state-of-the-art accuracy with a denser point cloud at high speed.
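A minimal version of the keyframe test implied above, with hypothetical thresholds of our own choosing, could look like this: a new keyframe is triggered either because too few features are still tracked or because the tracked features have moved far enough to provide fresh parallax.

```python
import numpy as np

MIN_TRACKED = 100        # hypothetical: below this, tracking is getting thin
MIN_MEAN_SHIFT = 30.0    # hypothetical: pixels of motion worth a new keyframe

def is_new_keyframe(pts_now, pts_at_last_kf, n_tracked):
    """pts_now, pts_at_last_kf: (N, 2) arrays of corresponding image points."""
    mean_shift = np.mean(np.linalg.norm(pts_now - pts_at_last_kf, axis=1))
    return n_tracked < MIN_TRACKED or mean_shift > MIN_MEAN_SHIFT
```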
Real-Time Imaging, 1999
A model-based tracking algorithm, such as those of Worrall [1], Gennery [2], Koller [3], Harris [4], Schneiderman [5], and Lowe, consists of the following repeated stages for each frame of the captured image sequence: (1) matching between a known projected model and the image features;
Proceedings 1999 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human and Environment Friendly Robots with High Intelligence and Emotional Quotients (Cat. No.99CH36289), 1999
Joao P. Barreto, Paulo Peixoto, Jorge Batista, and Helder Araujo, 1999 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'99): Human and Environment Friendly Robots with High Intelligence and Emotional Quotients, pp. 210-215, 1999.
2008 Canadian Conference on Computer and Robot Vision, 2008
Methods for reconstructing photorealistic 3D graphics models from images or video are appealing applications of computer vision. Such methods rely on good input image data, but the lack of user feedback during image acquisition often leads to incomplete or poorly sampled reconstruction results. We describe a video-based system that constructs and visualizes a coarse graphics model in real time and automatically saves a set of images appropriate for later offline dense reconstruction. Visualization of the model during image acquisition allows the operator to interactively verify that an adequate set of input images has been collected for the modeling task, while automatic image selection keeps storage requirements to a minimum. Our implementation uses real-time monocular SLAM to compute and continuously extend a 3D model, and augments this with keyframe selection for storage, surface modeling, and online rendering of the current structure textured from a selection of keyframes. This rendering gives an immediate and intuitive view both of the geometry and of whether suitable texture viewpoints have already been captured.
2007 IEEE Instrumentation & Measurement Technology Conference IMTC 2007, 2007
In this paper, an algorithm is presented for processing visual data to obtain relevant information that is afterwards used to track the different moving objects in complex indoor environments. In autonomous robot applications, visual detection of the obstacles in a dynamic environment from a mobile platform is a complicated task, and the robustness of this process is fundamental to tracking and navigation reliability. The solution described in this document is based on a stereo-vision system, so that 3D information on each object's position in the robot's local environment is extracted directly from the cameras. In the proposed application, all objects in the robot's local environment, both dynamic and static, except the structure of the environment itself, are considered obstacles. This requires distinguishing building elements (ceiling, walls, columns, and so on) from the remaining items in the robot's surroundings, so a classification step is developed alongside the detection task. The obtained data can also be used to implement a partial reconstruction of the environmental structure surrounding the robot. All these algorithms are explained in detail in the following paragraphs, and visual results are included at the end of the paper.
The use of visual sensors may have high impact in applications where it is required to measure the pose (position and orientation) and the visual features of objects moving in unstructured environments. In robotics, the measurements provided by video cameras can be directly used to perform closed-loop control of the robot end-effector pose. In this chapter the problem of real-time estimation of the position and orientation of a moving object using a fixed stereo camera system is considered. An approach based on the Extended Kalman Filter (EKF), combined with a 3D representation of the object's geometry based on Binary Space Partition (BSP) trees, is illustrated. The performance of the proposed visual tracking algorithm is experimentally tested in the case of an object moving in the visible space of a fixed stereo camera system.
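For reference, the EKF machinery at the core of such a tracker is compact. The sketch below is a generic predict/update cycle in NumPy, not the chapter's specific BSP-tree-based formulation; the motion and measurement models f and h (with Jacobians F and H) would encode the object dynamics and the stereo camera projection.

```python
import numpy as np

def ekf_step(x, P, z, f, F, h, H, Q, R):
    """One EKF cycle. f/h are the motion and measurement models, F/H their
    Jacobians evaluated at the current estimate, Q/R the noise covariances."""
    # Predict: propagate the state and covariance through the motion model.
    x_pred = f(x)
    Fx = F(x)
    P_pred = Fx @ P @ Fx.T + Q
    # Update: correct the prediction with the (stereo) measurement z.
    Hx = H(x_pred)
    y = z - h(x_pred)                        # innovation
    S = Hx @ P_pred @ Hx.T + R               # innovation covariance
    K = P_pred @ Hx.T @ np.linalg.inv(S)     # Kalman gain
    x_new = x_pred + K @ y
    P_new = (np.eye(len(x)) - K @ Hx) @ P_pred
    return x_new, P_new
```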