TraceForge

TraceForge is a unified dataset pipeline that converts cross-embodiment videos into consistent 3D traces via camera motion compensation and speed retargeting.

Project Website: tracegen.github.io
arXiv: 2511.21690

TraceForge Overview

Installation

1. Create a conda environment

conda create -n traceforge python=3.11
conda activate traceforge

2. Install dependencies

The setup script installs PyTorch 2.8.0 (CUDA 12.8) and all other required packages.

bash setup_env.sh
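
To sanity-check the environment afterwards, a quick Python check (not part of the repo) should report the versions pinned by setup_env.sh:

import torch

# Expect the versions pinned by setup_env.sh
print(torch.__version__)          # e.g. 2.8.0+cu128
print(torch.cuda.is_available())  # True on a machine with a working CUDA 12.8 setup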

3. Download checkpoints

Download the TAPIP3D model checkpoint:

mkdir -p checkpoints
wget -O checkpoints/tapip3d_final.pth https://huggingface.co/zbww/tapip3d/resolve/main/tapip3d_final.pth
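
To verify the file downloaded intact, you can try deserializing it (a minimal check; the checkpoint's internal layout is not documented here, so we only confirm it loads):

import torch

# weights_only=False because research checkpoints often pickle extra metadata
ckpt = torch.load("checkpoints/tapip3d_final.pth", map_location="cpu", weights_only=False)
print(type(ckpt))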

Usage

Prepare videos

Case A: videos directly in the input folder

<input_video_directory>/
├── 1.webm
├── 2.webm
└── ...
  • Use --scan_depth 0 because the videos are already in the root folder.

Case B: one subfolder per video containing extracted frames

<input_video_directory>/
├── <video_name_1>/
│   ├── 000000.png
│   ├── 000001.png
│   └── ...
├── <video_name_2>/
│   ├── 000000.png
│   └── ...
└── ...
  • Use --scan_depth 1 so TraceForge scans one level down to reach each video’s frames.

Case C: two-level layout (per-video folder with an images/ subfolder)

<input_video_directory>/
├── <video_name_1>/
│   └── images/
│       ├── 000000.png
│       ├── 000001.png
│       └── ...
├── <video_name_2>/
│   └── images/
│       ├── 000000.png
│       └── ...
└── ...
  • Use --scan_depth 2 to search two levels down for the image frames.
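
All three cases reduce to descending a fixed number of directory levels below the input root before collecting inputs. The sketch below illustrates that logic; it is not the repo's implementation, and the function name and extension filters are assumptions:

from pathlib import Path

VIDEO_EXTS = {".webm", ".mp4"}   # assumed video extensions
FRAME_EXTS = {".png", ".jpg"}    # assumed frame extensions

def collect_inputs(root: str, scan_depth: int) -> list[Path]:
    """Return video files (scan_depth 0) or frame folders (scan_depth >= 1)."""
    base = Path(root)
    if scan_depth == 0:
        # Case A: videos sit directly in the input folder
        return sorted(p for p in base.iterdir() if p.suffix.lower() in VIDEO_EXTS)
    # Cases B/C: descend scan_depth levels, keep directories that contain frames
    pattern = "/".join(["*"] * scan_depth)
    return sorted(
        d for d in base.glob(pattern)
        if d.is_dir() and any(f.suffix.lower() in FRAME_EXTS for f in d.iterdir())
    )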

Quick test dataset

  • Download a small sample dataset and unpack it under data/test_dataset:
    pip install gdown  # if not installed
    mkdir -p data
    gdown --fuzzy "https://drive.google.com/file/d/1Vn1FNbthz-K8o2ijq9V7jYv10rElWuUd/view?usp=sharing" -O data/test_dataset.tar
    tar -xf data/test_dataset.tar -C data
  • The downloaded data follows the Case B layout above; run inference with
    python infer.py \
        --video_path data/test_dataset \
        --out_dir <output_directory> \
        --batch_process \
        --use_all_trajectories \
        --skip_existing \
        --frame_drop_rate 5 \
        --scan_depth 1

Running Inference

python infer.py \
    --video_path <input_video_directory> \
    --out_dir <output_directory> \
    --batch_process \
    --use_all_trajectories \
    --skip_existing \
    --frame_drop_rate 5 \
    --scan_depth 2

Options

Argument                 Description                                   Default
--video_path             Path to video directory                       Required
--out_dir                Output directory                              outputs
--batch_process          Process all video folders in the directory    False
--skip_existing          Skip if output already exists                 False
--frame_drop_rate        Query points every N frames                   1
--scan_depth             Directory levels to scan for subfolders       2
--fps                    Frame sampling stride (0 for auto)            1
--max_frames_per_video   Target max frames per episode                 50
--future_len             Tracking window length per query frame        128
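
For example, with --frame_drop_rate 5 query points are initialized every fifth frame, which is why the per-sample outputs in the next section are named <video_name>_0, <video_name>_5, and so on (a small illustration of that indexing):

num_frames = 50                     # e.g. max_frames_per_video
frame_drop_rate = 5
query_frames = list(range(0, num_frames, frame_drop_rate))
print(query_frames)                 # [0, 5, 10, ..., 45]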

Output Structure

<output_dir>/
└── <video_name>/
    ├── images/
    │   ├── <video_name>_0.png
    │   ├── <video_name>_5.png
    │   └── ...
    ├── depth/
    │   ├── <video_name>_0.png
    │   ├── <video_name>_0_raw.npz
    │   └── ...
    ├── samples/
    │   ├── <video_name>_0.npz
    │   ├── <video_name>_5.npz
    │   └── ...
    └── <video_name>.npz          # Full video visualization data
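
The .npz files can be inspected with NumPy; since the exact array names are not documented here, the generic snippet below (not repo code) simply lists whatever keys a sample contains:

import numpy as np

# Replace the placeholders with a real output path
data = np.load("<output_dir>/<video_name>/samples/<video_name>_0.npz", allow_pickle=True)
for key in data.files:
    arr = data[key]
    print(key, getattr(arr, "shape", None), getattr(arr, "dtype", None))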

Visualization

3D Trajectory Viewer

Visualize 3D traces on single images using viser:

python visualize_single_image.py \
    --npz_path <output_dir>/<video_name>/samples/<video_name>_0.npz \
    --image_path <output_dir>/<video_name>/images/<video_name>_0.png \
    --depth_path <output_dir>/<video_name>/depth/<video_name>_0.png \
    --port 8080

Verify Output Files

Check saved NPZ files:

# 3D trajectory checker
python checker/batch_process_result_checker_3d.py <output_dir> --max-videos 1 --max-samples 3

# 2D trajectory checker
python checker/batch_process_result_checker.py <output_dir> --max-videos 1 --max-samples 3

Instruction Generation

Generate task descriptions using a vision-language model (VLM).

Setup API Keys

Create a .env file in the project root:

# For OpenAI (default)
OPENAI_API_KEY=your_openai_api_key

# For Google Gemini
GOOGLE_API_KEY=your_gemini_api_key
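
If you want to confirm the keys are picked up before running the generation script, python-dotenv (a common choice; whether the repo loads .env this way is an assumption) reads the file like this:

import os
from dotenv import load_dotenv

load_dotenv()  # reads .env from the current working directory
print("OpenAI key set:", bool(os.environ.get("OPENAI_API_KEY")))
print("Gemini key set:", bool(os.environ.get("GOOGLE_API_KEY")))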

Generate Descriptions

cd text_generation/
python generate_description.py --episode_dir <dataset_directory>

# Skip episodes that already have descriptions
python generate_description.py --episode_dir <dataset_directory> --skip_existing

Helper Functions

  • Reading 3D data: See ThreedReader in visualize_single_image.py
  • Point and camera transformations: See utils/threed_utils.py
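
As a rough picture of what such helpers do, transforming 3D points between frames amounts to applying a 4x4 rigid transform to homogeneous coordinates. The sketch below is illustrative only; the actual function names and conventions are defined in utils/threed_utils.py:

import numpy as np

def transform_points(points: np.ndarray, T: np.ndarray) -> np.ndarray:
    """Apply a 4x4 rigid transform T to an (N, 3) array of 3D points."""
    homo = np.concatenate([points, np.ones((len(points), 1))], axis=1)  # (N, 4)
    return (homo @ T.T)[:, :3]

# Example: a transform that translates points by +1 along z
T = np.eye(4)
T[2, 3] = 1.0
print(transform_points(np.zeros((3, 3)), T))  # every point moves to z = 1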

📖 Citation

If you find this work useful, please consider citing our paper:

@article{lee2025tracegen,
  title={TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos},
  author={Lee, Seungjae and Jung, Yoonkyo and Chun, Inkook and Lee, Yao-Chih and Cai, Zikui and Huang, Hongjia and Talreja, Aayush and Dao, Tan Dat and Liang, Yongyuan and Huang, Jia-Bin and Huang, Furong},
  journal={arXiv preprint arXiv:2511.21690},
  year={2025}
}
