- Our code builds on MoSca. Please follow the MoSca installation to set up the environment.
- Install additional dependencies:
  ```bash
  pip install -r requirements_vidar.txt
  ```

We provide training on full- and half-resolution DyCheck scenes. In our benchmark, we evaluate methods on masked dynamic regions; the masks can be found here. We follow the MoSca directory structure and place the files in `data/iphone` and `data/iphone_full_res`.
To speed up the training, consider using pregenerated sampled camera poses and corresponding masks found here.
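For reference, a minimal staging sketch (the archive names are placeholders for the downloads above; the per-scene layout follows MoSca's DyCheck convention):

```bash
# Hypothetical file staging; replace the archive names with the actual
# downloads from the links above.
mkdir -p data/iphone data/iphone_full_res

# Evaluation masks for the dynamic regions (one sub-folder per scene, e.g. apple/).
unzip dycheck_masks.zip -d data/iphone_full_res

# Optional: pregenerated sampled camera poses and masks, to skip those pipeline steps.
unzip sampled_cameras.zip -d data/iphone_full_res
```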
The pipeline can be run as:

```bash
bash pipeline.sh
```

The steps involved (a per-scene run sketch follows the list):
- MoSca reconstruction.
- LoRA training on the input video.
- Camera sampling (skipped if downloaded).
- Sampled-view generation.
- Mask generation (skipped if downloaded).
  - For the best masks, consider using Track Anything or a similar tool on the sampled views.
- Sampled-view enhancement.
- Pseudo-multi-view reconstruction.
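Below is a minimal per-scene sketch. Assumption: `pipeline.sh` takes the scene name as its first argument; check the script header for its actual interface before batching.

```bash
# Hypothetical per-scene driver; the positional scene argument is an
# assumption about pipeline.sh's interface, and the scene names are examples.
for scene in apple block spin; do
    bash pipeline.sh "$scene"
done
```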
To run evaluation, use the following:
```bash
python vidar_evaluate.py --scene apple --input_dir data/iphone_full_res/apple --pred_dir data/iphone_full_res/apple/logs/iphone_fit_vidar/tto_test --output_dir output/vidar/apple
```
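To evaluate several scenes in one pass, a minimal batch sketch (the scene list is illustrative; the flags mirror the single-scene command above):

```bash
# Sketch: evaluate multiple scenes in a loop. The scene names are examples
# of DyCheck iPhone scenes; adjust the list to the scenes you reconstructed.
for scene in apple block spin; do
    python vidar_evaluate.py \
        --scene "$scene" \
        --input_dir "data/iphone_full_res/$scene" \
        --pred_dir "data/iphone_full_res/$scene/logs/iphone_fit_vidar/tto_test" \
        --output_dir "output/vidar/$scene"
done
```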
If you find our work useful, please cite:

```bibtex
@inproceedings{nazarczuk2025vidar,
  title={{ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs}},
  author={Nazarczuk, Michal and Catley-Chandar, Sibi and Tanay, Thomas and Zhang, Zhensong and Slabaugh, Gregory and Pérez-Pellitero, Eduardo},
  booktitle={Advances in Neural Information Processing Systems},
  year={2025}
}
```