[Feature] VITA Policy + FM Policy using DiT + minor bug fixes#580
Merged
zcyqyq merged 7 commits intoRoboVerseOrg:mainfrom Nov 6, 2025
Merged
[Feature] VITA Policy + FM Policy using DiT + minor bug fixes#580zcyqyq merged 7 commits intoRoboVerseOrg:mainfrom
zcyqyq merged 7 commits intoRoboVerseOrg:mainfrom
Conversation
Morpheus-An
pushed a commit
to Morpheus-Antuo/RoboVerse
that referenced
this pull request
Nov 6, 2025
…rseOrg#580) * Remove noise scheduler from fm. Refactor dp and fm. Fix posix path. * add DiT and FM DiT; lower cased file names * Init VITA * Remove repeated base policy * pre-commit and update contrib.md --------- Co-authored-by: Murphy <[email protected]>
2 tasks
myuansun
pushed a commit
to yongce-liu/RoboVerse
that referenced
this pull request
Nov 27, 2025
…rseOrg#580) * Remove noise scheduler from fm. Refactor dp and fm. Fix posix path. * add DiT and FM DiT; lower cased file names * Init VITA * Remove repeated base policy * pre-commit and update contrib.md --------- Co-authored-by: Murphy <[email protected]>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Description
New Features: two new IL policies (VITA, FM + DiT)
Bug Fixes
Type of change
1. VITA Policy [0]
VITA (Vision-to-Action flow matching)[0] is a new fast and performant policy learning algorithm. VITA directly flows from latent images to latent actions without sampling from Gaussian or injecting conditions during denoising.
2. Flow Matching Policy
RoboVerse now supports FM + UNet. The PR provides a DiT implementation that has been well tested in our previous papers and can outperform FM + UNet and match RoboVerse DP implementation.
Local performance tests on
CloseBox L0. We use the default collect_demos scripts to generate datasets and default DP runner configurations for all the models and conduct two random runs locally. FM + DiT outperforms UNet and VITA outperforms both DP and FM.[0] Gao, Dechen, et al. "VITA: Vision-to-Action Flow Matching Policy." arXiv preprint arXiv:2507.13231 (2025).
[1] Peebles, William, and Saining Xie. "Scalable diffusion models with transformers." Proceedings of the IEEE/CVF international conference on computer vision. 2023.
[2] Su, Jianlin, et al. "Roformer: Enhanced transformer with rotary position embedding." Neurocomputing 568 (2024): 127063.
[3] Tong, Alexander, et al. "Improving and generalizing flow-based generative models with minibatch optimal transport." arXiv preprint arXiv:2302.00482 (2023).
[4] Tong, Alexander, et al. "Simulation-free schr" odinger bridges via score and flow matching." arXiv preprint arXiv:2307.03672 (2023).
3. Minor fixes.
BaseImagePolicydeclartions in DP policies and use the base class within DP utils.roboverse_learn/il/data2zarr_dp.pycan be interrupted when meta json is not successfully generated. Added try-catch.ckpt_name = args.checkpoint_path.split("/")[-1] + "_" + time_strindp_runner.pyis not compatible with newer pathlib versions.How to test
Please describe how to test the change if applicable.
VITA Policy:
Set
algo_choosetovita_model(dp_run.sh)FM Policy + DiT:
Set
algo_choosetofm_dit_model(dp_run.sh)Local performance tests on
CloseBox L0:Screenshots / Videos
Please attach before and after screenshots or videos of the change if applicable.
Checklist
pre-commitchecks withpre-commit run --color=always --all-filesCONTRIBUTORS.mdor my name already exists there