MONAI Versatile Imaging SegmenTation and Annotation

[Paper] [Demo] [Huggingface] [Huggingface-CTMR]

News!

[10/27/2025] We release NV-Segment-CTMR, a joint CT-MR automatic segmentation model trained on over 30K CT and MRI scans, supporting over 300 classes.

[03/12/2025] We provide VISTA3D as a baseline for the challenge "CVPR 2025: Foundation Models for Interactive 3D Biomedical Image Segmentation" (link). The simplified code, based on MONAI 1.4, is provided here.

[02/26/2025] The VISTA3D paper has been accepted to CVPR 2025!

Overview

Model Comparison

| Feature | VISTA3D | NV-Segment-CT | NV-Segment-CTMR |
|---|---|---|---|
| Anatomical Classes | 132 classes (7 types of tumors) | Same as VISTA3D | 345+ classes (Details) |
| Modalities | CT only | Same as VISTA3D | CT + MRI (body & brain) |
| Segmentation Type | Automatic + Interactive (point-click) | Same as VISTA3D | Automatic only |
| Model Weights | NV-Segment-CT on HuggingFace (MONAI 1.3) | NV-Segment-CT on HuggingFace (MONAI 1.4, minor layer naming change) | NV-Segment-CTMR on HuggingFace |
| Implementation | Current repo: MONAI 1.3 research code | Optimized MONAI Bundle (MONAI >= 1.4) | Optimized MONAI Bundle (MONAI >= 1.4) |
| Usage | Full training for all models, inference, and finetuning | Optimized, fast inference; lightweight finetuning; wrapped into config and bundle | Same as NV-Segment-CT |
| License | Commercial friendly | Same as VISTA3D | Non-commercial |
We recommend NV-Segment-CTMR for large-scale automatic segmentation of CT and MRI scans, because it is trained on large and diverse datasets. For CT tumor segmentation or interactive refinement, users should try NV-Segment-CT.

VISTA3D/NV-Segment-CT (Paper) is a foundation model trained systematically on 11,454 volumes encompassing 127 types of human anatomical structures and various lesions. The model achieves state-of-the-art performance on:

  • out-of-the-box automatic segmentation on 3D CT scans
  • zero-shot interactive segmentation in 3D CT scans
  • automatic segmentation + interactive refinement

NV-Segment-CTMR starts from the NV-Segment-CT checkpoint and is finetuned on over 30K CT and MRI scans, supporting over 300 classes. It offers:

  • out-of-the-box automatic segmentation of 3D CT and MRI scans
  • the same architecture as the VISTA3D CT model, but only the automatic segmentation branch was trained, on larger CT and MRI datasets

Usage

1. NVIDIA Open Models and HuggingFace

We wrapped VISTA3D-CT and VISTA3D-CTMR into a more structured MONAI bundle format with optimized GPU utilization and a better interface for training and inference. We also created simplified HuggingFace models for inference. Going forward, we will only maintain the directories in the following repository:

Quick Start

Installation

# use the same conda env as this repo
conda create -y -n vista3d-nv python=3.9
conda activate vista3d-nv
git clone https://github.com/NVIDIA-Medtech/NV-Segment-CTMR.git
cd NV-Segment-CTMR/NV-Segment-CTMR
pip install -r requirements.txt
cd ..
mkdir -p NV-Segment-CT/models NV-Segment-CTMR/models
# download CT weights from the HuggingFace link
cd NV-Segment-CT
# Option 1: download with the hf CLI and move to the expected location
hf download nvidia/NV-Segment-CT --local-dir models/ && \
mv models/vista3d_pretrained_model/model.pt models/model.pt
# download CTMR weights from the HuggingFace link
cd ../NV-Segment-CTMR
hf download nvidia/NV-Segment-CTMR --local-dir models/ && \
mv models/vista3d_pretrained_model/model.pt models/model.pt

1.1 NV-Segment-CT [Github] [Huggingface]

Automatic Segmentation (supports multi-GPU batch processing)

class definition

# CT segmentation
cd NV-Segment-CT
# Automatic Segment everything
python -m monai.bundle run --config_file configs/inference.json --input_dict "{'image':'example/spleen_03.nii.gz'}"
# Automatic Segment specific class
python -m monai.bundle run --config_file configs/inference.json --input_dict "{'image':'example/spleen_03.nii.gz','label_prompt':[3]}"
# Automatic Batch segmentation for the whole folder
python -m monai.bundle run --config_file="['configs/inference.json', 'configs/batch_inference.json']" --input_dir="example/" --output_dir="example/"
# Automatic batch segmentation for the whole folder with multi-GPU support. mgpu_inference.json is below. Change nproc_per_node to match your GPU count.
torchrun --nproc_per_node=2 --nnodes=1 -m monai.bundle run --config_file="['configs/inference.json', 'configs/batch_inference.json', 'configs/mgpu_inference.json']" --input_dir="example/" --output_dir="example/"

Interactive segmentation

# Points must be three-dimensional (x,y,z) in the shape [[x,y,z],...,[x,y,z]]. Point labels can only be -1 (ignore), 0 (negative), 1 (positive), 2 (negative for special overlapping classes like tumors), or 3 (positive for special classes). Only 1 class is supported per inference. The output value 255 represents NaN, i.e. a region that was not processed.
cd NV-Segment-CT
python -m monai.bundle run --config_file configs/inference.json --input_dict "{'image':'example/spleen_03.nii.gz','points':[[128,128,16], [100,100,16]],'point_labels':[1, 0]}"
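The point-prompt constraints above (3D coordinates, labels restricted to {-1, 0, 1, 2, 3}, matching lengths) can be sketched as a small validation helper. `make_point_prompt` is a hypothetical name for illustration, not part of the MONAI or NV-Segment API:

```python
# Hypothetical helper (not part of MONAI): builds and sanity-checks the
# 'points'/'point_labels' entries of the inference input_dict.
VALID_POINT_LABELS = {-1, 0, 1, 2, 3}  # ignore, negative, positive, special neg/pos

def make_point_prompt(points, point_labels):
    """Return an input_dict fragment after basic validation."""
    if len(points) != len(point_labels):
        raise ValueError("points and point_labels must have the same length")
    for p in points:
        if len(p) != 3:
            raise ValueError(f"each point must be (x, y, z), got {p}")
    for lab in point_labels:
        if lab not in VALID_POINT_LABELS:
            raise ValueError(f"invalid point label {lab}")
    return {"points": [list(p) for p in points], "point_labels": list(point_labels)}

# Matches the command above: one positive click, one negative click.
prompt = make_point_prompt([[128, 128, 16], [100, 100, 16]], [1, 0])
```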

NOTE: A MONAI bundle accepts multiple JSON config files and input arguments. Later configs/arguments override earlier ones when they have overlapping keys.
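The override behavior can be illustrated with plain dictionaries. This is a simplified sketch of the idea, not MONAI's actual config parser (which also handles references and nested keys); the key names are hypothetical:

```python
# Later configs override earlier ones on overlapping keys (shallow-merge sketch).
def merge_configs(*configs):
    merged = {}
    for cfg in configs:
        merged.update(cfg)  # each later config wins on key collisions
    return merged

# Hypothetical key/value pairs, for illustration only.
inference = {"output_dir": "eval", "sw_batch_size": 1}
batch_inference = {"sw_batch_size": 4}   # overrides the first config
cli_args = {"output_dir": "example/"}    # CLI arguments override both files
final = merge_configs(inference, batch_inference, cli_args)
# final == {"output_dir": "example/", "sw_batch_size": 4}
```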

1.2 NV-Segment-CTMR [Github] [Huggingface]

Please read the complete usage in the NV-Segment-CTMR [Github] repo.

We defined 345 classes, listed in metadata.json with details in label_dict.json. These files show each label's organ name, index, training dataset, modality, and evaluation Dice score. If a class comes only from CT training data, it may not perform well on MRI, though actual performance varies case by case. We support three types of segment-everything prompts: "CT_BODY", "MRI_BODY", and "MRI_BRAIN".

  • "CT_BODY" covers the 132 CT classes supported by the previous VISTA3D bundle; same as the NV-Segment-CT everything prompts.
  • "MRI_BODY" shares the same 50 label classes as TotalsegmentatorMR.
  • "MRI_BRAIN" is trained on the skull-stripped LUMIR dataset and segments 133 brain MRI substructures. We followed the MIR preprocessing tutorials and included the corresponding components in this repo. All brain MRI contrasts are supported.
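The modality-key dispatch described above can be sketched as follows. The class counts (132, 50, 133) and the CT_BODY default come from this README; `resolve_modality` is a hypothetical helper, not part of the NV-Segment-CTMR bundle code:

```python
# Sketch of the segment-everything modality keys (hypothetical helper).
EVERYTHING_PROMPTS = {
    "CT_BODY": 132,    # same classes as NV-Segment-CT
    "MRI_BODY": 50,    # same labels as TotalsegmentatorMR
    "MRI_BRAIN": 133,  # skull-stripped brain MRI substructures
}

def resolve_modality(modality=None):
    """Fall back to CT_BODY when no modality key is provided."""
    key = modality or "CT_BODY"
    if key not in EVERYTHING_PROMPTS:
        raise ValueError(
            f"unknown modality {key!r}; choose from {sorted(EVERYTHING_PROMPTS)}"
        )
    return key, EVERYTHING_PROMPTS[key]

key, n_classes = resolve_modality("MRI_BODY")  # → ("MRI_BODY", 50)
```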

Quick Start

Automatic Segmentation (supports multi-GPU batch processing)

class definition

# navigate into CTMR folder.
cd NV-Segment-CTMR;
# Automatic segment everything. A modality key is required; allowed values are "CT_BODY", "MRI_BODY", and "MRI_BRAIN". If no modality key is provided, CT_BODY is used by default. Brain MRI requires preprocessing.
python -m monai.bundle run --config_file configs/inference.json --input_dict "{'image':'example/s0289.nii.gz'}" --modality MRI_BODY
# Automatic Segment specific class
python -m monai.bundle run --config_file configs/inference.json --input_dict "{'image':'example/s0289.nii.gz','label_prompt':[3]}"
# Automatic Batch segmentation for the whole folder
python -m monai.bundle run --config_file="['configs/inference.json', 'configs/batch_inference.json']" --input_dir="example/" --output_dir="example/" --modality MRI_BODY
# Automatic batch segmentation for the whole folder with multi-GPU support. mgpu_inference.json is below. Change nproc_per_node to match your GPU count.
torchrun --nproc_per_node=2 --nnodes=1 -m monai.bundle run --config_file="['configs/inference.json', 'configs/batch_inference.json', 'configs/mgpu_inference.json']" --input_dir="example/" --output_dir="example/" --modality MRI_BODY

Brain MRI segmentation

For brain MRI segmentation, skull stripping, bias correction, and MNI-space alignment are included in the codebase. Skull stripping requires a Docker environment. More details can be found in run_brain_segmentation.sh.

./brain_t1_preprocess/run_brain_segmentation.sh --input example/brain_t1.nii.gz --output_dir results/

2. CVPR2025 research repo (current codebase, CT only)

This research repo reproduces the results of the CVPR 2025 paper, with all model training and evaluation code, built on MONAI 1.3. We will not update this repo in the future. See details

3. VISTA3D results postprocessing with ShapeKit

VISTA3D is trained with binary segmentation and may produce false positives due to weak false-positive supervision. ShapeKit addresses this with sophisticated postprocessing. ShapeKit requires a segmentation mask for each class, but VISTA3D by default performs argmax and collapses overlapping classes. Change the monai.apps.vista3d.transforms.VistaPostTransformd entry in inference.json to the following, so that each class segmentation is saved as a separate channel, then follow the ShapeKit codebase for processing.

{ 
  "_target_": "Activationsd",
  "sigmoid": true,
  "keys": "pred"
},
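The difference between the default argmax postprocessing and the per-channel sigmoid output that ShapeKit needs can be shown on toy logits. This is an illustration of the concept only, not the actual transform chain:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Toy logits for one voxel across 3 classes: classes 0 and 1 genuinely overlap.
logits = [2.0, 1.5, -3.0]

# Default argmax-style postprocessing: overlaps collapse to a single label.
argmax_label = max(range(len(logits)), key=lambda i: logits[i])

# Per-channel sigmoid (the Activationsd replacement above): each class keeps
# its own binary mask, which is the form ShapeKit expects.
per_class = [sigmoid(v) > 0.5 for v in logits]
# argmax_label == 0, but per_class == [True, True, False]: class 1 survives.
```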

Community

Join the conversation on Twitter @ProjectMONAI or join our Slack channel.

Ask and answer questions on MONAI VISTA's GitHub discussions tab.

License

The codebase is under the Apache 2.0 License. The model weights are released under the NVIDIA OneWay Noncommercial License.

Reference

@inproceedings{he2024vista3d,
  title={VISTA3D: A Unified Segmentation Foundation Model For 3D Medical Imaging},
  author={He, Yufan and Guo, Pengfei and Tang, Yucheng and Myronenko, Andriy and Nath, Vishwesh and Xu, Ziyue and Yang, Dong and Zhao, Can and Simon, Benjamin and Belue, Mason and others},
  booktitle={CVPR},
  year={2025}
}

Acknowledgement