AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation
- [2025/05/05] We released our research paper on arXiv.
- Release Full training code
- Release AOR-Instruction data
- Implementation Guide
- Clone the
AOR
git clone https://github.com/Liqq1/AOR
cd AOR
- Create the env
conda create -n aor python=3.10 -y
conda activate aor
pip install --upgrade pip # enable PEP 660 support
pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --index-url https://download.pytorch.org/whl/cu121 # install pytorch
pip install setuptools_scm
pip install --no-cache-dir -e .
- Install the
flash-attnpackage
pip install ninja
pip install flash-attn --no-build-isolation
- Install the
mmcv-1.4.7package
cd mmcv-1.4.7
MMCV_WITH_OPS=1 pip install -e .
AOR is trained on 4 NVIDIA A100 GPUs with the following code.
ONLY_SPI: Whether train spi module (region feature extractor) only.
CLIP: Use openai/CLIP instead of BioCLIP.
V15: Use LLaVA v1.5 instead of LLaVA v1.
bash train_stage1.sh
bash train_stage2.sh
bash train_stage3.sh