AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation

Updates

[2025/05/05] We released our research paper on arXiv.

TODO List

Release Full training code
Release AOR-Instruction data
Implementation Guide

Install

Clone the AOR

git clone https://github.com/Liqq1/AOR
cd AOR

Create the env

conda create -n aor python=3.10 -y
conda activate aor
pip install --upgrade pip  # enable PEP 660 support
pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --index-url https://download.pytorch.org/whl/cu121  # install pytorch
pip install setuptools_scm
pip install --no-cache-dir -e .

Install the flash-attn package

pip install ninja
pip install flash-attn --no-build-isolation

Install the mmcv-1.4.7 package

cd mmcv-1.4.7
MMCV_WITH_OPS=1 pip install -e .

Training

AOR is trained on 4 NVIDIA A100 GPUs with the following code.

Explanation of Environment Variables

ONLY_SPI: Whether train spi module (region feature extractor) only.

CLIP: Use openai/CLIP instead of BioCLIP.

V15: Use LLaVA v1.5 instead of LLaVA v1.

STAGE 1

bash train_stage1.sh

STAGE 2

bash train_stage2.sh

STAGE 3

bash train_stage3.sh

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
aor		aor
llava		llava
mmcv-1.4.7		mmcv-1.4.7
mmdet		mmdet
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
mmcv		mmcv
pyproject.toml		pyproject.toml
train_stage1.sh		train_stage1.sh
train_stage2.sh		train_stage2.sh
train_stage3.sh		train_stage3.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation

Updates

TODO List

Install

Training

Explanation of Environment Variables

STAGE 1

STAGE 2

STAGE 3

Acknowledgement

About

Uh oh!

Releases

Packages

Languages

License

Liqq1/AOR

Folders and files

Latest commit

History

Repository files navigation

AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation

Updates

TODO List

Install

Training

Explanation of Environment Variables

STAGE 1

STAGE 2

STAGE 3

Acknowledgement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages