VLM-R1

1. Setup

Dependencies Installation

# Install transformers from source
pip install git+ssh://[email protected]/huggingface/transformers.git@89d27fa6fff206c0153e9670ae09e2766eb75cdf


# Run setup script
bash setup.sh

# Install Qwen related packages
pip install git+ssh://[email protected]/cjakfskvnad/Qwen-Agent.git
pip install qwen-vl-utils
pip install torch==2.6.0 torchvision==0.21.0

2. RL Training

Run the training script:

bash src/open-r1-multimodal/run_grpo_rec.sh

Training outputs will be saved to: src/open-r1-multimodal/output

3. Supervised Fine-Tuning (SFT)

Running the SFT Script

bash /data/muze/VLM-R1/src/open-r1-multimodal/sft.sh

Data Configuration

Before running the script, please ensure your data follows the required format as shown in processed_data.json.

4. Evaluation

Note: Before running evaluation, make sure to configure the path to your model.

Multiple Checkpoints Evaluation

bash src/open-r1-multimodal/src/open_r1/eval.sh

Single Checkpoint Evaluation

bash src/open-r1-multimodal/src/open_r1/eval_1.sh

Name		Name	Last commit message	Last commit date
Latest commit History 116 Commits
1		1
Qwen-Agent		Qwen-Agent
assets		assets
datapreprocess		datapreprocess
logs		logs
src		src
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
all_data.json		all_data.json
dataset_info.json		dataset_info.json
debug_sft.sh		debug_sft.sh
debug_vlm.sh		debug_vlm.sh
debug_vlm_frz_vision.sh		debug_vlm_frz_vision.sh
environment.yml		environment.yml
lll.jpg		lll.jpg
lll.py		lll.py
processed_data.json		processed_data.json
result.json		result.json
setup.sh		setup.sh
tttttt.py		tttttt.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

VLM-R1

1. Setup

Dependencies Installation

2. RL Training

3. Supervised Fine-Tuning (SFT)

Running the SFT Script

Data Configuration

4. Evaluation

Multiple Checkpoints Evaluation

Single Checkpoint Evaluation

About

Uh oh!

Releases

Packages

Languages

License

cjakfskvnad/VLM-R1

Folders and files

Latest commit

History

Repository files navigation

VLM-R1

1. Setup

Dependencies Installation

2. RL Training

3. Supervised Fine-Tuning (SFT)

Running the SFT Script

Data Configuration

4. Evaluation

Multiple Checkpoints Evaluation

Single Checkpoint Evaluation

About

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages