
CoDA: Coding LM via Diffusion Adaptation


End-to-end diffusion language modeling across TPU pre-training, GPU fine-tuning, evaluation, and serving.


Overview 🎯

CoDA is Salesforce AI Research's open diffusion language model. This repository contains a unified training pipeline spanning pre-training and post-training, evaluation harnesses, and a simple FastAPI-based serving backend.

Note: This repository is provided for research purposes only. Data release is subject to internal regulations.

Repository Map 🗺️

| Directory | Purpose |
| --- | --- |
| `CoDALanguageModel/` | Hugging Face model class |
| `post-train/` | Supervised fine-tuning (SFT) pipeline |
| `evaluation/` | Evaluation framework |
| `pre-train/` | TPU-based pre-training pipeline |
| `serving/` | Serving stack |
| `run_sft.sh` | Launcher coupling pre-training checkpoints with the post-training diffusion trainer |
| `save_hf_model.py` | Utility script for converting checkpoints into the Hugging Face model class |
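
Once a checkpoint has been converted with `save_hf_model.py`, it should load like any other Hugging Face model. A minimal sketch, assuming the `Salesforce/CoDA-v0-Instruct` weights referenced in the serving section and that the custom model class requires `trust_remote_code` (an assumption, not confirmed by this README):

```python
# Loading sketch: model id taken from the serving section below;
# trust_remote_code is assumed for the custom CoDALanguageModel class.
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained(
    "Salesforce/CoDA-v0-Instruct", trust_remote_code=True
)
model = AutoModel.from_pretrained(
    "Salesforce/CoDA-v0-Instruct", trust_remote_code=True
)
```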

Training Quickstart 🚀

To avoid dependency conflicts, we recommend maintaining an isolated environment per subsystem and activating the corresponding environment before executing scripts in each subdirectory.

Pre-training on TPU (pre-train/)

  1. Populate TPU metadata in pre-train/env.example and copy it to pre-train/.env.
  2. Run pre-train/setup_tpu.sh to provision dependencies and sync the repository to the TPU pod.
  3. Launch pre-training with one of the provided recipes (e.g., pre-train/recipes/midtrain_v4_512.sh) to produce CoDA checkpoints in GCS or local storage. The three steps are sketched together below.
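
A minimal end-to-end sketch of the steps above; the paths and recipe name come from this README, while everything inside .env (pod name, zone, buckets, and so on) is site-specific:

```bash
# Pre-training walkthrough sketch; .env contents are site-specific.
cp pre-train/env.example pre-train/.env
# ... edit pre-train/.env with your TPU metadata, then:
bash pre-train/setup_tpu.sh
bash pre-train/recipes/midtrain_v4_512.sh
```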

Supervised Diffusion Fine-tuning (post-train/)

  1. Install prerequisites following post-train/LLaMA-Factory/README.md.
  2. Configure dataset metadata in post-train/LLaMA-Factory/data/dataset_info.json and diffusion arguments in post-train/LLaMA-Factory/examples/train_full/*.yaml.
  3. Execute ./run_sft.sh to fine-tune CoDA checkpoints with discrete denoising objectives. An example dataset entry for step 2 is sketched below.
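
As a reference for step 2, a dataset_info.json entry might look like the following. The field names follow LLaMA-Factory's usual schema, but the dataset name and file are hypothetical placeholders:

```json
{
  "coda_sft_demo": {
    "file_name": "coda_sft_demo.json",
    "formatting": "alpaca",
    "columns": {
      "prompt": "instruction",
      "query": "input",
      "response": "output"
    }
  }
}
```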

Evaluation Pipelines (evaluation/)

  1. Choose a benchmark script such as evaluation/lm_eval/eval_mbpp_humaneval.sh.
  2. Update MODEL_DIR and diffusion parameters (diffusion_steps, temperature, top_p) to match the target checkpoint.
  3. Run the script to gather metrics; logs are stored locally for aggregation and reporting. A sample invocation is sketched below.
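
Putting the three steps together, a hedged sketch: the variable names come from step 2, the values shown are illustrative, and exactly where they are set inside the script is an assumption.

```bash
# Evaluation sketch: edit the script per step 2, then run it.
# Illustrative settings:
#   MODEL_DIR=/path/to/CoDA-checkpoint
#   diffusion_steps=128  temperature=0.0  top_p=0.9
bash evaluation/lm_eval/eval_mbpp_humaneval.sh
```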

Benchmark 📊

Comparison of code-generation performance across standard and plus-enhanced benchmarks. EvalPlus is computed as the mean pass@1 on the enhanced variants. Bold marks results where CoDA achieves the strongest diffusion-model performance.

| Model | HumanEval Instruct | HumanEval Plus | MBPP Instruct | MBPP Plus | EvalPlus |
| --- | --- | --- | --- | --- | --- |
| CoDA-Base | 29.3 | 23.8 | 35.2 | 46.0 | 34.9 |
| CoDA-Instruct | 54.3 | 47.6 | 47.2 | **63.2** | **55.4** |
| Dream-Base | 56.7 | 50.0 | 68.7 | 57.4 | 53.7 |
| Dream-7B-Instruct | 57.9 | 53.7 | 68.3 | 56.1 | 54.9 |
| LLaDA-8B-Instruct | 35.4 | 31.7 | 31.5 | 28.6 | 30.2 |
| Qwen3-1.7B | 66.5 | 61.6 | 46.2 | 65.9 | 63.8 |
| Qwen2.5-Coder-1.5B | 43.9 | 36.6 | 69.2 | 58.6 | 47.6 |
| Qwen2.5-Coder-1.5B-Instruct | 70.7 | 66.5 | 69.2 | 59.4 | 62.3 |
| Gemma-3-1B-it | 39.6 | 35.4 | 39.4 | 63.5 | 49.5 |
| LLaMA-3.2-1B-Instruct | 35.4 | 31.1 | 24.4 | 53.7 | 42.4 |

Deployment Guide 🛠️

1. Create a virtual environment (recommended):

```bash
python3 -m venv .venv
source .venv/bin/activate
```

2. Install dependencies:

```bash
pip install -r serving/requirements.txt
```

3. Export your Hugging Face token:

```bash
export HF_TOKEN="hf_..."
```

4. Serve the model on CUDA:

```bash
bash serving/fast-api/start_server.sh
```

The server will listen on http://localhost:8000.

5. Interact with the served model:

```bash
python serving/fast-api/chat_cli.py --base-url http://localhost:8000 --model Salesforce/CoDA-v0-Instruct
```

Optional flags:

  • --stream to stream tokens as they are generated
  • --show-meta to display latency and token usage
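
If you prefer raw HTTP over the CLI, a curl sketch follows. It assumes the server exposes an OpenAI-compatible /v1/chat/completions route, which the CLI's --base-url/--model flags suggest but this README does not confirm:

```bash
# Hedged sketch: endpoint path and payload shape assume an
# OpenAI-compatible chat API; adjust to the actual FastAPI routes.
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "Salesforce/CoDA-v0-Instruct",
        "messages": [{"role": "user", "content": "Reverse a string in Python."}]
      }'
```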

Generation hyperparameters (env vars)

You can customize generation with these environment variables (defaults in parentheses):

  • MAX_TOKENS (256)
  • TEMPERATURE (0.0)
  • TOP_P (unset)
  • TOP_K (unset)
  • STEPS (128)
  • ALG ("entropy")
  • ALG_TEMP (0.1)
  • BLOCK_LENGTH (32)

Example:

```bash
export MAX_TOKENS=512
export TEMPERATURE=0.7
export TOP_P=0.9
export STEPS=128
export ALG=entropy
export ALG_TEMP=0.1
export BLOCK_LENGTH=32
bash serving/fast-api/start_server.sh
```

Citation 📚

```bibtex
@misc{coda2025,
  title={CoDA: Coding LM via Diffusion Adaptation},
  author={Chen, Haolin and Wang, Shiyu and Qin, Can and Pang, Bo and Liu, Zuxin and Qiu, Jielin and Zhang, Jianguo and Zhou, Yingbo and Chen, Zeyuan and Xu, Ran and Heinecke, Shelby and Savarese, Silvio and Xiong, Caiming and Wang, Huan and Yao, Weiran},
  year={2025},
  publisher={Salesforce AI Research}
}
```
