Policy Contrastive Decoding for Robotic Foundation Models

Official implementation of the paper "Policy Contrastive Decoding for Robotic Foundation Models".

Note: We are doing our best to improve this work. If you have any questions or suggestions, please feel free to create an issue in this repo or contact us at [email protected].

[Project] [ArXiv] [PDF] [PCD-real] [PCD-LeRobot]

News

🔥Jan, 26, 2026: 🎉🎉Our paper has been accepted by ICLR 2026!🎉🎉
🔥Oct 13, 2025: Our paper has been updated for better clarity and readability. The optimized version is now available on arXiv.
🔥May 20, 2025: The code is released and the paper is now available on arXiv.

Introduction

Abstract Generalist robot policies, or robotic foundation models, hold immense potential to enable flexible, general-purpose and dexterous robotic systems. Despite their advancements, our empirical experiments reveal that existing robot policies are prone to learning spurious correlations from pre-training trajectories, adversely affecting their generalization capabilities during inference. To tackle this, we propose a novel Policy Contrastive Decoding (PCD) approach, which redirects the robot policy’s focus toward object-relevant visual clues by contrasting action probability distributions derived from original and object-masked visual inputs. As a training-free method, our PCD can be used as a plugin to improve different types of robot policies without needing to finetune or access model weights. We conduct extensive experiments on top of three open-source robot policies, including the autoregressive policy OpenVLA and the diffusion-based policies Octo and $\pi_0$. The obtained results in both simulation and real-world environments prove PCD’s flexibility and effectiveness, e.g., PCD enhances the state-of-the-art policy $\pi_0$ by 8% in the simulation environment and by 108% in the real-world environment.

Experiments

Overall Performance

Simulated Environments

Real-world Environments

Performance on Different Factors

Videos

Real-world Environments

Note: The relevant code of the real-world experiments is available in PCD-real.

Baseline: Pick Ball	Baseline: Move Near	Baseline: Banana Plate	Baseline: Stack Cube

+Ours: Pick Ball	+Ours: Move Near	+Ours: Banana Plate	+Ours: Stack Cube

Baseline: Distractors	Baseline: Spatial Relation	Baseline: Brightness	Baseline: Texture

+Ours: Distractors	+Ours: Spatial Relation	+Ours: Brightness	+Ours: Texture

Simulated Environments

Baseline: Pick Coke Can	Baseline: Move Near	Baseline: Carrot Plate	Baseline: Eggplant Basket

+Ours: Pick Coke Can	+Ours: Move Near	+Ours: Carrot Plate	+Our: Eggplant Basket

Baseline: Spatial Relation	Baseline: Brightness	Baseline: Texture	Baseline: Texture

+Ours: Spatial Relation	+Ours: Brightness	+Ours: Texture	+Ours: Texture

Running

Clone this repository.

git clone https://github.com/Koorye/PCD.git

Install all dependencies.

conda create -n pcd python=3.10
conda activate pcd
bash scripts/install_dependencies.sh

Download model checkpoints.

Note: Some of the checkpoints cannot be downloaded directly, you may need to download them manually from the links provided in the script.

bash scripts/download_pretrained_weights.sh

Run evaluation on simpler.

bash scripts/default/inference/run.sh

Acknowledgements

Our work is built upon the following open-source projects: SimplerEnv, OpenVLA, Octo, Open Pi-0, Grounded SAM2, YOLO World, SED, Inpaint Anything. We thank the authors for releasing their code. If you use our model and code, please consider citing these works as well.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
assets		assets
contrast_policies		contrast_policies
contrast_utils		contrast_utils
examples		examples
scripts		scripts
simpler_env		simpler_env
third_party		third_party
tools		tools
.gitignore		.gitignore
README.md		README.md
parallel_inference.py		parallel_inference.py
parse_results.py		parse_results.py
properties.py		properties.py
requirements_full_install.txt		requirements_full_install.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Policy Contrastive Decoding for Robotic Foundation Models

News

Introduction

Experiments

Overall Performance

Performance on Different Factors

Videos

Real-world Environments

Simulated Environments

Running

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Policy Contrastive Decoding for Robotic Foundation Models

News

Introduction

Experiments

Overall Performance

Performance on Different Factors

Videos

Real-world Environments

Simulated Environments

Running

Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages