Skip to content

Koorye/PCD

Repository files navigation

Policy Contrastive Decoding for Robotic Foundation Models

Official implementation of the paper "Policy Contrastive Decoding for Robotic Foundation Models".

Note: We are doing our best to improve this work. If you have any questions or suggestions, please feel free to create an issue in this repo or contact us at [email protected].

[Project] [ArXiv] [PDF] [PCD-real] [PCD-LeRobot]

News

  • 🔥Jan, 26, 2026: 🎉🎉Our paper has been accepted by ICLR 2026!🎉🎉
  • 🔥Oct 13, 2025: Our paper has been updated for better clarity and readability. The optimized version is now available on arXiv.
  • 🔥May 20, 2025: The code is released and the paper is now available on arXiv.

Introduction

Abstract Generalist robot policies, or robotic foundation models, hold immense potential to enable flexible, general-purpose and dexterous robotic systems. Despite their advancements, our empirical experiments reveal that existing robot policies are prone to learning spurious correlations from pre-training trajectories, adversely affecting their generalization capabilities during inference. To tackle this, we propose a novel Policy Contrastive Decoding (PCD) approach, which redirects the robot policy’s focus toward object-relevant visual clues by contrasting action probability distributions derived from original and object-masked visual inputs. As a training-free method, our PCD can be used as a plugin to improve different types of robot policies without needing to finetune or access model weights. We conduct extensive experiments on top of three open-source robot policies, including the autoregressive policy OpenVLA and the diffusion-based policies Octo and $\pi_0$. The obtained results in both simulation and real-world environments prove PCD’s flexibility and effectiveness, e.g., PCD enhances the state-of-the-art policy $\pi_0$ by 8% in the simulation environment and by 108% in the real-world environment.

Policy Contrastive Decoding

Experiments

Overall Performance

Simulated Environments

Simpler Results

Real-world Environments

Real-world Results

Performance on Different Factors

Factors

Videos

Real-world Environments

Note: The relevant code of the real-world experiments is available in PCD-real.

Baseline: Pick Ball Baseline: Move Near Baseline: Banana Plate Baseline: Stack Cube
Pick Ball Move Near Banana Plate Stack Cube
+Ours: Pick Ball +Ours: Move Near +Ours: Banana Plate +Ours: Stack Cube
Pick Ball Move Near Banana Plate Stack Cube
Baseline: Distractors Baseline: Spatial Relation Baseline: Brightness Baseline: Texture
Distractors Spatial Relation Brightness Texture
+Ours: Distractors +Ours: Spatial Relation +Ours: Brightness +Ours: Texture
Distractors Spatial Relation Brightness Texture

Simulated Environments

Baseline: Pick Coke Can Baseline: Move Near Baseline: Carrot Plate Baseline: Eggplant Basket
Pick Coke Can Move Near Carrot Plate Stack Cube
+Ours: Pick Coke Can +Ours: Move Near +Ours: Carrot Plate +Our: Eggplant Basket
Pick Coke Can Move Near Carrot Plate Stack Cube
Baseline: Spatial Relation Baseline: Brightness Baseline: Texture Baseline: Texture
Spatial Relation Brightness Texture Distractors
+Ours: Spatial Relation +Ours: Brightness +Ours: Texture +Ours: Texture
Spatial Relation Brightness Texture Distractors

Running

  1. Clone this repository.
git clone https://github.com/Koorye/PCD.git
  1. Install all dependencies.
conda create -n pcd python=3.10
conda activate pcd
bash scripts/install_dependencies.sh
  1. Download model checkpoints.

Note: Some of the checkpoints cannot be downloaded directly, you may need to download them manually from the links provided in the script.

bash scripts/download_pretrained_weights.sh
  1. Run evaluation on simpler.
bash scripts/default/inference/run.sh

Acknowledgements

Our work is built upon the following open-source projects: SimplerEnv, OpenVLA, Octo, Open Pi-0, Grounded SAM2, YOLO World, SED, Inpaint Anything. We thank the authors for releasing their code. If you use our model and code, please consider citing these works as well.

About

[ICLR 2026] Official implemetation of the paper "Policy Contrastive Decoding for Robotic Foundation Models"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors