Learning from Sparse Offline Datasets via Conservative Density Estimation

This project provides the open source implementation of the CDE in the paper: "Learning from Sparse Offline Datasets via Conservative Density Estimation"

Installation

We recommend to use Anaconda or Miniconda to manage python environment.

Install mujoco and mujoco-py, your can either refer to https://github.com/openai/mujoco-py or run

wget https://mujoco.org/download/mujoco210-linux-x86_64.tar.gz
tar -xvf mujoco210-linux-x86_64.tar.gz
mkdir .mujoco
mv mujoco210 ~/.mujoco/mujoco210

It is also necessary to add below to ~/.bashrc:

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:~/.mujoco/mujoco210/bin
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/lib/nvidia

We have included mujoco-py in requirements.txt but you may need to install libglew-dev, patchelf when compiling the mujoco-py after the installation:

sudo apt-get install libglew-dev
sudo apt-get install patchelf

Create conda env:

cd cde-offline-rl
conda env create -f environment.yaml
conda activate cde

Install PyTorch according to your platform and cuda version.
Install D4rl from https://github.com/Farama-Foundation/D4RL.

Training

To run a single experiment, take maze2d-medium-v1 for example, run

python run_cde.py --env_name "maze2d-medium-v1" --hyperparams 'hyper_params/cde/maze2d.yaml' --cudaid 0 --seed 100

where --hyperparams specifies the hyperparameter files in directory ./hyper_params/, --cudaid specifies which gpu will be used for training (the defaulted -1 means using cpu).

If you want to run multiple experiments, we have also included some other training commands in bash file run_exp.sh. You can consider using & to run the commands in parallel.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
hyper_params/cde		hyper_params/cde
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
Readme.md		Readme.md
cde.py		cde.py
environment.yaml		environment.yaml
run_cde.py		run_cde.py
run_exp.sh		run_exp.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Learning from Sparse Offline Datasets via Conservative Density Estimation

Installation

Training

About

Uh oh!

Releases

Packages

Languages

License

czp16/cde-offline-rl

Folders and files

Latest commit

History

Repository files navigation

Learning from Sparse Offline Datasets via Conservative Density Estimation

Installation

Training

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages