Replay Overshooting: Learning Stochastic Latent Dynamics with the Extended Kalman Filter

This package contains code to use the methods described in our submission to the International Conference on Robotics and Automation (ICRA) 2021:

Albert H. Li*, Philipp Wu*, Monroe Kennedy III Replay Overshooting: Learning Stochastic Latent Dynamics with the Extended Kalman Filter Proceedings of the International Conference on Robotics and Autonomation (ICRA) 2021.

Minimal Setup

Note that only python>=3.8 is supported.

git clone https://github.com/wuphilipp/replay-overshooting.git
cd replay-overshooting
pip install -e .

We recommend using conda for managing python environments. Optionally, install the requirements.txt for development utilities.

pip install -r requirements.txt

Run the example

python scripts/train_example.py

Repository Overview

File Structure

scripts - Scripts used for training and evaluation.
- paper_experiments
  - pend_img - Configuration files for training models on video frames of a simulated pendulum.
  - mit_push - Configuration files for training models on the MIT Push Dataset.
dynamics_learning - Contains the core code.
- custom - Common learning rate schedulers and policies.
- data - Manages data and converts datasets into a common format.
- networks - Contains all code for creating neural dynamics models.
  - baseline - Includes implementations of baseline models from PlaNet.
  - image_models - Implements common vision models and extends the EKF to image observations.
  - kalman - Contains all core functional for the EKF.
- traininig - Manages training and evaluation of models.
- utils - Common utilities shared across the code base.

More detailed documentation is provided in the code.

Training and Evaluation

All runs are managed through an ExpConfig (found in dynamics_learning/training/configs.py) which contains all the information necessary to reproduce a model (including hyperparameter settings). Each experiment in the scripts directory contains the construction of the ExpConfig. See scripts/train_example.py for an example of this. Model performance evaluation can done by running the corresponding eval_* file in the same folder.

Tracking model metrics and visualizations can be viewed through Tensorboard. Tensorboard logs will automatically be created during training in log folder.

Additional Code

pytorch is auto differentiation framework used throughout the codebase.
fannypack is used for experiment management.

Contribute

Contributions and bug fixes are welcome! The code follows the numpydoc style guide for docstrings. In addition, the following tools are used to manage the python code:

black for code formatting
isort for managing imports
flake8 for style checking
mypy for static type checking

pre-commit hooks

Some pre-commits are provided in the repo to be optionally used to assist development. This will automatically do some checking/formatting. To use the pre-commit hooks, run the following:

pip install pre-commit
pre-commit install

Known Bugs

This code base makes heavy use of torch.cholesky. However occasionally there will be a CUDA illegal memory access error. This is a known pytorch issue, but not something we can directly resolve. If this occurs, try changing the random seed.

Bibtex

Please cite our paper if relevant!

@inproceedings{
    liwu_2021ReplayOvershooting,
    title={{Replay Overshooting}: Learning Stochastic Latent Dynamics with the
    Extended Kalman Filter},
    author={A. {Li} and P. {Wu} and M. {Kennedy}},
    booktitle={2021 International Conference on Robotics and Automation (ICRA)},
    year={2021},
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.github/workflows		.github/workflows
dynamics_learning		dynamics_learning
scripts		scripts
.flake8		.flake8
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
mypy.ini		mypy.ini
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Replay Overshooting: Learning Stochastic Latent Dynamics with the Extended Kalman Filter

Minimal Setup

Repository Overview

File Structure

Training and Evaluation

Additional Code

Contribute

pre-commit hooks

Known Bugs

Bibtex

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

wuphilipp/replay-overshooting

Folders and files

Latest commit

History

Repository files navigation

Replay Overshooting: Learning Stochastic Latent Dynamics with the Extended Kalman Filter

Minimal Setup

Repository Overview

File Structure

Training and Evaluation

Additional Code

Contribute

pre-commit hooks

Known Bugs

Bibtex

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages