Hybrid CPU-GPU implementation of Crossmodal Attentive Skill Learner (CASL)
Codebase design is based on GA3C.
S. Omidshafiei, D. K. Kim, J. Pazis, and J. P. How, "Crossmodal Attentive Skill Learner", In NIPS Deep Reinforcement Learning Symposium, 2017.
Link: https://arxiv.org/abs/1711.10314
TensorFlow is required (tested with version 1.4.0).
For other dependencies, please refer to src/dependencies_install.sh.
Two frameworks are supported:
- Option-based crossmodal attention learning (master branch)
- Action-based crossmodal attention learning (CASL-action branch)
Three environments are supported:
- Sequential Door Puzzle
- 2D Minecraft-like
- Arcade Learning Environment-Audio (ALE-Audio)
We have added audio query support to ALE, and a pull request to the official ALE repo will be sent shortly so the community can benefit from this.
Click image to see videos of ALE-Audio in action (left shows image, right shows audio spectrogram):
Please refer to instruction in src/tensorflow/CASL/README.md.
Shayegan Omidshafiei (https://github.com/shayegano)
Dong-Ki Kim (https://github.com/dkkim93)

