This repository implements and compares the performance of the following algorithms: 'UCLK-C' (our proposed algorithm), 'UCRL2-VTR (Bernstein-type)', 'TSDE', 'UCRL2', and 'RANDOM'.
- UCLK-C: introduced in "Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded Span"
- UCRL2-VTR: introduced in "Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation"
- TSDE: introduced in "Learning Unknown Markov Decision Processes: A Thompson Sampling Approach"
- UCRL2: introduced in "Near-optimal Regret Bounds for Reinforcement Learning"
- RANDOM: a baseline policy that takes an arbitrary action at each step
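As a point of reference, the RANDOM baseline can be sketched in a few lines. This is an illustrative sketch, not the repository's actual implementation; the function name `random_policy` and the list-of-actions interface are assumptions for the example.

```python
import random

def random_policy(actions):
    """RANDOM baseline: pick an action uniformly at random, ignoring the state."""
    return random.choice(actions)

# Example: five uniformly random actions in a 2-action MDP
random.seed(0)  # fixed seed for reproducibility of the example
trajectory = [random_policy([0, 1]) for _ in range(5)]
print(trajectory)
```

Because it ignores the state entirely, RANDOM incurs linear regret on the hard-to-learn MDPs, which is why it serves as the weakest baseline in the comparison.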
- algorithms/: Algorithm implementations
- env/: Hard-to-learn MDP environments
- test/: Scripts that produce experiment outputs
- data/: Logs and experiment outputs
- image/: Regret plots
- plot.ipynb: Regret visualization notebook
Install the required packages:

```
pip install -r requirements.txt
```

The experiments demonstrate the efficacy of our proposed algorithm UCLK-C at learning hard-to-learn MDPs.
This project is licensed under the MIT License. See LICENSE for details.
