Releases: prabhatnagarajan/table-rl
v0.2.0
Release 0.2.0
This release introduces several new features:
- Double Q-learning
- Explorers: adds linear decay epsilon-greedy exploration and percentage decay epsilon-greedy exploration
- Renames `env` to `envs`
- Adds Riverswim environment
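
For readers unfamiliar with the Double Q-learning item above, here is a minimal, generic sketch of the tabular update. It does not use table-rl's API: the function name `double_q_update`, its parameters, and the dictionary-based tables are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def double_q_update(q_a, q_b, state, action, reward, next_state, n_actions,
                    terminal, alpha=0.1, gamma=0.99, rng=random):
    """Apply one tabular Double Q-learning update in place (illustrative only)."""
    # Flip a fair coin to decide which of the two tables is updated this step.
    if rng.random() < 0.5:
        update, other = q_a, q_b
    else:
        update, other = q_b, q_a

    if terminal:
        target = reward
    else:
        # Select the greedy next action using the table being updated...
        best = max(range(n_actions), key=lambda a: update[(next_state, a)])
        # ...but evaluate that action using the other table.
        target = reward + gamma * other[(next_state, best)]

    update[(state, action)] += alpha * (target - update[(state, action)])


# Hypothetical usage: two independent tables keyed by (state, action).
q_a, q_b = defaultdict(float), defaultdict(float)
double_q_update(q_a, q_b, state=0, action=1, reward=1.0,
                next_state=2, n_actions=2, terminal=False)
```

Decoupling action selection from action evaluation is what distinguishes this from a standard Q-learning update, which uses a single table for both.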
v0.1.0
Release 0.1.0
Algorithms
- #25: SARSA
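
As a reminder of what SARSA computes, the following is a minimal, generic sketch of the tabular update. It is not table-rl's implementation: `sarsa_update`, its parameters, and the dictionary-based table are hypothetical names chosen for this example.

```python
from collections import defaultdict


def sarsa_update(q, state, action, reward, next_state, next_action,
                 terminal, alpha=0.1, gamma=0.99):
    """Apply one tabular SARSA update in place and return the TD error."""
    # On-policy target: bootstrap from the action actually chosen next,
    # not from the greedy action as Q-learning would.
    bootstrap = 0.0 if terminal else gamma * q[(next_state, next_action)]
    td_error = reward + bootstrap - q[(state, action)]
    q[(state, action)] += alpha * td_error
    return td_error


# Hypothetical usage with a table keyed by (state, action).
q = defaultdict(float)
sarsa_update(q, state=0, action=0, reward=1.0,
             next_state=1, next_action=1, terminal=False)
```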
Features/Enhancements
Explorers
- #24: Expands the `observe` method in explorers
- #32: Adds `training_mode` to explorers' `observe`
- #31: Adds PolicyExecutor as an explorer that executes a specific policy
- #27: Provides the observation to explorers
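
One plausible reason an explorer's `observe` hook would receive the observation and a training flag is sketched below. This is a guess at the general pattern, not table-rl's actual interface: the class `DecayingEpsilonGreedySketch`, its constructor arguments, and the `observe` and `select_action` signatures are all hypothetical.

```python
import random


class DecayingEpsilonGreedySketch:
    """Hypothetical explorer; not table-rl's actual class or signatures."""

    def __init__(self, start_eps, end_eps, decay_steps, seed=None):
        self.start_eps = start_eps
        self.end_eps = end_eps
        self.decay_steps = decay_steps
        self.steps = 0
        self.rng = random.Random(seed)

    @property
    def epsilon(self):
        # Interpolate linearly from start_eps to end_eps, then hold at end_eps.
        fraction = min(self.steps / self.decay_steps, 1.0)
        return self.start_eps + fraction * (self.end_eps - self.start_eps)

    def observe(self, observation, training_mode):
        # The observation is unused in this sketch. The schedule advances only
        # while training, so evaluation steps leave epsilon unchanged.
        if training_mode:
            self.steps += 1

    def select_action(self, q_values):
        # With probability epsilon pick a uniformly random action,
        # otherwise pick the greedy one.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(q_values))
        return max(range(len(q_values)), key=q_values.__getitem__)
```

In this sketch, calling `observe(obs, training_mode=False)` during evaluation leaves the exploration schedule where training left it.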
Environments
- #26: Adds overestimation environment from Double Q-learning paper
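
These notes do not describe the environment's dynamics, so the sketch below does not reproduce it. It only illustrates the general maximization bias that Double Q-learning is designed to counter: when every action's true value is 0, the maximum over noisy estimates is still positive on average. The function name `max_estimate_bias` and its parameters are hypothetical.

```python
import random


def max_estimate_bias(n_actions=10, n_samples=5, trials=2000, seed=0):
    """Average max-over-actions of sample means when every true value is 0."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        # Estimate each action's value from a few noisy zero-mean rewards.
        estimates = [
            sum(rng.gauss(0.0, 1.0) for _ in range(n_samples)) / n_samples
            for _ in range(n_actions)
        ]
        # Taking the max of unbiased estimates yields an upward-biased value.
        total += max(estimates)
    return total / trials


# With a single action there is nothing to maximize over, so the bias vanishes.
print(max_estimate_bias(n_actions=10), max_estimate_bias(n_actions=1))
```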
Learners
- #28: Adds training mode to learners