RL Toolkit v4.0.0

@markub3327 released this 05 Feb 17:40
668f128

Release v4.0.0

Changelog

Features 🔊

  • Render environments to WandB
  • Grouping of runs in WandB
  • SampleToInsertRatio rate limiter
  • Global Gradient Clipping to avoid exploding gradients
  • Softplus for numerical stability
  • YAML configuration file
  • LogCosh instead of Huber loss
  • Critic network with Add layer applied on state & action branches
  • Custom uniform initializer
  • XLA (Accelerated Linear Algebra) compiler
  • Optimized Replay Buffer (google-deepmind/reverb#90)
  • Split into Agent, Learner, Tester and Server
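
Three of the numerical items above (softplus, LogCosh loss, global gradient clipping) can be illustrated with a minimal pure-Python sketch. This is not the toolkit's code: the function names are hypothetical and plain lists stand in for tensors. It only shows the general techniques those changelog entries name.

```python
import math


def softplus(x):
    """Softplus log(1 + exp(x)), computed without overflow."""
    # max(x, 0) + log1p(exp(-|x|)) only ever exponentiates a non-positive number.
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def log_cosh(error):
    """Log-cosh loss for one error term: quadratic near 0, linear for large errors."""
    # Identity: log(cosh(e)) = |e| + softplus(-2|e|) - log(2).
    # This avoids calling cosh(), which overflows for large |e|.
    a = abs(error)
    return a + softplus(-2.0 * a) - math.log(2.0)


def clip_by_global_norm(grads, clip_norm):
    """Rescale all gradients by one shared factor so their joint L2 norm
    does not exceed clip_norm (assumed positive).

    `grads` is a list of flat lists, one per (hypothetical) model variable.
    Returns the rescaled gradients and the norm measured before clipping.
    """
    global_norm = math.sqrt(sum(g * g for grad in grads for g in grad))
    # The factor is 1.0 when the norm is already within the limit.
    scale = clip_norm / max(global_norm, clip_norm)
    return [[g * scale for g in grad] for grad in grads], global_norm
```

Because a single factor is applied to every gradient, global clipping preserves the direction of the overall update, unlike clipping each variable separately.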

Bug fixes 🛠️

  • Fixed creation of the save path for models
  • Fixed model's summary()