Offline RL Practice

A set of offline RL algorithms implemented with TensorFlow-Keras.

  • Uses the D4RL datasets.
  • Aims at reproducible results.
  • Borrows most of its test concepts from TD3+BC.

TD3+BC

A Minimalist Approach to Offline Reinforcement Learning, Fujimoto et al. [paper], [repo]
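
As a rough illustration of the idea behind TD3+BC, the sketch below computes the actor loss for one batch: a Q-value term scaled by λ = α / mean(|Q|), plus a behavior-cloning mean squared error between policy actions and dataset actions. It uses plain Python lists rather than TensorFlow tensors, and the function name `td3_bc_actor_loss` and its arguments are hypothetical, not taken from this repository. The default `alpha=2.5` is the value reported in the paper.

```python
def td3_bc_actor_loss(q_values, policy_actions, dataset_actions, alpha=2.5):
    """Sketch of the TD3+BC actor loss for one batch (hypothetical helper).

    q_values        : list of Q(s, pi(s)) values, one per sample
    policy_actions  : list of action vectors pi(s) produced by the actor
    dataset_actions : list of action vectors a stored in the offline dataset
    """
    n = len(q_values)

    # lambda = alpha / mean(|Q|); in a real training loop this factor is
    # treated as a constant, i.e. no gradient flows through it.
    mean_abs_q = sum(abs(q) for q in q_values) / n
    lam = alpha / mean_abs_q

    # Behavior-cloning term: mean squared error over all action elements.
    action_dim = len(dataset_actions[0])
    squared_error = sum(
        (p - a) ** 2
        for pa, da in zip(policy_actions, dataset_actions)
        for p, a in zip(pa, da)
    )
    bc_term = squared_error / (n * action_dim)

    # Maximize the scaled Q-value while staying close to the dataset actions.
    q_term = sum(q_values) / n
    return -lam * q_term + bc_term


loss = td3_bc_actor_loss(
    q_values=[1.0, 3.0],
    policy_actions=[[0.0], [1.0]],
    dataset_actions=[[0.0], [0.0]],
)
print(loss)
```

In an actual TensorFlow-Keras implementation the same arithmetic would run on tensors inside a gradient tape, with the λ factor excluded from differentiation.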

Implicit Q-Learning (IQL)

Offline Reinforcement Learning with Implicit Q-Learning, Kostrikov et al. [paper][repo(jax)][repo(pytorch)]
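
The core of IQL's value update is an expectile regression loss, which weights positive and negative differences u = Q(s, a) − V(s) asymmetrically: |τ − 1(u < 0)| · u². The sketch below shows that loss on plain Python lists. The function name `expectile_loss` is hypothetical and not taken from this repository, and `tau=0.7` is only an illustrative default.

```python
def expectile_loss(q_values, v_values, tau=0.7):
    """Sketch of the expectile loss used to fit V in IQL (hypothetical helper).

    q_values : list of target Q(s, a) values for dataset actions
    v_values : list of V(s) predictions for the same states
    tau      : expectile in (0, 1); tau = 0.5 reduces to half the MSE
    """
    total = 0.0
    for q, v in zip(q_values, v_values):
        u = q - v
        # Positive differences are weighted by tau, negative ones by 1 - tau.
        weight = tau if u > 0 else 1.0 - tau
        total += weight * u * u
    return total / len(q_values)


print(expectile_loss([2.0, 0.0], [0.0, 1.0], tau=0.9))
```

With τ above 0.5, underestimating Q is penalized more than overestimating it, so V(s) is pushed toward the upper range of the Q-values seen for dataset actions.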