Closed
Description
Important Note: We do not provide technical support or consulting, and we do not answer personal questions by email.
Please post your question on the RL Discord, Reddit or Stack Overflow in that case.
🚀 Feature
I would like to implement MBPO (Model-Based Policy Optimization), a model-based RL algorithm proposed here.
Motivation
The authors claim that the algorithm is simpler and up to 10x more sample-efficient than baselines such as SAC.
This would be helpful in my own work too.
Checklist
- [x] I have checked that there is no similar issue in the repo (required)