[Feature Request] Implement MBPO algorithm

**Important Note: We do not do technical support, nor consulting** and don't answer personal questions per email.
Please post your question on the [RL Discord](https://discord.com/invite/xhfNqQv), [Reddit](https://www.reddit.com/r/reinforcementlearning/) or [Stack Overflow](https://stackoverflow.com/) in that case.


### 🚀 Feature

I would like to implement MBPO (Model-Based Policy Optimization), a model-based RL algorithm proposed in [this paper](https://arxiv.org/abs/1906.08253).

### Motivation

According to the paper, the algorithm is simpler than, and up to 10x as sample-efficient as, baselines such as SAC.
This would be helpful in my own work too. 

### Checklist

- [x] I have checked that there is no similar [issue](https://github.com/DLR-RM/stable-baselines3/issues) in the repo (**required**)

