Skip to content

Improving PILCO with Bayesian Neural Network Dynamics Models #3

@zuoxingdong

Description

@zuoxingdong

Work to Replicate

Gal, Y., McAllister, R. and Rasmussen, C.E., 2016, April. Improving PILCO with Bayesian neural network dynamics models. In Data-Efficient Machine Learning workshop, ICML.

Motivation

This paper extends a very sample efficient model-based policy search method, PILCO, with Bayesian Neural Network Dynamics model rather than Gaussian Processes.

I have an initial trial in this repo, it fails to learn a good controller, even though I have tried a few months for testing good hyperparameters.

Challenges

If anyone interested in reproducing this algorithm can firstly have a look of my initial implementation in PyTorch.

I failed to make it work, perhaps with following potential problems:

  1. Sensitive to specific good hyperparameters ?
  2. BNN in this paper uses Monte Carlo dropout, maybe other BNN can work ?
  3. The dynamics model must be trained sufficiently good for each iteration ?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions