Improving PILCO with Bayesian Neural Network Dynamics Models

## Work to Replicate

Gal, Y., McAllister, R. and Rasmussen, C.E., 2016, April. Improving PILCO with Bayesian neural network dynamics models. In Data-Efficient Machine Learning workshop, ICML.

## Motivation

This paper extends a very sample efficient model-based policy search method, PILCO, with Bayesian Neural Network Dynamics model rather than Gaussian Processes. 

I have an initial trial in [this repo](https://github.com/zuoxingdong/DeepPILCO), it fails to learn a good controller, even though I have tried a few months for testing good hyperparameters. 

## Challenges

If anyone interested in reproducing this algorithm can firstly have a look of my [initial implementation](https://github.com/zuoxingdong/DeepPILCO) in PyTorch. 

I failed to make it work, perhaps with following potential problems:

1. Sensitive to specific good hyperparameters ?
2. BNN in this paper uses Monte Carlo dropout, maybe other BNN can work ?
3. The dynamics model must be trained sufficiently good for each iteration ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improving PILCO with Bayesian Neural Network Dynamics Models #3

Work to Replicate

Motivation

Challenges

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Improving PILCO with Bayesian Neural Network Dynamics Models #3

Description

Work to Replicate

Motivation

Challenges

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions