-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Description
Work to Replicate
Gal, Y., McAllister, R. and Rasmussen, C.E., 2016, April. Improving PILCO with Bayesian neural network dynamics models. In Data-Efficient Machine Learning workshop, ICML.
Motivation
This paper extends a very sample efficient model-based policy search method, PILCO, with Bayesian Neural Network Dynamics model rather than Gaussian Processes.
I have an initial trial in this repo, it fails to learn a good controller, even though I have tried a few months for testing good hyperparameters.
Challenges
If anyone interested in reproducing this algorithm can firstly have a look of my initial implementation in PyTorch.
I failed to make it work, perhaps with following potential problems:
- Sensitive to specific good hyperparameters ?
- BNN in this paper uses Monte Carlo dropout, maybe other BNN can work ?
- The dynamics model must be trained sufficiently good for each iteration ?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels