Description
"Automatic Data Augmentation for Generalization in Reinforcement Learning" describes Data-Regularized Actor-Critic (DrAC), a method that should be implemented in our framework. The authors' code is available here.
DrAC combines input augmentations with regularization terms that keep the network's outputs consistent under augmentation: a KL loss between the policy distributions for unaugmented and augmented observations, plus a squared-error term between the corresponding value estimates.
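A minimal sketch of these two terms in PyTorch, following the paper's formulation; `model` (assumed to return a policy distribution and a value estimate), `aug`, and the weight `alpha_r` are hypothetical stand-ins for our framework's actual interfaces:

```python
import torch
import torch.nn.functional as F
from torch.distributions import kl_divergence

def drac_regularizer(model, obs, aug, alpha_r=0.1):
    """DrAC-style consistency losses under an input augmentation.
    `model(obs)` is assumed to return (policy_distribution, value)."""
    with torch.no_grad():
        # Targets come from the unaugmented observations; no gradient
        # flows through them, matching the paper's use of pi(.|s) and
        # V(s) as fixed targets.
        pi, v = model(obs)
    pi_aug, v_aug = model(aug(obs))
    g_pi = kl_divergence(pi, pi_aug).mean()  # G_pi: policy consistency
    g_v = F.mse_loss(v_aug, v)               # G_V: value consistency
    return alpha_r * (g_pi + g_v)
```

Weighting both terms by a single coefficient mirrors the paper's objective, which subtracts alpha_r * (G_pi + G_V) from the PPO objective.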
The key challenge is the handling of RNN states: the hidden state produced by the augmented forward pass must not be carried forward into subsequent steps (maybe add a test for this?).
Implementation should be possible almost exclusively in the definition of the context and the loss for `rl`. For RNN states, a slight adjustment might be needed in `LatentCore` to allow passing not only the most recent state but a history of states; see the sketch below.
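One way to satisfy both constraints is to feed the augmented observations through the core starting from the hidden states recorded during the unaugmented rollout, and to discard the state the augmented pass produces. A sketch assuming a GRU core; `rnn`, `state_history`, and the indexing are placeholders, not existing framework names:

```python
import torch
import torch.nn as nn

rnn = nn.GRU(input_size=64, hidden_size=128, batch_first=True)

def augmented_outputs(obs_aug, state_history, t0):
    """Run augmented observations from the hidden state recorded at
    step t0 of the unaugmented rollout (all names are placeholders).
    The hidden state produced here is intentionally discarded so it
    can never leak into the ongoing rollout."""
    h0 = state_history[t0].detach()  # shape (num_layers, batch, hidden)
    out, _ = rnn(obs_aug, h0)        # `_` is the new state, thrown away
    return out
```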
Important: one does not have access to the full history of hidden states for every layer of a multi-layer RNN; in that case we need to use / reimplement this by stacking single-layer RNNs!
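This could work along the following lines: a single-layer GRU's output sequence is exactly its hidden-state history, so stacking single-layer modules exposes every layer's state at every timestep. The class below is a hypothetical sketch, not an existing framework component:

```python
import torch
import torch.nn as nn

class StackedGRU(nn.Module):
    """Stack of single-layer GRUs that records the full hidden-state
    history of every layer, unlike a multi-layer nn.GRU, which only
    returns the final state per layer."""

    def __init__(self, input_size, hidden_size, num_layers):
        super().__init__()
        in_sizes = [input_size] + [hidden_size] * (num_layers - 1)
        self.layers = nn.ModuleList(
            nn.GRU(s, hidden_size, batch_first=True) for s in in_sizes
        )

    def forward(self, x, h0=None):
        # h0: optional list of per-layer initial states, each (1, batch, hidden)
        histories = []  # one (batch, time, hidden) tensor per layer
        for i, layer in enumerate(self.layers):
            x, _ = layer(x, None if h0 is None else h0[i])
            histories.append(x)  # per-timestep hidden states of this layer
        return x, histories
```

Note that for an LSTM the same trick only exposes the hidden state h, not the cell state c, so the cell state would need extra bookkeeping.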