Skip to content
This repository was archived by the owner on Oct 31, 2023. It is now read-only.

Latest commit

 

History

History
10 lines (8 loc) · 953 Bytes

README.md

File metadata and controls

10 lines (8 loc) · 953 Bytes

A2C Examples (and Tutorial)

The examples of A2C implementations illustrate the different capabilities of the library and act as a tutorial. To understand all the different aspects, you have to read the examples in the following order:

  • Mono CPU - The simplest implementation
  • Mono CPU 2 - Splitting the agent in three to illustrate the modularity of salina, seperating the critic from the policy.
  • Multi CPUs - Making use of the RemoteAgent to execute over multiple CPUs in parallel
  • Asynchronous Evaluation - Making use of the asynchronous functionnality to evaluate policies in parallel without slowing down the learning
  • GPU - Making use of a GPU for the loss computation
  • Complete - The reference implementation of A2C used for benchmarking. It contains different policies architectures to illustrate how one can define complex policies.