Skip to content
This repository was archived by the owner on Oct 31, 2023. It is now read-only.

Latest commit

 

History

History

complete_with_async_eval

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 

A2C Asynchronous Evaluation

Starting from the multi-CPUs implementation, we explain how we can implement a non blocking evaluation of the policy made on other CPUs

  • It makes use of the RemoteAgent._asynchronous_call function that is a non-blocking function (when num_processes>0).
  • It uses two copies of the policy agents - one for training and one for evaluation
  • Note that, in that case, we can evaluate the policy in deterministic mode