Skip to content

Add reward/loss over time graphs for actor and critic networks #11

Open
@generic-github-user

Description

@generic-github-user

The actor reward graph should display both the predicted loss generated by the critic network (equivalent to the actor optimization loss) and the actual loss once the training episode is complete.

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions