Skip to content

[Bug Report] Inconsistent definition of reward #1341

Open
@LinHungShi

Description

@LinHungShi

Describe the bug

In the step function, reward is defined as SupportsFloat. However, its type is required to be np.floating in
passive_env_checker

Code example

System info

No response

Additional context

No response

Checklist

  • I have checked that there is no similar issue in the repo

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions