Open
Description
Describe the bug
In the step function, reward is defined as SupportsFloat
. However, its type is required to be np.floating
in
passive_env_checker
Code example
System info
No response
Additional context
No response
Checklist
- I have checked that there is no similar issue in the repo