Description
Description of the bug
While using AntMaze_UMaze-v5
alongside a pseudocount exploration algorithm, I noticed that the ant can go through the walls in the maze. Initially, this is a rare occurrence, but since I am training an novelty-based exploration algorithm, the agent is able to recreate the issue with greater reliability over time.
Code example
Here is how I am creating the env:
env = gym.make(
'AntMaze_UMaze-v5',
render_mode='rgb_array',
max_episode_steps=1000,
continuing_task=False
)
Not that this should be important, but for context, I am using the TD3 algorithm and CFN for novelty-based intrinsic rewards.
Versioning
gymnasium_robotics
: 1.3.1
gymnasium
: 1.0.0
python
: 3.9.20
Supporting Evidence
In the attached image, I have plotted the (x, y) coordinates of the ant (according to the states saved in the replay buffer). The color of each point denotes the novelty prediction, but that can be ignored for our purposes. The purple lines show where the walls should be (approximately), and the red circle highlights a trajectory that goes through the wall near the start state and exits near the goal state (which is at (-4, 4)
).
Checklist
- I have checked that there is no similar issue in the repo (required)