Question
rewards from openai/multiagent-particle-envs/simple_spread_v3 is -200~-140
rewards from pettingzoo/mpe/simple_spread_v3 is around -70
When I use the same MAPPO algorithm to test the MPE environments of OpenAI and PettingZoo, the reward in OpenAI converges to a good value, but the same MPE environment in PettingZoo remains around -70 and does not rise.
Question
rewards from openai/multiagent-particle-envs/simple_spread_v3 is -200~-140
rewards from pettingzoo/mpe/simple_spread_v3 is around -70
When I use the same MAPPO algorithm to test the MPE environments of OpenAI and PettingZoo, the reward in OpenAI converges to a good value, but the same MPE environment in PettingZoo remains around -70 and does not rise.