Question
|
agent_obsk: Number of nearest joints to observe, |
|
If set to 0 it only observes local state, |
|
If set to 1 it observes local state + 1 joint over, |
|
If set to 2 it observes local state + 2 joints over, |
|
If it set to None the task becomes single agent (the agent observes the entire environment, and performs all the actions) |
|
The Default value is: 1 |
agent_obsk in the original repo controls only the observation construction, but not the action factorization. I am wondering why implementing agent 0 performs all the actions when agent_obsk is None here.
Question
Gymnasium-Robotics/gymnasium_robotics/envs/multiagent_mujoco/mujoco_multi.py
Lines 93 to 98 in b1acee9
agent_obskin the original repo controls only the observation construction, but not the action factorization. I am wondering why implementing agent 0 performs all the actions whenagent_obskis None here.