DDQN update

In the  paper,《 Continual Deep Reinforcement Learning for Decentralized Satellite Routing》，the formula 11 means that the transmitting node i update the Q-network. However, in the file, SimulationRL, the line 4209 indicates the receiving node j  update  the Q-network. I don not know whether  i understand correctlt or not.And which method shoud i choose?

Thank you!