-
Notifications
You must be signed in to change notification settings - Fork 30
Open
Description
In the paper,《 Continual Deep Reinforcement Learning for Decentralized Satellite Routing》,the formula 11 means that the transmitting node i update the Q-network. However, in the file, SimulationRL, the line 4209 indicates the receiving node j update the Q-network. I don not know whether i understand correctlt or not.And which method shoud i choose?
Thank you!
Metadata
Metadata
Assignees
Labels
No labels