@sbrunk Thank you for this excellent library! I have been trying to re-implement this cart-pole deep q learning using storch and cats-effect. In that article, there is the following python code which initializes the two networks:
self.policy_net = self.build_network(layer_sizes)
self.target_net = self.build_network(layer_sizes)
self.target_net.load_state_dict(self.policy_net.state_dict())
Every so often, as the policy_net gets trained, the target_net needs to be updated.
However, I cannot seem to find a way to access stateDict in storch. Is there a more scala/storch-recommended way to take one network (the target_net in the example above) and load it up so that it initially is equivalent to a different network (the policy_net)?
I noticed that in storch, the loadStateDict method is available, but just cannot figure out what to feed into it. Any help is much appreciated. Thanks!
@sbrunk Thank you for this excellent library! I have been trying to re-implement this cart-pole deep q learning using
storchandcats-effect. In that article, there is the following python code which initializes the two networks:Every so often, as the
policy_netgets trained, thetarget_netneeds to be updated.However, I cannot seem to find a way to access
stateDictinstorch. Is there a more scala/storch-recommended way to take one network (thetarget_netin the example above) and load it up so that it initially is equivalent to a different network (thepolicy_net)?I noticed that in storch, the
loadStateDictmethod is available, but just cannot figure out what to feed into it. Any help is much appreciated. Thanks!