Keep-rate scheduling of DropBlock in a multi-GPU environment

Hello,
I found an issue while trying to train your model.
In your code, the variable ['self.num_batches_tracked'](https://github.com/kjunelee/MetaOptNet/blob/7a8e2ae25ef47cfe75a6fe8bc7920dc9fd29191f/models/ResNet12_embedding.py#L32) should count the progress of the episode by [increasing](https://github.com/kjunelee/MetaOptNet/blob/7a8e2ae25ef47cfe75a6fe8bc7920dc9fd29191f/models/ResNet12_embedding.py#L38) when the model is called.
But in the multi-GPU environment, the modification of the variable in the forward() is ignored because a DataParallel replicates the model into each GPU and the updates are destroyed after forward(). So the variable just moves up and down with 0 and 1.
I think this should be fixed. Thanks :)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Keep-rate scheduling of DropBlock in a multi-GPU environment #41

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Keep-rate scheduling of DropBlock in a multi-GPU environment #41

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions