[fix] some param will be set to optimizer twice when using MLM transformer heads #965
butterluo wants to merge 9 commits into facebookresearch:main
Conversation
When using MMFTransformer with an MLM head, a warning occurs that will become an error in the future:
UserWarning: optimizer contains a parameter group with duplicate parameters; in future, this will cause an error; see github.com/pytorch/pytorch/issues/40967 for more information
This is caused by the get_optimizer_parameters() function, which collects some parameters twice when parameters belonging to the backbone are tied to a head in the head's tie_weights() method (e.g., the MLM head).
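A minimal standalone sketch (not the MMF code itself) of how weight tying makes the same Parameter object appear in two modules, and how deduplicating by identity before building the optimizer avoids the warning; the module names here are illustrative:

```python
import torch
from torch import nn

backbone = nn.Linear(8, 8)
head = nn.Linear(8, 8)
head.weight = backbone.weight  # what a head's tie_weights() effectively does

# Naively concatenating both modules' parameters repeats the tied weight,
# which triggers the UserWarning when handed to an optimizer:
params = list(backbone.parameters()) + list(head.parameters())

# Deduplicate by object identity before building the param groups:
seen, unique_params = set(), []
for p in params:
    if id(p) not in seen:
        seen.add(id(p))
        unique_params.append(p)

optimizer = torch.optim.SGD(unique_params, lr=0.1)  # no duplicate warning
```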
apsdehal
left a comment
Thanks for this change and helping make MMF better. I have left some comments and then this PR should be ready to land.
mmf/models/transformers/base.py
Outdated
```python
def get_optimizer_parameters(self, config):
    lr = config.optimizer.params.lr
    ...
    backbone_param_set = set()
```
Let's name this trunk_params_set.
Thanks for your review. I will change my code according to your suggestion.
mmf/models/transformers/base.py
Outdated
```python
    self, config, module_name, base_lr, module, parameters, param_list, backbone_param_set=None
):
    lr_multiplier = config.get("lr_multiplier", 1.0)
    if backbone_param_set is None:
```
If this is None, make it an empty list, [].
Thanks for your review. I will change my code according to your suggestion.
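A hedged sketch of the default handling the reviewer suggests; the enclosing method's name is not visible in the diff, so `set_lr_for_module` below is a placeholder:

```python
def set_lr_for_module(  # placeholder name; the real method name is not in the diff
    self, config, module_name, base_lr, module, parameters, param_list,
    trunk_params_set=None,
):
    # Default to None and normalize inside the body rather than using a
    # mutable default argument, which Python would share across calls.
    if trunk_params_set is None:
        trunk_params_set = []  # the empty list the reviewer suggests
    lr_multiplier = config.get("lr_multiplier", 1.0)
    ...
```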
mmf/models/transformers/base.py
Outdated
```python
    lr_multiplier = config.get("lr_multiplier", 1.0)
    if backbone_param_set is None:
        module_param = list(module.named_parameters())
    else:
```
Now, you can remove this else condition.
Thanks for your review. I will change my code according to your suggestion.
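With None normalized to an empty container up front, a single code path suffices; a sketch of the shape this comment points at (whether the set holds parameter names or ids is not visible in the diff, so names are assumed here):

```python
if trunk_params_set is None:
    trunk_params_set = []
# One path for both cases: filtering against an empty container is a no-op,
# so the else branch above becomes unnecessary.
module_param = [
    (name, param)
    for name, param in module.named_parameters()
    if name not in trunk_params_set  # skip weights the trunk already owns
]
```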
@apsdehal has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Co-authored-by: Amanpreet Singh <apsdehal@gmail.com>
@butterluo has updated the pull request. You must reimport the pull request before landing.
@butterluo has updated the pull request. You must reimport the pull request before landing.
Hi @apsdehal, does this bug-fix code pass your review? Also, the CI auto-merge seems to be blocked by https://github.com/facebookresearch/mmf/issues/966. If the code has passed your review, how can I merge it into master?
apsdehal
left a comment
It looks good to me. Thanks for making the updates. Can you fix lint issues? I will then work on landing it.
@butterluo has updated the pull request. You must reimport the pull request before landing.
@apsdehal has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@butterluo has updated the pull request. You must reimport the pull request before landing.
@apsdehal I've reformatted the bug-fix code using PyCharm. Does that fix the lint issues?
@butterluo has updated the pull request. You must reimport the pull request before landing.
@butterluo has updated the pull request. You must reimport the pull request before landing.
Summary:
When using MMFTransformer with an MLM head, a warning occurs that will become an error in the future: "UserWarning: optimizer contains a parameter group with duplicate parameters; in future, this will cause an error; see github.com/pytorch/pytorch/issues/40967 for more information"
This is caused by the get_optimizer_parameters() function, which collects some parameters twice when parameters belonging to the backbone are tied to a head in the head's tie_weights() method.
Tested locally
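As one way to verify locally, a small helper (not part of the PR) that asserts no parameter is registered with the optimizer more than once:

```python
def assert_no_duplicate_params(optimizer):
    """Fail if any parameter appears more than once across param groups."""
    seen = set()
    for group in optimizer.param_groups:
        for param in group["params"]:
            assert id(param) not in seen, "parameter registered with optimizer twice"
            seen.add(id(param))
```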