[bnb] Small improvements on utils #18646
Conversation
- replace `modules_to_not_convert` by `module_to_not_convert`
Can confirm the tests pass!
So will there always be just one module not to convert? Wouldn't it be safer to keep `modules` and work with the list?
- changed variable name
- now outputs a list
- changed the error message
I have proposed a small refactoring that includes:
The bnb slow tests are passing with this fix!
From #18660, I also just added a commit to support having a custom list of the keys to ignore.
sgugger left a comment:
Thanks for working on this, I left some comments.
src/transformers/modeling_utils.py (Outdated)

    offload_state_dict = kwargs.pop("offload_state_dict", False)
    load_in_8bit = kwargs.pop("load_in_8bit", False)
    int8_threshold = kwargs.pop("int8_threshold", 6.0)
    no_load_in_8bit_modules = kwargs.pop("no_load_in_8bit_modules", None)
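For readers unfamiliar with the pattern: `kwargs.pop(key, default)` removes the key from the kwargs dict if present and returns the default otherwise, so each optional argument is consumed in turn and whatever remains can be validated later. A standalone sketch (the function name and arguments here are illustrative, not the actual `from_pretrained` signature):

```python
def fake_from_pretrained(**kwargs):
    # Pull known optional arguments out of kwargs, with defaults,
    # mirroring the pattern used in modeling_utils.py.
    load_in_8bit = kwargs.pop("load_in_8bit", False)
    int8_threshold = kwargs.pop("int8_threshold", 6.0)
    # Anything left in kwargs is unrecognized and could be flagged.
    return load_in_8bit, int8_threshold, kwargs

print(fake_from_pretrained(load_in_8bit=True, foo=1))
```

Unconsumed keys (like `foo` above) survive in the returned dict, which is what lets the real code detect unexpected keyword arguments.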
Would it make more sense to have this be a class variable of `PreTrainedModel` (like the no_split variable used for big model inference)? I'm afraid the user won't know what to set this to, and it looks like something we should handle automatically.
I don't have a strong opinion on that, but this argument is optional because the function `get_keys_not_to_convert` should automatically take care of it, except for some models like Jukebox where it is a bit trickier due to its architecture.
In that case the user will just have to manually set which modules should be kept in their native precision and specify them in the kwargs, so I feel this is a bit easier than having it as an attribute of `PreTrainedModel`, because then you would need to open a PR to add support for a new model.
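A minimal, library-agnostic sketch of the filtering this kwarg controls (the helper name and default below are illustrative, not the actual transformers implementation): modules whose qualified name matches an entry in the skip list keep their native precision, and everything else is marked for 8-bit conversion.

```python
def select_modules_to_convert(module_names, modules_to_not_convert=None):
    """Return the subset of module_names that should be converted to 8-bit.

    modules_to_not_convert: optional list of name fragments to keep in
    native precision. Hypothetical helper for illustration only.
    """
    if modules_to_not_convert is None:
        # Common default: keep the output head in fp16/fp32 for stability.
        modules_to_not_convert = ["lm_head"]
    return [
        name
        for name in module_names
        if not any(skip in name for skip in modules_to_not_convert)
    ]

names = ["transformer.h.0.attn.c_attn", "transformer.h.0.mlp.c_fc", "lm_head"]
print(select_modules_to_convert(names))
# lm_head is skipped; the attention and MLP linears are converted
```

Passing a custom list (e.g. for a model like Jukebox) simply replaces the default, which is why an optional kwarg covers the tricky architectures without touching the model class.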
Co-authored-by: stas00 <stas00@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
stas00 left a comment:
thank you for addressing the suggestions, @younesbelkada
Can confirm the slow tests pass after rebasing on
What does this PR do?
Fixes a small typo in `bitsandbytes.py`; should address huggingface/blog#463 (comment). I will have to test it first and then mark it as ready for review!