Bias tensors #1259

Open · gabe-l-hart wants to merge 3 commits into main from BiasTensors-1250
Conversation

@gabe-l-hart (Contributor) commented Oct 3, 2024

Dependencies

This PR is part of a sequence in support of adding Granite Code. It depends on merging the following PRs:

Issues

Closes #1250

Description

This PR adds support for models that have bias tensors for the attention and feed-forward (FFN) modules alongside the primary weight tensors.

Changes

  • Add the bias tensors to the weight_map in HF checkpoint conversion
  • Handle merged wqkv tensors for biases as well as weights in HF checkpoint conversion
    • This includes changes to the permutation logic to support the shapes of the bias tensors, leveraging the corresponding logic in llama.cpp's converter (see the first sketch after this list)
  • Add attention_bias and feed_forward_bias configs to TransformerArgs so that models can indicate the presence of bias tensors
  • Populate the bias arguments of the Attention and FeedForward modules' linear layers based on those config args (see the second sketch below)
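
As a rough illustration of the conversion changes (not the exact code in this PR), here is a minimal sketch of a permute helper in the spirit of llama.cpp's converter, generalized so one reshape handles both 2-D weight tensors and 1-D bias tensors. The helper name, the commented weight_map entries, and the tensor names are assumptions for illustration:

```python
import torch

# Hypothetical bias additions to the HF -> torchchat weight_map
# (tensor names illustrative, mirroring the existing weight entries):
#   "model.layers.{}.self_attn.q_proj.bias": "layers.{}.attention.wq.bias"
#   "model.layers.{}.mlp.gate_proj.bias":    "layers.{}.feed_forward.w1.bias"

def permute(w: torch.Tensor, n_heads: int) -> torch.Tensor:
    # Reorder interleaved rotary (RoPE) dimensions into the split layout
    # the model expects. Splatting w.shape[1:] lets one implementation
    # cover 2-D weights of shape (out_dim, in_dim) and 1-D biases of
    # shape (out_dim,).
    head_dim = w.shape[0] // n_heads
    return (
        w.view(n_heads, 2, head_dim // 2, *w.shape[1:])
        .transpose(1, 2)
        .reshape(w.shape)
    )
```

For merged wqkv checkpoints, the merged weight and its matching bias would each be split into q/k/v chunks along dim 0, with the q and k chunks passed through a helper like this before being written out.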

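Likewise, a minimal sketch of the config plumbing from the last two bullets: the attention_bias and feed_forward_bias field names come from this PR, while the dimensions, defaults, and module internals below are simplified placeholders rather than torchchat's actual definitions:

```python
from dataclasses import dataclass

import torch.nn as nn

@dataclass
class TransformerArgs:
    dim: int = 4096
    hidden_dim: int = 11008
    # New flags: checkpoints that ship bias tensors (e.g. Granite Code)
    # set these to True; the False defaults preserve existing behavior.
    attention_bias: bool = False
    feed_forward_bias: bool = False

class Attention(nn.Module):
    def __init__(self, config: TransformerArgs) -> None:
        super().__init__()
        # Bias on each projection is driven entirely by the config flag,
        # so models without bias tensors are unaffected.
        self.wq = nn.Linear(config.dim, config.dim, bias=config.attention_bias)
        self.wk = nn.Linear(config.dim, config.dim, bias=config.attention_bias)
        self.wv = nn.Linear(config.dim, config.dim, bias=config.attention_bias)
        self.wo = nn.Linear(config.dim, config.dim, bias=config.attention_bias)

class FeedForward(nn.Module):
    def __init__(self, config: TransformerArgs) -> None:
        super().__init__()
        self.w1 = nn.Linear(config.dim, config.hidden_dim, bias=config.feed_forward_bias)
        self.w2 = nn.Linear(config.hidden_dim, config.dim, bias=config.feed_forward_bias)
        self.w3 = nn.Linear(config.dim, config.hidden_dim, bias=config.feed_forward_bias)
```
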
Testing

In conjunction with my other changes for Granite Code, I've validated that this logic produces the expected token sequences.

NOTE: If there's a preferred way to include unit tests with this PR, please let me know and I'll add them! I don't see a familiar unit test structure in the project at this point, so I've been relying on local ad-hoc testing.

pytorch-bot bot commented Oct 3, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1259

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit bbea338 with merge base 766bee9:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label on Oct 3, 2024
@gabe-l-hart force-pushed the BiasTensors-1250 branch 2 times, most recently from 4c08ee4 to 964ae69, on October 4, 2024
@gabe-l-hart marked this pull request as ready for review on October 4, 2024
@gabe-l-hart (Contributor, Author) commented Oct 4, 2024

Thanks for the review/merge on #1255! This PR is now ready for review.

@mikekg commented Oct 4, 2024

Current tests are run through .github/workflows, with some scripts in .ci (including a script that can be used to ensure that code in markdown files works).

Or were you looking for "unit tests" of subcomponents with a Python driver? If so, I don't think we have any Python-level unit tests right now, but that doesn't mean we can't have any. If you want to make a proposal, you might discuss with @byjlw and @Jack-Khuu, and with @lessw2020 for distributed inference.

Labels: CLA Signed
Projects: None yet
Successfully merging this pull request may close: Add support for separate bias tensors (#1250)
3 participants