[Docs] Add Developer Guide: How to Hack Any Transformers Model #33979

MagnusS0 · 2024-10-05T14:35:29Z

What does this PR do?

This PR adds a new developer guide titled "How to Hack Any Transformers Model" to the docs. The guide shows how to modify existing models, using the Segment Anything Model (SAM) as an example. It also encourages community contributions by inviting others to share their own hacks.

Changes

Added a new developer guide at docs/source/en/how_to_hack_models.md
Updated docs/source/en/_toctree.yml to include the new guide in the Developer Guides section.

Fixes #33928

Before submitting

This PR improves the docs.
I have read the contributor guideline.
This was discussed in the GitHub issue Separate q_proj, k_proj, and v_proj for Attention Layers in SAM #33928.

Who can review?

Tagging @ArthurZucker as we discussed this in issue #33928 and the code example was provided by him. Let me know if this is what you had in mind 🤗

ArthurZucker

Nice! I think we can have a separate doc, with sam being the example!
This could be in the same section as https://huggingface.co/docs/transformers/fast_tokenizers (developer guides!)

MagnusS0 · 2024-10-05T14:51:24Z

Ahh, yeah makes sense, then I can extend the example and show how it also can be used e.g. with PEFT! WDYT?

ArthurZucker · 2024-10-05T14:54:31Z

Of course! And if people have example of adding SDPA for example (not here for SAM) or good hacking, will go there! let's call for contribution probably! 🤗

MagnusS0 · 2024-10-05T20:24:40Z

Updated the PR: It now adds a new developer guide titled "How to Hack Any Transformers Model" 🚀
I've also updated the PR description and title accordingly.

Let me know your thoughts! Do you think we should make the guide more general regarding model hacking? Or do you (or anyone else) have any extra examples to add?

docs: add example for separating q, k, v projections in SAM

927da22

ArthurZucker reviewed Oct 5, 2024

View reviewed changes

MagnusS0 added 3 commits October 5, 2024 17:19

Merge branch 'huggingface:main' into SamVisionAttentionSplit

40fa227

docs: How to Hack Any Transformers Model

9676f33

docs: remove changes from sam model docs

5c6e349

MagnusS0 changed the title ~~docs: add example for separating q, k, v projections in SAM attention~~ [Docs] Add Developer Guide: How to Hack Any Transformers Model Oct 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Docs] Add Developer Guide: How to Hack Any Transformers Model #33979

[Docs] Add Developer Guide: How to Hack Any Transformers Model #33979

MagnusS0 commented Oct 5, 2024 •

edited

Loading

ArthurZucker left a comment

MagnusS0 commented Oct 5, 2024

ArthurZucker commented Oct 5, 2024

MagnusS0 commented Oct 5, 2024

[Docs] Add Developer Guide: How to Hack Any Transformers Model #33979

Are you sure you want to change the base?

[Docs] Add Developer Guide: How to Hack Any Transformers Model #33979

Conversation

MagnusS0 commented Oct 5, 2024 • edited Loading

What does this PR do?

Changes

Before submitting

Who can review?

ArthurZucker left a comment

Choose a reason for hiding this comment

MagnusS0 commented Oct 5, 2024

ArthurZucker commented Oct 5, 2024

MagnusS0 commented Oct 5, 2024

MagnusS0 commented Oct 5, 2024 •

edited

Loading