Adding three different weight-freezing approaches in an example script under `notebooks/embedding_wrapper.py`.

The first wrapper uses a mask and ended up being equivalent to the code that Chris already had set up.
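For reference, a minimal sketch of the mask pattern (the class name `MaskedEmbedding` and the `frozen_ids` argument are illustrative, not the actual contents of `notebooks/embedding_wrapper.py`): frozen rows are routed through a detached copy of the weight, so autograd zeroes their gradients automatically.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MaskedEmbedding(nn.Module):
    """Embedding in which the rows listed in `frozen_ids` receive no gradient."""

    def __init__(self, num_embeddings: int, embedding_dim: int, frozen_ids: torch.Tensor):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(num_embeddings, embedding_dim))
        mask = torch.ones(num_embeddings, 1)
        mask[frozen_ids] = 0.0
        self.register_buffer("mask", mask)  # 1.0 = trainable row, 0.0 = frozen row

    def forward(self, input_ids: torch.Tensor) -> torch.Tensor:
        # Frozen rows come from the detached weight, so their grads stay zero.
        weight = self.weight * self.mask + self.weight.detach() * (1.0 - self.mask)
        return F.embedding(input_ids, weight)
```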
The second wrapper uses a hook. Hooks are only recommended for debugging, so this one is ultimately not useful for us. It did help me spot an error in the other code, though, so I think the pattern is useful and I'm keeping it in for reference.
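A sketch of the hook variant, assuming a plain dense-gradient `nn.Embedding` (the helper name is made up): the tensor hook fires during backward, which is exactly what makes it convenient for inspecting gradients while debugging.

```python
import torch
import torch.nn as nn

def freeze_rows_with_hook(embedding: nn.Embedding, frozen_ids: torch.Tensor):
    """Zero the gradient of selected embedding rows via a tensor hook."""
    mask = torch.ones(embedding.num_embeddings, 1, device=embedding.weight.device)
    mask[frozen_ids] = 0.0

    def hook(grad: torch.Tensor) -> torch.Tensor:
        # A good place to print / inspect `grad` while debugging.
        return grad * mask  # returning a tensor replaces the gradient

    return embedding.weight.register_hook(hook)  # keep the handle to .remove() later
```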
The third wrapper uses a custom autograd function. That one may be interesting, since it's more low-level than the masking code.
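A sketch of what such a custom autograd function can look like (again, names are illustrative): the forward pass does a plain lookup, and the backward pass scatters the incoming gradient onto the weight rows itself, masking out the frozen ones.

```python
import torch
import torch.nn.functional as F

class PartiallyFrozenEmbeddingFn(torch.autograd.Function):
    """Embedding lookup whose backward zeroes the gradient of frozen rows."""

    @staticmethod
    def forward(ctx, input_ids, weight, mask):
        ctx.save_for_backward(input_ids, mask)
        ctx.num_embeddings = weight.shape[0]
        return F.embedding(input_ids, weight)

    @staticmethod
    def backward(ctx, grad_output):
        input_ids, mask = ctx.saved_tensors
        # Accumulate the output gradient onto the corresponding weight rows...
        grad_weight = torch.zeros(
            ctx.num_embeddings, grad_output.shape[-1],
            dtype=grad_output.dtype, device=grad_output.device,
        )
        grad_weight.index_add_(
            0, input_ids.reshape(-1), grad_output.reshape(-1, grad_output.shape[-1])
        )
        # ...then zero out the frozen rows. No grads for input_ids or mask.
        return None, grad_weight * mask, None

# Usage: out = PartiallyFrozenEmbeddingFn.apply(input_ids, weight, mask)
```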
Finally, the script contains a test setup that doesn't pull in HF / transformers / etc. That's convenient for quick debugging.
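Something along these lines (a hypothetical smoke test reusing the `MaskedEmbedding` sketch from above, not the actual test in the script): train a few steps on random ids and assert that only the unfrozen rows moved.

```python
import torch

def smoke_test():
    torch.manual_seed(0)
    frozen_ids = torch.tensor([0, 3])
    emb = MaskedEmbedding(num_embeddings=10, embedding_dim=4, frozen_ids=frozen_ids)
    before = emb.weight.detach().clone()

    opt = torch.optim.SGD(emb.parameters(), lr=0.1)
    for _ in range(5):
        ids = torch.randint(0, 10, (8,))
        loss = emb(ids).pow(2).sum()  # dummy loss, no transformers needed
        opt.zero_grad()
        loss.backward()
        opt.step()

    after = emb.weight.detach()
    assert torch.allclose(before[frozen_ids], after[frozen_ids])  # frozen rows untouched
    assert not torch.allclose(before, after)                      # ...but training happened

if __name__ == "__main__":
    smoke_test()
```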