add conv1d layer type (used in gpt2) by zywilliamli · Pull Request #50 · EleutherAI/bergson

zywilliamli · 2025-10-16T03:36:25Z

GPT-2-style models use transformers.pytorch_utils.Conv1D instead of nn.Linear, and the gradient collector does not currently track that layer type. This PR adds support for it.

Tested manually with python -m bergson runs/test --model openai-community/gpt2 --dataset NeelNanda/pile-10k --truncation, and added unit tests for the new layer type.
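For context on why Conv1D needs its own handling: it computes the same affine map as nn.Linear but stores its weight transposed, as (in_features, out_features) rather than (out_features, in_features). The sketch below shows that difference in pure Python, with nested lists standing in for tensors. The function names are illustrative only and are not taken from bergson.

```python
# Pure-Python sketch of the two weight layouts; nested lists stand in for tensors.
# All function names here are hypothetical and exist only for illustration.


def linear_forward(x, weight, bias):
    """nn.Linear convention: weight is (out_features, in_features), y = x @ W^T + b."""
    return [
        sum(xi * wi for xi, wi in zip(x, row)) + b
        for row, b in zip(weight, bias)
    ]


def conv1d_forward(x, weight, bias):
    """Conv1D convention: weight is (in_features, out_features), y = x @ W + b."""
    return [
        sum(x[i] * weight[i][j] for i in range(len(x))) + bias[j]
        for j in range(len(bias))
    ]


def transpose(matrix):
    """Swap rows and columns of a nested-list matrix."""
    return [list(col) for col in zip(*matrix)]


x = [1.0, 2.0, 3.0]                       # in_features = 3
linear_weight = [[1.0, 0.0, 2.0],
                 [0.5, 1.0, 0.0]]         # shape (out=2, in=3)
bias = [0.1, -0.1]
conv1d_weight = transpose(linear_weight)  # shape (in=3, out=2)

y_linear = linear_forward(x, linear_weight, bias)
y_conv1d = conv1d_forward(x, conv1d_weight, bias)
```

Both calls produce the same output, so any code that reads in_features or out_features from the weight shape has to know which layout it is looking at.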

luciaquirke · 2025-10-16T04:58:58Z

bergson/gradients.py

        """Process the incoming gradient wrt the output of the module."""
        # Sanity checks
-        assert isinstance(module, nn.Linear), "Expected a Linear module"
+        assert isinstance(module, LayerAdapter.supported_modules), "Expected a supported module"


Could we print the supported modules here?
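One way the suggestion could look, as a minimal sketch: build the assertion message from the names in LayerAdapter.supported_modules so a failure reports both what is accepted and what was received. The stand-in classes, the contents of supported_modules, and the helper name check_supported are assumptions made for illustration. The actual bergson code may differ.

```python
# Stand-in classes so the sketch runs without torch or transformers installed.
class Linear:      # stand-in for torch.nn.Linear
    pass


class Conv1D:      # stand-in for transformers.pytorch_utils.Conv1D
    pass


class Embedding:   # stand-in for a layer type that is not supported
    pass


class LayerAdapter:
    # Assumed contents; the PR only shows that this attribute exists.
    supported_modules = (Linear, Conv1D)


def check_supported(module):
    """Hypothetical helper: assert with a message that lists the supported types."""
    names = ", ".join(cls.__name__ for cls in LayerAdapter.supported_modules)
    assert isinstance(module, LayerAdapter.supported_modules), (
        f"Expected one of ({names}), got {type(module).__name__}"
    )
```

Calling check_supported(Embedding()) would then fail with a message naming Linear and Conv1D as the accepted types and Embedding as the offending one.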

CLAassistant · 2025-10-16T04:59:16Z

All committers have signed the CLA.

luciaquirke

LGTM 🚀

add conv1d for gpt2

2e7e7ad

luciaquirke reviewed Oct 16, 2025

zywilliamli added 4 commits October 16, 2025 03:44

comments

56c6745

feat: add conv1d support for gpt2

a2b1c09

refactor

b4c52f2

refactor

1e9ce50

luciaquirke reviewed Oct 16, 2025

rename

d06a36b

zywilliamli force-pushed the main branch from 8abd3b8 to d06a36b on October 16, 2025 05:12

test, comment

7775520

luciaquirke approved these changes Oct 16, 2025

zywilliamli merged commit 0e1b245 into EleutherAI:main Oct 16, 2025
2 checks passed

zywilliamli merged 7 commits into EleutherAI:main from zywilliamli:main
