@smarter commented on Jan 22, 2026

Previously, token_batch_size was capped by max_position_embeddings, but that value is the maximum length of a single sequence, and a batch can contain multiple sequences. As far as I can tell, the real limit is the tokenizer's default model_max_length (some models have model_max_length == max_position_embeddings, which invites this confusion).
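
For concreteness, here is a minimal sketch of the changed bound, assuming a Hugging Face-style tokenizer and config; the helper name `check_token_batch_size` is hypothetical, not bergson's actual API:

```python
from transformers import PretrainedConfig, PreTrainedTokenizerBase


def check_token_batch_size(
    token_batch_size: int,
    tokenizer: PreTrainedTokenizerBase,
    config: PretrainedConfig,
) -> None:
    """Hypothetical helper illustrating the bound this PR changes."""
    # Old, too-strict bound: max_position_embeddings caps a *single*
    # sequence, but a batch packs several sequences together.
    #     assert token_batch_size <= config.max_position_embeddings
    # New bound: the tokenizer's default model_max_length is, as far as
    # we can tell, the real limit on the batch.
    if token_batch_size > tokenizer.model_max_length:
        raise ValueError(
            f"token_batch_size ({token_batch_size}) exceeds the "
            f"tokenizer's model_max_length ({tokenizer.model_max_length})"
        )
```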

We also make the truncation logic more explicit by passing a max_length parameter instead of mutating model_max_length.
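
A sketch of the corresponding truncation change, with illustrative names (`tokenize_truncated`, `texts`); only the tokenizer call pattern is the point:

```python
from transformers import BatchEncoding, PreTrainedTokenizerBase


def tokenize_truncated(
    tokenizer: PreTrainedTokenizerBase,
    texts: list[str],
    max_length: int,
) -> BatchEncoding:
    # Before this PR (implicit): mutate the tokenizer, then rely on its
    # internal default when truncating.
    #     tokenizer.model_max_length = max_length
    #     return tokenizer(texts, truncation=True)
    # After (explicit): pass the cap at the call site and leave the
    # tokenizer object untouched.
    return tokenizer(texts, truncation=True, max_length=max_length)
```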

@smarter force-pushed the tbs branch 4 times, most recently from 8942c43 to f677706 on January 22, 2026 at 19:04
With a recent PyTorch we get:

bergson/process_preconditioners.py:59:68 - error: Argument of type "ProcessGroup | Unknown | int" cannot be assigned to parameter "group" of type "ProcessGroup | None"
         Type "ProcessGroup | Unknown | int" is not assignable to type "ProcessGroup | None"
           Type "int" is not assignable to type "ProcessGroup | None"
             "int" is not assignable to "ProcessGroup"
             "int" is not assignable to "None" (reportArgumentType)

The non-ProcessGroup case should only happen when the `ranks` argument is passed
explicitly to `new_group`.
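
A minimal sketch of the kind of narrowing that satisfies pyright here, assuming the group is created without an explicit `ranks` list; the surrounding function is illustrative, not the actual bergson/process_preconditioners.py code:

```python
import torch
import torch.distributed as dist


def all_reduce_mean(tensor: torch.Tensor) -> torch.Tensor:
    # new_group is annotated as returning ProcessGroup | int, but the
    # int case only arises when `ranks` is passed explicitly and the
    # current rank is excluded. Assert the invariant so the checker
    # narrows the type to ProcessGroup.
    group = dist.new_group()
    assert isinstance(group, dist.ProcessGroup)
    dist.all_reduce(tensor, group=group)
    tensor /= dist.get_world_size(group=group)
    return tensor
```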

Also upgrade pyright in the CI, although the error above is unrelated to the version bump.