@brendandahl
Collaborator

Implement a caching mechanism using a JSON file to track the state of downloaded models. This avoids redundant downloads during prebuild if the model configuration has not changed.

A --force flag is also added to allow users to manually bypass the cache and re-download all models when necessary.
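The idea described above could be sketched roughly as follows. This is not the code from this PR: the cache layout (a JSON object mapping a model id to a hash of its config entry) and the helper names `configHash`, `readCache`, `modelsToDownload`, and `recordDownloads` are assumptions made for illustration only.

```javascript
const crypto = require('node:crypto');
const fs = require('node:fs');

// Hypothetical helper: fingerprint one model config entry so that any change
// to it is detectable. Assumes entries are plain JSON-serializable objects
// whose key order is stable between runs.
function configHash(config) {
  return crypto.createHash('sha256').update(JSON.stringify(config)).digest('hex');
}

// Load the cache file. A missing or unparsable file is treated as an empty cache.
function readCache(cachePath) {
  try {
    return JSON.parse(fs.readFileSync(cachePath, 'utf8'));
  } catch {
    return {};
  }
}

// Return the models that still need downloading. With `force` set, the cache
// is ignored and every model is returned.
function modelsToDownload(models, cache, force) {
  if (force) return models;
  return models.filter((model) => cache[model.id] !== configHash(model));
}

// Record the models that were just downloaded and write the cache back to disk.
function recordDownloads(cachePath, cache, models) {
  const next = { ...cache };
  for (const model of models) next[model.id] = configHash(model);
  fs.writeFileSync(cachePath, JSON.stringify(next, null, 2));
  return next;
}
```

In a prebuild script, the flag could be read with `process.argv.includes('--force')` and passed as the `force` argument, so that an unchanged configuration results in no downloads unless the flag is given.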

@brendandahl brendandahl force-pushed the cache-transformer-models branch from 0d4fab1 to 0fe9bfe Compare January 23, 2026 20:14
@brendandahl brendandahl requested a review from rmahdav January 23, 2026 22:38
Collaborator

@rmahdav rmahdav left a comment

LGTM with a nit. Thanks!

},
{
class: 'AutoTokenizer',
dtype: 'bnb4',
Collaborator

nit: I don't think AutoTokenizer has dtype.

@brendandahl brendandahl merged commit 571fba8 into GoogleChrome:main Jan 27, 2026
2 checks passed