
Remove excess no-op kv cache reshardings in llm benchmark now that output shardings are preserved #827

Open
jameszianxuTT wants to merge 1 commit into main from jzx/no_reshard_generate


@jameszianxuTT commented Jan 22, 2026

The explicit kv cache reshardings were a workaround for an issue that is now fixed in tt-xla by tenstorrent/tt-xla#2731.
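As a rough illustration of why the reshardings become no-ops, here is a minimal pure-Python sketch. It is not the benchmark's code and does not use tt-xla: `ShardedTensor`, `mark_sharding`, `decode_step`, `KV_SPEC`, and `count_effective_reshards` are all hypothetical stand-ins, and the partition spec is an assumed example. It only models the idea that re-annotating a cache whose output sharding was already preserved changes nothing.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Hypothetical stand-ins for illustration only; none of these names come
# from tt-xla or from the llm benchmark.
PartitionSpec = Tuple[Optional[str], ...]


@dataclass
class ShardedTensor:
    name: str
    spec: Optional[PartitionSpec]  # None models a lost sharding annotation


def mark_sharding(t: ShardedTensor, spec: PartitionSpec) -> bool:
    """Annotate `t` with `spec`; return False when the call is a no-op."""
    if t.spec == spec:
        return False
    t.spec = spec
    return True


def decode_step(cache: ShardedTensor, preserve_output_sharding: bool) -> ShardedTensor:
    """Toy decode step returning the updated cache as a new output tensor."""
    out_spec = cache.spec if preserve_output_sharding else None
    return ShardedTensor(cache.name, out_spec)


# Assumed example layout: shard the second axis over a mesh axis named "model".
KV_SPEC: PartitionSpec = (None, "model", None, None)


def count_effective_reshards(steps: int, preserve_output_sharding: bool) -> int:
    """Count how many per-step re-annotations actually change the cache."""
    cache = ShardedTensor("key_cache", KV_SPEC)
    effective = 0
    for _ in range(steps):
        cache = decode_step(cache, preserve_output_sharding)
        # The workaround: re-apply the kv cache sharding after every step.
        if mark_sharding(cache, KV_SPEC):
            effective += 1
    return effective
```

In this toy model, every re-annotation does real work while output shardings are dropped, and none does once they are preserved, which is the situation in which the per-step calls can be removed.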

@jameszianxuTT changed the title from "Remove excess no-op reshardings now that output shardings are preserved" to "Remove excess no-op kv cache reshardings in llm benchmark now that output shardings are preserved" on Jan 22, 2026
@jameszianxuTT (Contributor, Author) commented

TBD: this still needs to handle the static cache persistence case brought up by Aleks, i.e. the case where the same dirty static cache is reused.
