Skip to content

V1 support for remaining sampling parameters#307

Merged
tchedaTT merged 30 commits intodevfrom
tcheda/v1_sampling_generality
Feb 13, 2026
Merged

V1 support for remaining sampling parameters#307
tchedaTT merged 30 commits intodevfrom
tcheda/v1_sampling_generality

Conversation

@tchedaTT
Copy link

@tchedaTT tchedaTT commented Jan 30, 2026

Purpose

Implement the changes from #301

Custom logits processors are not currently supported - this is a limitation of the v1 version we branched from. More changes possibly required once we rebase onto a version of v1 that supports them

Test Plan

Added tests covering these sampling features, testing across DP/non DP correct so far. CI pipeline to run them across all machine/model combos WIP.

@tchedaTT tchedaTT force-pushed the tcheda/v1_sampling_generality branch from 73d1015 to f689d08 Compare February 5, 2026 04:41
@tchedaTT
Copy link
Author

@tchedaTT tchedaTT force-pushed the tcheda/v1_sampling_generality branch from 2601742 to 04f47df Compare February 13, 2026 16:55
@tchedaTT tchedaTT force-pushed the tcheda/v1_sampling_generality branch from 04f47df to d441413 Compare February 13, 2026 16:57
@tchedaTT
Copy link
Author

previous run had multiple unrelated failures, new one: https://github.com/tenstorrent/tt-metal/actions/runs/21996024453

model_input)
sampled_token_ids = sampled_token_ids_per_dp[0]
logprobs_tensors = logprobs_per_dp[0] if logprobs_per_dp else None
logprobs = logprobs_tensors.tolists() if logprobs_tensors else None
Copy link
Collaborator

@skhorasganiTT skhorasganiTT Feb 13, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is the tolists() needed here?

Copy link
Author

@tchedaTT tchedaTT Feb 13, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is here to match the conversion done in the DP path, which in turn is there to match what the gpu_model_runner does. It might be cleaner to do it in get_output_tokens and investigate whether it can be removed, but I think that's best handled in a separate PR.

@tchedaTT tchedaTT merged commit fdb4104 into dev Feb 13, 2026
1 of 2 checks passed
@tchedaTT tchedaTT deleted the tcheda/v1_sampling_generality branch February 13, 2026 23:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants