[Model][Last/N] Improve all pooling task | Generate runner supports using embed and token_embed tasks. #30672
base: main
Signed-off-by: wang.yuqi <[email protected]>
Force-pushed from 096de98 to 0346166.
Code Review
This pull request updates a test for pooling tasks to align with recent improvements, including support for returning prompt hidden states during generation. The test now uses the 'generate' runner and includes a new case to verify text generation with prefix caching.
My main feedback is that the new test case, while intended to verify the return of prompt hidden states, lacks assertions to confirm their presence and correctness. I've suggested adding these assertions to ensure the feature is properly tested.
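A minimal sketch of the kind of assertions suggested above, assuming the prompt hidden states come back as a 2-D array-like with one row per prompt token. The helper name `assert_prompt_hidden_states` and its parameters are hypothetical and not part of vLLM's API; the real test would read the hidden states from whichever output field this PR exposes.

```python
import math
from typing import Sequence


def assert_prompt_hidden_states(
    hidden_states: Sequence[Sequence[float]],
    num_prompt_tokens: int,
    hidden_size: int,
) -> None:
    """Hypothetical helper: check presence, shape, and finiteness."""
    # Presence: the field must be populated at all.
    assert hidden_states is not None, "prompt hidden states missing"
    # Shape: exactly one vector per prompt token.
    assert len(hidden_states) == num_prompt_tokens
    for row in hidden_states:
        assert len(row) == hidden_size
        # Smoke check on values: no NaN or inf entries.
        assert all(math.isfinite(x) for x in row)
```

This only covers presence and shape plus a finiteness check. Checking correctness would additionally require comparing against a reference, for example the same prompt run through the pooling path.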
Force-pushed from 47058b2 to 63038d5.
Force-pushed from 63038d5 to 64da7a6.
Are we planning to implement generate runner support for the embed and token_embed tasks? This very dirty fix makes tests/models/language/pooling/test_extract_hidden_states.py pass, but generation requests and pooling requests cannot be batched together.
I think this limitation is OK if we can alternate between generative and pooling batches.
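To illustrate what alternating could look like, here is a toy sketch that splits a mixed request stream into homogeneous batches and takes turns between the two kinds. It is not vLLM's scheduler; the function `alternate_batches`, the `(request_id, kind)` tuple shape, and the `max_batch_size` parameter are all assumptions made up for this example.

```python
from collections import deque
from typing import Deque, Dict, List, Tuple

# Hypothetical request shape: (request_id, kind), kind is "generate" or "pooling".
Request = Tuple[str, str]


def alternate_batches(
    requests: List[Request], max_batch_size: int
) -> List[List[Request]]:
    """Form single-kind batches, alternating between the two kinds."""
    queues: Dict[str, Deque[Request]] = {
        "generate": deque(),
        "pooling": deque(),
    }
    for req in requests:
        queues[req[1]].append(req)  # raises KeyError on an unknown kind

    batches: List[List[Request]] = []
    kind = "generate"
    while queues["generate"] or queues["pooling"]:
        # If the current kind has nothing queued, serve the other kind.
        if not queues[kind]:
            kind = "pooling" if kind == "generate" else "generate"
        take = min(max_batch_size, len(queues[kind]))
        batches.append([queues[kind].popleft() for _ in range(take)])
        # Switch kinds for the next batch.
        kind = "pooling" if kind == "generate" else "generate"
    return batches
```

Every batch contains only one kind of request, so the two kinds are never mixed within a step, at the cost of smaller batches when the traffic is unevenly split.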
I will refine this PR further if we decide to implement “Generate runner support using embed and token_embed tasks”.
Hello @breakices, PTAL: #24288 (comment). token_embed can be used to extract prompt hidden states.
Could "generate runner support with embed and token_embed tasks", as a form of "Returning Prompt Hidden States", help with RLVR?
Hello @charlotte12l, a long time has passed since my last comment, #24288 (comment), and I'm finally about to implement it. Could "generate runner support with embed and token_embed tasks", as a form of "Returning Prompt Hidden States", help with your use case?
Improve all pooling task
--convert reward, use --convert embed instead. #30463
These PRs mostly conflict with each other, so combining them into a series better informs reviewers about what has happened and what still needs to be done afterwards.
Purpose
Add generate runner support for the embed and token_embed tasks, and conclude the "Improve all pooling task" series.
FIX #11905
FIX #24288
FIX #6165
FIX #4435
Test Plan
tests/models/language/pooling/test_extract_hidden_states.py
Test Result
pass
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.