Add pod_type categorical feature to latency prediction models#1993
k8s-ci-robot merged 1 commit into kubernetes-sigs:main
f2b600b to d9556ef
/ok-to-test
4497584 to efcc534
@RishabhSaini this looks good. Can you update the test_dual_server outputs here?
efcc534 to c52bde2
to both TTFT and TPOT prediction models
- Add pod_type field to PredictionRequest and TrainingEntry models
- Encode pod_type as categorical in _prepare_features_with_interaction
- Handle pod_type_cat in both TTFT and TPOT feature columns
- One-hot encode pod_type_cat for Bayesian Ridge models
- Add pod_type to XGBoost/LightGBM feature orders with monotone constraints
- Add comprehensive tests for pod_type functionality
- Update Go types to include PodType field
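The encoding steps listed in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the names `POD_TYPE_CATEGORIES`, `encode_pod_type`, and `one_hot_pod_type` are hypothetical, and both the integer order of the categories and the fallback of unknown values to monolithic are assumptions. Only the three values ('' for monolithic, 'prefill', 'decode') come from the PR.

```python
from typing import Dict, List, Optional

# The three values come from the PR description; their order (and therefore
# their integer codes) is an assumption made for this sketch.
POD_TYPE_CATEGORIES: List[str] = ["", "prefill", "decode"]
POD_TYPE_TO_CODE: Dict[str, int] = {
    name: code for code, name in enumerate(POD_TYPE_CATEGORIES)
}


def encode_pod_type(pod_type: Optional[str]) -> int:
    """Map a pod_type string to an integer code (a stand-in for pod_type_cat).

    Assumption: missing or unrecognized values are treated as monolithic (0).
    """
    normalized = (pod_type or "").strip().lower()
    return POD_TYPE_TO_CODE.get(normalized, 0)


def one_hot_pod_type(code: int) -> List[float]:
    """Expand an integer code into indicator columns.

    A linear model such as Bayesian Ridge would otherwise read the integer
    code as an ordered quantity, so each category gets its own column.
    """
    return [1.0 if code == i else 0.0 for i in range(len(POD_TYPE_CATEGORIES))]
```

Tree-based models can typically consume the integer code directly as a categorical column, while the one-hot form is what a linear model would need; that split mirrors the distinction the commit message draws between XGBoost/LightGBM and Bayesian Ridge.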
c52bde2 to 1fd3277
/lgtm
/approve
cc maintainers: @danehans @kfswain @ahg-g @nirrozenbaum for approval
/approve
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: kaushikmitr, kfswain, RishabhSaini
Add pod_type support for prefill-decode (PD) disaggregated serving:
- pod_type categorical feature (''=monolithic, 'prefill', 'decode') to both TTFT and TPOT prediction models
- pod_type_cat encoding in feature preparation with proper categorical handling

Resolves: #1923
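To illustrate how a request might carry the new field, here is a hedged sketch using a plain dataclass. The class name `PredictionRequestSketch` and the fields `kv_cache_percentage` and `input_token_length` are hypothetical placeholders, and defaulting `pod_type` to '' (so that monolithic callers need no changes) is an assumption rather than something this PR page confirms.

```python
from dataclasses import dataclass

# Allowed values as listed in the PR description: '' means monolithic.
VALID_POD_TYPES = ("", "prefill", "decode")


@dataclass
class PredictionRequestSketch:
    """Hypothetical stand-in for a prediction request; not the PR's model."""

    kv_cache_percentage: float  # placeholder feature, assumed for illustration
    input_token_length: int     # placeholder feature, assumed for illustration
    pod_type: str = ""          # assumed default: monolithic

    def __post_init__(self) -> None:
        # Reject values outside the three categories named in the PR.
        if self.pod_type not in VALID_POD_TYPES:
            raise ValueError(
                f"pod_type must be one of {VALID_POD_TYPES}, got {self.pod_type!r}"
            )
```

With a default like this, `PredictionRequestSketch(0.4, 128)` would describe a monolithic pod, and `PredictionRequestSketch(0.4, 128, pod_type="prefill")` a prefill pod in a PD-disaggregated deployment.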
Links to related PRs:
llm-d/llm-d#596
llm-d/llm-d-inference-scheduler#564