Skip to content

Pull requests: kubernetes-sigs/inference-perf

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Support setting custom y-axis limits optionally cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.
#268 opened Nov 3, 2025 by Shuwen-Fang Loading…
fix: custom tokenizer truncates inputs to model max input length cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/bug Categorizes issue or PR as related to a bug. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
#266 opened Oct 30, 2025 by changminbark Loading…
publish-on-change workflow should use helm client login instead of docker login cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
#264 opened Oct 30, 2025 by Bslabe123 Loading…
Loadgen concurrent load type cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/feature Categorizes issue or PR as related to a new feature. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#263 opened Oct 30, 2025 by changminbark Loading…
Feat: Add user session to support Multi-turn chat (#179) cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#257 opened Oct 22, 2025 by huaxig Loading…
feat: Improve client perf and error handling cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
#247 opened Oct 7, 2025 by LukeAVanDrie Loading…
refactor: Make base client concrete and usable cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.
#246 opened Oct 7, 2025 by LukeAVanDrie Loading…
[WIP] Add Kubecon Demo results for Llama 3.1 8b cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
#224 opened Sep 16, 2025 by Bslabe123 Draft
Trace load gen cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#198 opened Aug 22, 2025 by aish1331 Loading…
ProTip! Updated in the last three days: updated:>2025-11-01.