generated from kubernetes/kubernetes-template-project
    
        
 
Pull requests: kubernetes-sigs/inference-perf
Support setting custom y-axis limits optionally
  cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.)
  size/S (Denotes a PR that changes 10-29 lines, ignoring generated files.)
  #268 opened Nov 3, 2025 by Shuwen-Fang
    
fix: custom tokenizer truncates inputs to model max input length
  cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.)
  kind/bug (Categorizes issue or PR as related to a bug.)
  size/XS (Denotes a PR that changes 0-9 lines, ignoring generated files.)
  #266 opened Oct 30, 2025 by changminbark
      
publish-on-change workflow should use helm client login instead of docker login
  cncf-cla: yes
  #264 opened Oct 30, 2025 by Bslabe123
      
    
Loadgen concurrent load type
  cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.)
  kind/feature (Categorizes issue or PR as related to a new feature.)
  size/L (Denotes a PR that changes 100-499 lines, ignoring generated files.)
  #263 opened Oct 30, 2025 by changminbark
      
    
Feat: Add user session to support Multi-turn chat (#179)
  cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.)
  size/L (Denotes a PR that changes 100-499 lines, ignoring generated files.)
  #257 opened Oct 22, 2025 by huaxig
      
    
feat: Improve client perf and error handling
  cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.)
  lgtm ("Looks good to me", indicates that a PR is ready to be merged.)
  needs-rebase (Indicates a PR cannot be merged because it has merge conflicts with HEAD.)
  size/M (Denotes a PR that changes 30-99 lines, ignoring generated files.)
  #247 opened Oct 7, 2025 by LukeAVanDrie
      
    
refactor: Make base client concrete and usable
  cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.)
  size/S (Denotes a PR that changes 10-29 lines, ignoring generated files.)
  #246 opened Oct 7, 2025 by LukeAVanDrie
      
    
[WIP] Add Kubecon Demo results for Llama 3.1 8b
  cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.)
  do-not-merge/work-in-progress (Indicates that a PR should not merge because it is a work in progress.)
  size/XL (Denotes a PR that changes 500-999 lines, ignoring generated files.)
    
      
  
Trace load gen
  cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.)
  size/L (Denotes a PR that changes 100-499 lines, ignoring generated files.)
  #198 opened Aug 22, 2025 by aish1331
    
  