Labels
65 labels
- Accepted LLM API contract change that is backwards-incompatible
- Accepted LLM API contract change that is backwards-compatible
- <NV> AutoDeploy Backend
- <NV> AutoDeploy Backend: Dashboard related
- <NV> Tag for issues that are blocking AutoDeploy standalone repo
- When the PR breaks backwards compatibility
- Something isn't working
- Applies to cherry-pick PRs
- Help or insights needed from the community
- PRs initiated by the community
- <NV> Specialized/modified CUDA kernels in TRTLLM for LLM ops, beyond standard TRT. Dev & perf.
- <NV> Token sampling algorithms in TRTLLM for text gen (top-k, top-p, beam).
- Pull requests that update a dependency file
- <NV> Deploying with separated, distributed components (params, kv-cache, compute). Arch & perf.
- <NV> TRTLLM's textual/illustrative materials: API refs, guides, tutorials. Improvement & clarity.
- This issue or pull request already exists
- Suggestions for improving TRTLLM ease of use, or complaints about it
- New feature or request, including support for new models, dtypes, or functionality
- <NV> Frontend of the LLM workflow
- Label for getting Slack notifications via the GitHub Slack app
- <NV> Broad performance issues not specific to a particular component
- Extra attention is needed
- <NV> General operational aspects of TRTLLM execution not in other categories.
- <NV> Automated tests, build checks, GitHub Actions, system stability & efficiency.
- Setting up and building TRTLLM: compilation, pip install, dependencies, env config, CMake.
- KV-cache management for efficient LLM inference