Commit 05788ad
* Add vLLM app to community catalog
Adds vLLM (high-performance LLM inference server with OpenAI-compatible API)
as a TrueNAS community app, addressing issue #4631.
Features:
- NVIDIA CUDA and AMD ROCm image selector
- GPU passthrough via definitions/gpu_configuration
- IPC host mode for PyTorch shared memory
- Model name as command argument (--model)
- HuggingFace token for gated models
- HF_HOME=/data for model cache storage
- HTTP healthcheck on /health endpoint
- ROCm-specific SYS_PTRACE capability and seccomp:unconfined
- Port 30422 (next free port in catalog)
- 8192 MB default memory (vLLM is memory-hungry)
Closes #4631
* Fix vLLM container crash: add HOME env var for UID 568
* Fix vLLM CI: add USER env var and vendored library
* Fix vLLM CI: CPU image on gh runners
* Add --port flag to vLLM command so it listens on the configured port inside the container (default 8000 was not mapped)
* Fix vLLM CI: give vllm more time to start in poor env
* add image to renovate-bot list
* no need for custom versioning, its semver
* update lib
* meta
* not really needed, unless another container had a depends on chain against it.
* huh. weird
* svg
---------
Co-authored-by: blazej karabin <blazej@cynergybank.co.uk>
Co-authored-by: Stavros Kois <47820033+stavros-k@users.noreply.github.com>
Co-authored-by: Stavros Kois <s.kois@outlook.com>
1 parent 16e53b2 commit 05788ad
85 files changed
Lines changed: 11842 additions & 0 deletions
File tree
- ix-dev/community/vllm
- templates
- library/base_v2_3_5
- tests
- test_values
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
534 | 534 | | |
535 | 535 | | |
536 | 536 | | |
| 537 | + | |
537 | 538 | | |
538 | 539 | | |
539 | 540 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
0 commit comments