
Commit 05788ad

blka (blazej karabin) and stavros-k authored
Add vLLM (#4631) (#4913)
* Add vLLM app to community catalog

  Adds vLLM (high-performance LLM inference server with OpenAI-compatible API) as a TrueNAS community app, addressing issue #4631.

  Features:
  - NVIDIA CUDA and AMD ROCm image selector
  - GPU passthrough via definitions/gpu_configuration
  - IPC host mode for PyTorch shared memory
  - Model name as command argument (--model)
  - HuggingFace token for gated models
  - HF_HOME=/data for model cache storage
  - HTTP healthcheck on /health endpoint
  - ROCm-specific SYS_PTRACE capability and seccomp:unconfined
  - Port 30422 (next free port in catalog)
  - 8192 MB default memory (vLLM is memory-hungry)

  Closes #4631

* Fix vLLM container crash: add HOME env var for UID 568
* Fix vLLM CI: add USER env var and vendored library
* Fix vLLM CI: CPU image on gh runners
* Add --port flag to vLLM command so it listens on the configured port inside the container (default 8000 was not mapped)
* Fix vLLM CI: give vllm more time to start in poor env
* add image to renovate-bot list
* no need for custom versioning, its semver
* update lib
* meta
* not really needed, unless another container had a depends on chain against it.
* huh. weird
* svg

---------

Co-authored-by: blazej karabin <blazej@cynergybank.co.uk>
Co-authored-by: Stavros Kois <47820033+stavros-k@users.noreply.github.com>
Co-authored-by: Stavros Kois <s.kois@outlook.com>
1 parent 16e53b2 commit 05788ad
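
The commit message above says the model name is passed as a `--model` argument, that a `--port` flag makes the server listen on the configured port, and that the healthcheck is an HTTP request to `/health`. The following Python sketch shows how those pieces might fit together. It is not the app's actual template code (which is not shown in this diff): the helper names `build_vllm_args` and `health_url`, the host `localhost`, and the model name `facebook/opt-125m` are illustrative assumptions.

```python
from urllib.parse import urlunsplit


def build_vllm_args(model: str, port: int) -> list[str]:
    """Assemble the server arguments named in the commit message.

    Hypothetical helper: only the --model and --port flags come from the
    commit message; the validation rules here are assumptions.
    """
    if not model:
        raise ValueError("model name must not be empty")
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    return ["--model", model, "--port", str(port)]


def health_url(host: str, port: int) -> str:
    """Build the URL an HTTP healthcheck on /health would request."""
    return urlunsplit(("http", f"{host}:{port}", "/health", "", ""))


if __name__ == "__main__":
    # 30422 is the port named in the commit message; the model is a placeholder.
    print(build_vllm_args("facebook/opt-125m", 30422))
    print(health_url("localhost", 30422))
```

Passing the same port to both helpers mirrors the fix described in the message: the server has to listen on the configured port rather than its default of 8000, otherwise a healthcheck against the configured port would find nothing listening.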

85 files changed

Lines changed: 11842 additions & 0 deletions


cspell.config.yaml

Lines changed: 1 addition & 0 deletions
@@ -534,6 +534,7 @@ words:
 - vhost
 - vikunja
 - vitransfer
+- vllm
 - wabayang
 - warracker
 - webapi

ix-dev/community/vllm/README.md

Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
+# vLLM
+
+[vLLM](https://github.com/vllm-project/vllm) is a high-throughput and memory-efficient inference and serving engine for LLMs with an OpenAI-compatible API.

ix-dev/community/vllm/app.yaml

Lines changed: 36 additions & 0 deletions
@@ -0,0 +1,36 @@
+app_version: v0.20.0
+capabilities: []
+categories:
+- ai
+changelog_url: https://github.com/vllm-project/vllm/releases
+date_added: '2026-04-30'
+description: vLLM is a high-throughput and memory-efficient inference and serving
+  engine for LLMs with an OpenAI-compatible API.
+home: https://github.com/vllm-project/vllm
+host_mounts: []
+icon: https://media.sys.truenas.net/apps/vllm/icons/icon.svg
+keywords:
+- ai
+- llm
+- inference
+- openai
+- gpu
+lib_version: 2.3.5
+lib_version_hash: c0d042e3de6350fae9aee546d3c79bc0accf1f2011da89cc1ebff29421aa7fc9
+maintainers:
+- email: dev@truenas.com
+  name: truenas
+  url: https://www.truenas.com/
+name: vllm
+run_as_context:
+- description: Container [vllm] can run as any non-root user and group.
+  gid: 568
+  group_name: Host group is [apps]
+  uid: 568
+  user_name: Host user is [apps]
+screenshots: []
+sources:
+- https://github.com/vllm-project/vllm
+title: vLLM
+train: community
+version: 1.0.0

ix-dev/community/vllm/item.yaml

Lines changed: 10 additions & 0 deletions
@@ -0,0 +1,10 @@
+categories:
+- ai
+icon_url: https://media.sys.truenas.net/apps/vllm/icons/icon.svg
+screenshots: []
+tags:
+- ai
+- llm
+- inference
+- openai
+- gpu
Lines changed: 17 additions & 0 deletions
@@ -0,0 +1,17 @@
+images:
+  image:
+    repository: vllm/vllm-openai
+    tag: v0.20.0
+  rocm_image:
+    repository: vllm/vllm-openai-rocm
+    tag: v0.20.0
+  cpu_image:
+    repository: vllm/vllm-openai-cpu
+    tag: v0.20.0
+  container_utils_image:
+    repository: ixsystems/container-utils
+    tag: 1.0.2
+
+consts:
+  vllm_container_name: vllm
+  perms_container_name: permissions
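
The values above define three vLLM image variants (CUDA, ROCm, CPU), and the commit message mentions an image selector. As a rough illustration of how such a selector could resolve to a full image reference, here is a small Python sketch. The selector values `nvidia`, `amd`, and `cpu`, the `SELECTOR_TO_KEY` mapping, and the `image_reference` helper are assumptions made for this example; the app's real selection logic lives in template files that are not shown here.

```python
# Image definitions copied from the values file above.
IMAGES = {
    "image": {"repository": "vllm/vllm-openai", "tag": "v0.20.0"},
    "rocm_image": {"repository": "vllm/vllm-openai-rocm", "tag": "v0.20.0"},
    "cpu_image": {"repository": "vllm/vllm-openai-cpu", "tag": "v0.20.0"},
}

# Hypothetical mapping from a user-facing selector value to an image key.
SELECTOR_TO_KEY = {
    "nvidia": "image",
    "amd": "rocm_image",
    "cpu": "cpu_image",
}


def image_reference(selector: str) -> str:
    """Return 'repository:tag' for the image matching the selector."""
    try:
        key = SELECTOR_TO_KEY[selector]
    except KeyError:
        raise ValueError(f"unknown image selector: {selector!r}") from None
    image = IMAGES[key]
    return f"{image['repository']}:{image['tag']}"


if __name__ == "__main__":
    for selector in SELECTOR_TO_KEY:
        print(selector, "->", image_reference(selector))
```

Keeping the repository and tag as separate fields, as the values file does, lets a version bump touch only the `tag` entries.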
