[fs-backend] Track feature gaps in upstream vLLM fs_python secondary tier vs llmd_fs_backend

## Context

vLLM has upstreamed a pure-Python `fs_python` secondary tier for the `OffloadingConnector`
in [vllm-project/vllm#41735](https://github.com/vllm-project/vllm/pull/41735) — a minimal
baseline equivalent of our `kv_connectors/llmd_fs_backend`. Before we can deprecate
llmd_fs_backend in favor of the upstream tier, the following gaps need to be either
upstreamed into vLLM or kept as an extension in this repo.

## Reference
- Upstream PR: vllm-project/vllm#41735 (merged)
- Upstream paths: `vllm/v1/kv_offload/tiering/fs/{manager,io,thread_pool}.py`, `vllm/v1/kv_offload/file_mapper.py`
- Local equivalent: `kv_connectors/llmd_fs_backend/`

## Gap checklist

- [ ] **1. KV-event publishing** — llmd_fs_backend emits `BlockStored` events over ZMQ PUB in vLLM's msgpack wire format (`event_publisher.py`).
- [ ] **2. Per-job metrics** — llmd_fs_backend reports `transfer_size`, `transfer_time`, and throughput per completed transfer; upstream `JobResult` carries only `(job_id, success)`.
- [ ] **3. Partial file read/write (HMA)** — llmd_fs_backend allows partial reads and writes to a file.
- [ ] **4. Adaptive write-queue cap with EMA-based dropping** — llmd_fs_backend caps queue depth at `max_write_queued_seconds / EMA(write_latency)` and drops excess writes (fix for #457 deadlock, verified on L40S and H100); upstream's deques are unbounded.
- [ ] **5. Independent storage vs CPU-tier block-size knob** — llmd_fs_backend exposes a separate `block_size` for the storage tier and actually packs multiple GPU blocks per file.
- [ ] **6. GDS support** — llmd_fs_backend supports 7 modes (`disabled`, `read_only`, `write_only`, `read_write`, and `bb_*` bounce-buffer variants) via C++ `GdsFileIO` with POSIX fallback; upstream is POSIX-only with `O_DIRECT`.
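
For reference when discussing gap 4, here is a rough sketch of the cap formula `max_write_queued_seconds / EMA(write_latency)`. It is not the llmd_fs_backend code: the class name, the `alpha` smoothing factor, the `initial_latency` seed, and the minimum cap of 1 are all assumptions made for illustration.

```python
from collections import deque


class AdaptiveWriteQueue:
    """Hypothetical bounded write queue; cap follows an EMA of write latency."""

    def __init__(self, max_write_queued_seconds, alpha=0.5, initial_latency=0.25):
        self.max_write_queued_seconds = max_write_queued_seconds
        self.alpha = alpha                  # EMA smoothing factor (assumed)
        self.ema_latency = initial_latency  # seconds per write (assumed seed)
        self.queue = deque()
        self.dropped = 0

    def record_latency(self, seconds):
        # Standard exponential moving average update.
        self.ema_latency = (
            self.alpha * seconds + (1 - self.alpha) * self.ema_latency
        )

    @property
    def cap(self):
        # Number of writes that fit in the time budget at the current latency.
        # The floor of 1 is an assumption so the queue never closes entirely.
        return max(1, int(self.max_write_queued_seconds / self.ema_latency))

    def submit(self, job):
        # Drop the write instead of queueing it once the cap is reached.
        if len(self.queue) >= self.cap:
            self.dropped += 1
            return False
        self.queue.append(job)
        return True


q = AdaptiveWriteQueue(max_write_queued_seconds=2.0)
accepted = [q.submit(i) for i in range(10)]
q.record_latency(0.75)  # slower writes shrink the cap
```

With a 2-second budget and a 0.25 s seed latency the cap starts at 8, so two of the ten submissions are dropped; after one 0.75 s sample the EMA rises to 0.5 s and the cap falls to 4. An upstream version would need to decide whether dropped writes are reported through `JobResult` or silently skipped.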
