This issue tracks the temporary "hack" introduced in https://github.com/neuralmagic/llm-d-inference-scheduler/pull/34 to allow the importing of a nerualmagic internal repo: llm-d-kv-cache-manager. Once the latter goes public, these changes must be reverted.