I believe this behaves like vLLM, which reserves physical resources at startup. According to the paper, however, no physical resources should be allocated at startup; the CUDA interface should be invoked to allocate them only when an inference request is actually being processed.
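A minimal sketch of the lazy pattern the paper describes, in Python. All names here are hypothetical illustrations, not the paper's or vLLM's actual API: `backend_alloc` stands in for the real CUDA allocation call (e.g. the driver API's `cuMemCreate`/`cuMemMap`), and the point is simply that it is never invoked from the constructor, only from request handling.

```python
from typing import Callable, Optional


class LazyKVCache:
    """Hypothetical KV-cache pool that defers physical allocation until
    the first inference request, instead of reserving memory at startup.
    """

    def __init__(self, capacity_bytes: int,
                 backend_alloc: Callable[[int], bytearray]) -> None:
        # Startup: record the planned size only; no allocation happens here.
        self.capacity_bytes = capacity_bytes
        self._backend_alloc = backend_alloc  # stand-in for the CUDA interface
        self._buffer: Optional[bytearray] = None

    @property
    def allocated(self) -> bool:
        return self._buffer is not None

    def handle_request(self, prompt: str) -> str:
        # Allocate lazily: the backend is called only when a request
        # actually needs the cache, per the paper's requirement.
        if self._buffer is None:
            self._buffer = self._backend_alloc(self.capacity_bytes)
        return f"processed {len(prompt)} chars with {len(self._buffer)}-byte cache"


# Usage: verify nothing is allocated at construction ("startup") time.
calls = []


def fake_cuda_alloc(n: int) -> bytearray:
    calls.append(n)  # record each backend allocation
    return bytearray(n)


cache = LazyKVCache(1024, fake_cuda_alloc)
assert not cache.allocated and calls == []   # startup: no allocation yet
cache.handle_request("hello")
assert cache.allocated and calls == [1024]   # first request triggers it
cache.handle_request("world")
assert calls == [1024]                       # subsequent requests reuse it
```

In a real system the same structure would hold, with the callback replaced by the CUDA driver calls and the buffer by mapped device memory.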
