Skip to content
Discussion options

You must be logged in to vote

In extreme cases, such as when there's no storage backend and the device cache is full, offloading to hicache will cause an OOM error. To better handle this case, a more sophisticated method should be used, eliminating the need to ensure the host KV cache is larger than the device cache.

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by stmatengss
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants