Open
Description
https://flashinfer.ai/2024/02/02/cascade-inference.html
Hi, I notice this blog posted a year ago.
I wonder what situation does the Evaluations
part refer to.
Is it for prefill stage ? or decoding stage? Or for both phase?
Metadata
Assignees
Labels
No labels
Activity