Warning
This repository contains research implementations which are not supported. If you are looking for a production implementation of speculative decoding models, please refer to the the vllm-project/speculators repo.
Warning
This repository contains research implementations which are not supported. If you are looking for a production implementation of speculative decoding models, please refer to the the vllm-project/speculators repo.