diff --git a/Popular_Models_Guide/Llama2/vllm_guide.md b/Popular_Models_Guide/Llama2/vllm_guide.md index b851292e..24800187 100644 --- a/Popular_Models_Guide/Llama2/vllm_guide.md +++ b/Popular_Models_Guide/Llama2/vllm_guide.md @@ -26,6 +26,8 @@ # OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. --> +# Deploying Llama2-7B Model with Triton and vLLM + The vLLM Backend uses vLLM to do inference. Read more about vLLM [here](https://blog.vllm.ai/2023/06/20/vllm.html) and the vLLM Backend [here](https://github.com/triton-inference-server/vllm_backend). ## Pre-build instructions @@ -114,4 +116,4 @@ The future of AI is The future of AI is in the hands of the people who use it. ========= -``` \ No newline at end of file +```