v0.4.5: serving LLM embedding models

@dacorvo released this 11 Feb 10:28

What's Changed

  • doc: add a guide explaining vLLM deployment on Inference Endpoints by @tengomucho in #1057
  • Add Qwen embedding guide and notebook by @pinak-p in #1045
  • Serve embedding models using vLLM by @dacorvo in #1072

Other changes

New Contributors

Full Changelog: v0.4.4...v0.4.5