v1.6.0
v1.6.0 - Experimental Multimodal Support
This release:
- ⚗️ Adds very experimental multimodal support for granite vision models. No performance or accuracy guarantees and no documented support
- ⬆️ Bumps vllm support to include v0.14.0
- 🐛 Fixes a bug where structured output requests would crash the server
What's Changed
- 📝 Update granite logs by @joerunde in #637
- ⚡ Use ty for type checking instead of mypy by @joerunde in #620
- 🔧 set 32MB HDMA collective override by @joerunde in #638
- [CP][PC] rip out block reservation for chunked prefill by @yannicks1 in #622
- Multimodal / Granite Vision Support by @alex-jw-brooks in #614
- Add v0.14.0 support by @rafvasq in #636
- 🔥 remove vllm 0.10.2 compatibility by @tjohnson31415 in #660
- fix: drop structured output from request to avoid crash by @tjohnson31415 in #657
New Contributors
- @alex-jw-brooks made their first contribution in #614
Full Changelog: v1.5.0...v1.5.1