Skip to content

Release Gateway-v0.2.3

Choose a tag to compare

@slin1237 slin1237 released this 17 Nov 11:23
· 1850 commits to main since this release
172c71a

πŸš€ SGLang Model Gateway - New Release!

We're excited to announce another powerful update to SGLang Model Gateway with performance improvements and expanded database support!

✨ Headline Features

⚑ Bucket Mode Routing - 20-30% Performance Boost
Introducing our new bucket-based routing algorithm that dramatically improves performance in PD mode. See up to 20-30% improvements in TTFT (Time To First Token) and overall throughput

πŸ’Ύ PostgreSQL Support for Chat History Management
Flexibility in data storage! We now support PostgreSQL alongside OracleDB and in-memory storage for chat history management.

πŸ› οΈ Enhanced Model Tool & Structured Output Support

  • MinMax M2 model support!
  • Structured model output for OpenAI and gRPC router
  • Streaming parsing with Tool Choice in chat completions API
  • Tool_choice support for Responses API
  • OutputItemDone events with output item array storage for better observability

πŸ› Stability & Quality Improvements

Multiple bug fixes for model validation, streaming logic, reasoning content indexing, and CI stability enhancements.

πŸ”§ Code Quality Enhancements

Refactored builders for chat and responses, restructured modules for better maintainability, and consolidated error handling.

Try the latest version: pip install sglang-router --upgrade

What's Changed in Gateway

Gateway Changes (45 commits)

New Contributors

Paths Included

  • sgl-router
  • python/sglang/srt/grpc
  • python/sglang/srt/entrypoints/grpc_server.py

Full Changelog: gateway-v0.2.2...gateway-v0.2.3