Skip to content

Release Gateway-v0.2.4

Choose a tag to compare

@slin1237 slin1237 released this 10 Dec 01:09
· 1064 commits to main since this release
390406c

πŸš€ SGLang Model Gateway v0.2.4 Released!

We're excited to announce SGLang Model Gateway v0.2.4 – a massive release focused on performance, security, and production-ready observability!

✨ Headline Features

⚑ Major Performance Optimizations

We've invested heavily in performance across the entire stack:

  • Optimized radix tree for cache-aware load balancing – Smarter routing decisions with lower overhead
  • Tokenizer optimization – Dramatically reduced CPU and memory footprint during tokenization
  • Core module optimization – HTTP and gRPC routers now run leaner and faster
  • Efficient OTEL implementation – Production-grade observability with minimal performance impact

πŸ”Œ Industry-First WASM Middleware Support

Programmable middleware using WebAssembly! Extend your gateway with safe, isolated plugins. Build custom routing logic, transform requests/responses, or integrate proprietary systems – all without touching core code. Your gateway, your rules.

πŸ“Š Production-Grade Observability

Full OpenTelemetry integration with distributed tracing for both HTTP and gRPC. Track requests across your entire inference stack with native trace context propagation. Finally, real visibility into your LLM infrastructure.

⚑ Built for speed. Hardened for security. Ready for production.

Gateway Changes (98 commits)

New Contributors

Full Changelog: gateway-v0.2.3...gateway-v0.2.4