
[Collection Request] LLM Gateway & Proxy Infrastructure - Unified Access, Routing & Cost Optimization for 100+ AI Models #2169

@sykp241095


Summary

Track the rapidly growing LLM gateway/proxy ecosystem: infrastructure that provides unified access, intelligent routing, load balancing, and cost optimization across 100+ LLM providers. This is distinct from #2159 (LLM Inference & Serving), which focuses on running models locally.

Why This Matters

  • Unified API: Single endpoint for 100+ LLM providers (OpenAI, Anthropic, Bedrock, VertexAI, etc.)
  • Cost Optimization: Automatic routing to cheapest/fastest models, fallback strategies
  • Enterprise Features: Rate limiting, caching, guardrails, observability, multi-tenant isolation
  • Production Ready: Load balancing, retry logic, circuit breakers, high availability
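The routing-with-fallback behavior these gateways provide can be sketched in a few lines of Python. The provider names and stub `call` functions below are hypothetical placeholders, not any listed gateway's actual API:

```python
# Minimal sketch of per-request provider fallback, the core pattern behind
# "automatic routing with fallback strategies". Illustrative only.

class ProviderError(Exception):
    """Raised when an upstream LLM provider fails (rate limit, outage, etc.)."""

def call_with_fallback(providers, prompt):
    """Try each (name, call) pair in priority order; return the first success."""
    errors = {}
    for name, call in providers:
        try:
            return name, call(prompt)
        except ProviderError as exc:
            errors[name] = exc  # record the failure and try the next provider
    raise ProviderError(f"all providers failed: {errors}")

# Usage: the primary is rate-limited, so the gateway falls back transparently.
def flaky_primary(prompt):
    raise ProviderError("rate limited")

def healthy_backup(prompt):
    return f"echo: {prompt}"

winner, reply = call_with_fallback(
    [("openai/gpt-4o", flaky_primary), ("anthropic/claude", healthy_backup)],
    "hello",
)
```

Real gateways layer retries, circuit breakers, and latency/cost scoring on top of this same loop.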

Key Repositories to Track

Major Players (10K+ Stars)

| Repo | Stars | Forks | Description |
|---|---|---|---|
| Kong/kong | 43K+ | 5.1K | The API and AI Gateway; a traditional API gateway adding AI capabilities |
| BerriAI/litellm | 40K+ | 6.6K | Python SDK + proxy server (AI gateway) for 100+ LLM APIs with cost tracking, guardrails, load balancing |
| apache/apisix | 16K+ | 2.8K | Cloud-native API gateway and AI gateway |
| Portkey-AI/gateway | 11K+ | 954 | Blazing-fast AI gateway with 200+ LLMs and 50+ integrated guardrails |
| tensorzero/tensorzero | 11K+ | 797 | LLMOps platform unifying LLM gateway, observability, evaluation, and optimization |

Emerging Players (1K-10K Stars)

| Repo | Stars | Forks | Description |
|---|---|---|---|
| alibaba/higress | 7.9K+ | 1K+ | AI gateway |
| IBM/mcp-context-forge | 3.5K+ | 592 | AI gateway, registry, and proxy for MCP, A2A, or REST/gRPC APIs |
| maximhq/bifrost | 3.1K+ | 338 | Claims 50x faster than LiteLLM; adaptive load balancer, <100µs overhead at 5k RPS |

Specialized Solutions

| Repo | Stars | Forks | Description |
|---|---|---|---|
| envoyproxy/ai-gateway | 1.4K+ | 197 | Unified access to generative AI services, built on Envoy Gateway |
| Helicone/ai-gateway | 555+ | 46 | Fastest, lightest AI gateway; fully open-sourced |
| traceloop/hub | 174+ | 31 | High-scale LLM gateway in Rust with OpenTelemetry observability |
| labring/aiproxy | - | - | High-performance AI gateway with intelligent error handling and multi-channel management |

Ecosystem Categories

  1. Traditional API Gateways adding AI: Kong, Apache APISIX, Higress
  2. LLM-Native Gateways: LiteLLM, Portkey, Bifrost
  3. LLMOps Platforms with Gateway: TensorZero, Helicone
  4. Service Mesh / Proxy-based: Envoy AI Gateway
  5. MCP/A2A Integration: IBM MCP Context Forge

Key Features to Track

  • Model routing & fallback strategies
  • Cost optimization & tracking
  • Rate limiting & quota management
  • Caching & response optimization
  • Guardrails & safety filters
  • Observability & tracing (OpenTelemetry)
  • Multi-tenant isolation
  • Load balancing & high availability
  • Protocol unification (OpenAI-compatible APIs)
  • MCP/A2A integration
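Several of these features, rate limiting and quota management in particular, reduce to a token bucket enforced at the gateway's edge. A minimal, self-contained sketch (not any listed project's actual implementation; the injectable `now` clock is for testability):

```python
import time

class TokenBucket:
    """Token-bucket rate limiter of the kind gateways apply per tenant or API key."""

    def __init__(self, rate, capacity, now=time.monotonic):
        self.rate = rate          # tokens replenished per second
        self.capacity = capacity  # maximum burst size
        self.tokens = capacity    # start full
        self.now = now            # injectable clock for deterministic tests
        self.last = now()

    def allow(self, cost=1.0):
        """Return True and deduct `cost` tokens if the request fits the budget."""
        t = self.now()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (t - self.last) * self.rate)
        self.last = t
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```

In a real gateway, `cost` would typically be the request's token count or estimated dollar spend rather than a flat 1, and one bucket is kept per tenant.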

Related Issues

Suggested Labels

area/growth, type/feature, priority/p2, topic/ai-infra, topic/llm-gateway

Metadata

    Labels

    area/growth (Growth, SEO, and user acquisition initiatives), priority/p1 (Something isn't working but not urgent), type/feature (New feature or request)