
Add OmniRoute — self-hostable AI gateway with 4-tier fallback #309

Open

diegosouzapw wants to merge 1 commit into Hannibal046:main from diegosouzapw:add-omniroute

Conversation

@diegosouzapw

Add OmniRoute

OmniRoute is a self-hostable AI gateway with:

  • 🔄 4-tier cascading fallback — automatic retry across providers
  • ⚖️ Multi-provider load balancing — distribute requests across OpenAI, Anthropic, Google, local models
  • 🔌 OpenAI-compatible API — drop-in replacement, works with any OpenAI SDK
  • 🎯 200+ models supported — GPT, Claude, Gemini, Llama, Mistral, etc.
  • 🐳 Docker-ready — single `docker compose up` deployment
  • 📊 Dashboard — real-time monitoring, flow visualization, provider management

GitHub: https://github.com/diegosouzapw/OmniRoute
License: MIT
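The 4-tier cascading fallback described above boils down to trying provider tiers in priority order and falling through on failure. The following is a minimal sketch of that pattern, not OmniRoute's actual implementation — the provider names, tier order, and error type are placeholders for illustration:

```python
from typing import Callable

class ProviderError(Exception):
    """Raised when a provider tier fails to serve the request (assumed error type)."""

def cascade(tiers: list[tuple[str, Callable[[str], str]]], prompt: str) -> str:
    """Walk the tiers in priority order; return the first success, else raise."""
    errors = []
    for name, call in tiers:
        try:
            return call(prompt)
        except ProviderError as exc:
            errors.append(f"{name}: {exc}")  # record the failure, try the next tier
    raise ProviderError("all tiers failed: " + "; ".join(errors))

# Demo with stand-in providers: the first two tiers fail, the third answers.
def down(_prompt: str) -> str:
    raise ProviderError("timeout")

def up(prompt: str) -> str:
    return f"echo: {prompt}"

tiers = [("openai", down), ("anthropic", down), ("google", up), ("local", up)]
print(cascade(tiers, "hi"))  # → echo: hi
```

A real gateway would wrap HTTP calls to each provider behind the same interface, so the cascade logic stays independent of any one vendor's SDK.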

Copilot AI review requested due to automatic review settings · February 20, 2026 16:43

Copilot AI left a comment

Pull request overview

This pull request adds OmniRoute to the README's LLM Inference section. OmniRoute is a self-hostable AI gateway that provides multi-provider routing, fallback mechanisms, and load balancing across 200+ AI models with an OpenAI-compatible API.

Changes:

  • Added OmniRoute entry to the LLM Inference section with a description highlighting its gateway capabilities, fallback mechanisms, and model support

- [vLLM](https://github.com/vllm-project/vllm) - A high-throughput and memory-efficient inference and serving engine for LLMs.
- [llama.cpp](https://github.com/ggerganov/llama.cpp) - LLM inference in C/C++.
- [ollama](https://github.com/ollama/ollama) - Get up and running with Llama 3, Mistral, Gemma, and other large language models.
- [OmniRoute](https://github.com/diegosouzapw/OmniRoute) - A self-hostable AI gateway with 4-tier cascading fallback, multi-provider load balancing, and OpenAI-compatible API. Supports 200+ models across OpenAI, Anthropic, Google, and local providers.

Copilot AI Feb 20, 2026

OmniRoute appears to be a gateway/routing tool rather than a direct inference engine. Similar tools like "AI Gateway" (Portkey) and "TensorZero" are listed in the "LLM Applications" section (lines 510, 551), while the "LLM Inference" section primarily contains tools that perform actual inference (vLLM, llama.cpp, ollama, TGI, TensorRT-LLM). Consider moving this entry to the "LLM Applications" section, or to the "other deployment tools" subsection within "LLM Inference" (after line 461) where similar routing/serving tools like FastChat are located.

