OmniRoute  by diegosouzapw

AI gateway for multi-provider LLMs, optimizing cost and reliability

Created 4 weeks ago

New!

774 stars

Top 45.2% on SourcePulse

GitHubView on GitHub
Project Summary

OmniRoute provides an OpenAI-compatible AI gateway, enabling developers to route requests to over 36 LLM providers. It addresses API limits, cost management, and provider fragmentation by offering smart routing, load balancing, automatic fallbacks, caching, and observability. The primary benefit is uninterrupted, cost-optimized AI inference through a single, unified endpoint.

How It Works

The system functions as a universal API proxy, translating diverse LLM provider formats into a consistent OpenAI-compatible structure. Its core is a 4-tier fallback mechanism: Subscription → API Key → Cheap → Free, ensuring service continuity. It intelligently manages provider quotas, retries, and load balancing across multiple accounts and models, offering features like automatic token refresh and response sanitization for seamless integration.

Quick Start & Requirements

  • Installation: npm install -g omniroute and run omniroute. Docker images are available.
  • Prerequisites: Node.js 18-22 LTS (Node.js 24+ is not supported).
  • Setup: Connect providers via the web dashboard, create an API key, and configure coding tools to use http://localhost:20128/v1.
  • Resources: Official website: https://omniroute.online.

Highlighted Details

  • Smart 4-Tier Fallback: Automatically routes requests from paid subscriptions to cheaper or free tiers upon quota exhaustion.
  • Unified Endpoint: Consolidates 36+ LLM providers under a single http://localhost:20128/v1 endpoint.
  • Format Translation: Transparently converts between OpenAI, Claude, Gemini, and other API response formats.
  • Cost Control: Features real-time quota tracking, cost analytics, and integration with free/cheap providers.
  • Protocol Support: Includes MCP (Model Context Protocol) and A2A (Agent-to-Agent) for advanced agent orchestration.
  • Multi-Modal Capabilities: Supports image generation, embeddings, audio transcription/synthesis, video, and music generation.
  • Security Features: API key management with scoping, IP filtering, TLS fingerprint spoofing, and CLI fingerprint matching.
  • Observability: Unified logging, health dashboard, request telemetry, and cost tracking.
  • Cross-Platform: Available as a native Electron desktop app and Docker image.
  • Internationalization: Dashboard and documentation support 30 languages.

Maintenance & Community

The project is actively developed, forked from 9router. Community support is available via a WhatsApp group. Contributions are welcomed via pull requests.

Licensing & Compatibility

Licensed under the MIT License, permitting commercial use and integration into closed-source projects.

Limitations & Caveats

The project explicitly states incompatibility with Node.js 24+. Remote deployments of Gemini CLI and Antigravity require manual configuration of Google OAuth credentials due to redirect_uri restrictions. MCP and A2A protocols are presented as v2.0, indicating potential for ongoing API changes.

Health Check
Last Commit

1 day ago

Responsiveness

Inactive

Pull Requests (30d)
193
Issues (30d)
133
Star History
803 stars in the last 28 days

Explore Similar Projects

Feedback? Help us improve.