sora2api  by TheSmallHanCat

Unified API for advanced AI media generation

Created 3 months ago
1,021 stars

Top 36.4% on SourcePulse

GitHubView on GitHub
Project Summary

This project provides an OpenAI-compatible API service for generative AI models, specifically targeting Sora-like capabilities. It offers a unified interface for developers and power users to integrate advanced text-to-image, image-to-image, text-to-video, and image-to-video generation into their applications, benefiting from features like load balancing and simplified management.

How It Works

Sora2API functions as a reverse proxy and API gateway, presenting a fully OpenAI-compatible /v1/chat/completions endpoint. It routes requests to underlying generative models, supporting a wide array of modalities including image and video creation from text or images. Key differentiators include advanced video features like character-based generation and video remixing, alongside robust backend management capabilities.

Quick Start & Requirements

Installation is streamlined via Docker Compose (recommended) or a local Python 3.8+ environment. Prerequisites include Docker and Docker Compose, or Python 3.8+. After cloning the repository, users can launch the service with docker-compose up -d or by installing dependencies (pip install -r requirements.txt) and running python main.py. An initial web management interface is accessible at http://localhost:8000 with default credentials admin/admin, which must be changed immediately.

Highlighted Details

  • OpenAI Compatibility: Seamlessly integrates with existing OpenAI SDKs and workflows.
  • Multi-modal Generation: Supports text-to-image, image-to-image, text-to-video, and image-to-video.
  • Advanced Video Features: Includes video remixing and character-based video generation.
  • Load Balancing & Token Management: Manages multiple API keys with round-robin load balancing.
  • Web Management Interface: Provides an intuitive admin panel for configuration.
  • Proxy Support: Integrates HTTP and SOCKS5 proxies.

Maintenance & Community

Support and discussion are primarily channeled through GitHub Issues and GitHub Discussions. The project actively encourages community engagement and contributions.

Licensing & Compatibility

The project is released under the permissive MIT License. This license generally permits commercial use, modification, and distribution, making it compatible with closed-source applications and services.

Limitations & Caveats

The provided README does not explicitly detail known limitations, bugs, or unsupported platforms. The inclusion of a "WARP Mode" suggests potential complexities or specific requirements related to network configurations or environments. The sora-* model naming implies an abstraction layer over an underlying Sora implementation, the specifics of which are not elaborated upon.

Health Check
Last Commit

2 weeks ago

Responsiveness

Inactive

Pull Requests (30d)
1
Issues (30d)
24
Star History
173 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Wei-Lin Chiang Wei-Lin Chiang(Cofounder of LMArena), and
13 more.

awesome-tensor-compilers by merrymercy

0%
3k
Curated list of tensor compiler projects and papers
Created 5 years ago
Updated 1 year ago
Starred by Shengjia Zhao Shengjia Zhao(Chief Scientist at Meta Superintelligence Lab), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
14 more.

BIG-bench by google

0.1%
3k
Collaborative benchmark for probing and extrapolating LLM capabilities
Created 5 years ago
Updated 1 year ago
Starred by Lysandre Debut Lysandre Debut(Chief Open-Source Officer at Hugging Face), Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), and
14 more.

simpletransformers by ThilinaRajapakse

0%
4k
Rapid NLP task implementation
Created 6 years ago
Updated 6 months ago
Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
16 more.

text-to-text-transfer-transformer by google-research

0.0%
6k
Unified text-to-text transformer for NLP research
Created 6 years ago
Updated 1 month ago
Starred by Vaibhav Nivargi Vaibhav Nivargi(Cofounder of Moveworks), Chuan Li Chuan Li(Chief Scientific Officer at Lambda), and
5 more.

awesome-mlops by visenger

0.1%
14k
Curated MLOps knowledge hub
Created 6 years ago
Updated 1 year ago
Feedback? Help us improve.