sora2api by TheSmallHanCat

Unified API for advanced AI media generation

Created 5 months ago

1,168 stars

Top 32.9% on SourcePulse

Project Summary

This project provides an OpenAI-compatible API service for generative AI models, specifically targeting Sora-like capabilities. It offers a unified interface for developers and power users to integrate advanced text-to-image, image-to-image, text-to-video, and image-to-video generation into their applications, benefiting from features like load balancing and simplified management.

How It Works

Sora2API functions as a reverse proxy and API gateway, presenting a fully OpenAI-compatible /v1/chat/completions endpoint. It routes requests to underlying generative models, supporting a wide array of modalities including image and video creation from text or images. Key differentiators include advanced video features like character-based generation and video remixing, alongside robust backend management capabilities.

Quick Start & Requirements

Installation is streamlined via Docker Compose (recommended) or a local Python 3.8+ environment. Prerequisites include Docker and Docker Compose, or Python 3.8+. After cloning the repository, users can launch the service with docker-compose up -d or by installing dependencies (pip install -r requirements.txt) and running python main.py. An initial web management interface is accessible at http://localhost:8000 with default credentials admin/admin, which must be changed immediately.

Highlighted Details

OpenAI Compatibility: Seamlessly integrates with existing OpenAI SDKs and workflows.
Multi-modal Generation: Supports text-to-image, image-to-image, text-to-video, and image-to-video.
Advanced Video Features: Includes video remixing and character-based video generation.
Load Balancing & Token Management: Manages multiple API keys with round-robin load balancing.
Web Management Interface: Provides an intuitive admin panel for configuration.
Proxy Support: Integrates HTTP and SOCKS5 proxies.

Maintenance & Community

Support and discussion are primarily channeled through GitHub Issues and GitHub Discussions. The project actively encourages community engagement and contributions.

Licensing & Compatibility

The project is released under the permissive MIT License. This license generally permits commercial use, modification, and distribution, making it compatible with closed-source applications and services.

Limitations & Caveats

The provided README does not explicitly detail known limitations, bugs, or unsupported platforms. The inclusion of a "WARP Mode" suggests potential complexities or specific requirements related to network configurations or environments. The sora-* model naming implies an abstraction layer over an underlying Sora implementation, the specifics of which are not elaborated upon.

Health Check

Last Commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

86 stars in the last 30 days