minisora  by mini-sora

Community initiative exploring Sora implementation and development

created 1 year ago
1,266 stars

Top 31.9% on sourcepulse

GitHubView on GitHub
Project Summary

This repository is a community-driven initiative focused on exploring and replicating the technology behind OpenAI's Sora, a text-to-video generation model. It aims to provide accessible implementations and foster research into diffusion models for video generation, targeting researchers and developers interested in state-of-the-art video synthesis.

How It Works

The project centers on reproducing key research papers and technologies related to Sora, such as DiT (Diffusion Transformer). It leverages existing frameworks like XTuner for efficient sequence training and aims to develop GPU-friendly and training-efficient models. The approach involves a comprehensive review of diffusion models for video generation, from DDPM to advanced transformer-based architectures.

Quick Start & Requirements

  • Installation: Not explicitly detailed, but likely involves Python and PyTorch.
  • Requirements: The project aims for GPU-friendly operation, targeting configurations like 8x A100 80GB, 8x A6000 48GB, or RTX4090 24GB for training and inference. Specific requirements for reproducing DiT mention 2x A100.
  • Resources: The project is actively recruiting contributors familiar with OpenMMLab's MMEngine and DiT.
  • Links: MiniSora-DiT

Highlighted Details

  • Focus on reproducing DiT (Scalable Diffusion Models with Transformers).
  • Aims for GPU-friendly training and inference with moderate hardware.
  • Comprehensive survey of video generation models and related technologies.
  • Community-driven exploration of Sora's implementation and future directions.

Maintenance & Community

The project is driven by the MiniSora Community, with regular round-table discussions involving the Sora team and community members. It actively recruits contributors and provides links to WeChat groups for community engagement.

Licensing & Compatibility

The repository's license is not explicitly stated in the README.

Limitations & Caveats

The project is a community effort to replicate complex research; therefore, the fidelity and performance of reproduced models may vary. Specific implementation details and stability are subject to ongoing community development.

Health Check
Last commit

5 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
9 stars in the last 90 days

Explore Similar Projects

Starred by Ying Sheng Ying Sheng(Author of SGLang), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
1 more.

Open-Sora-Plan by PKU-YuanGroup

0.1%
12k
Open-source project aiming to reproduce Sora-like T2V model
created 1 year ago
updated 2 weeks ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Luca Antiga Luca Antiga(CTO of Lightning AI).

mmagic by open-mmlab

0.1%
7k
AIGC toolbox for image/video editing and generation
created 6 years ago
updated 1 year ago
Feedback? Help us improve.