LTX-2  by Lightricks

DiT-based audio-video foundation model for generative tasks

Created 1 week ago

New!

1,910 stars

Top 22.6% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

LTX-2 is an open-access, DiT-based audio-video foundation model designed for synchronized, high-fidelity video generation. It offers production-ready outputs, multiple performance modes, and API access, targeting researchers and developers in advanced video synthesis.

How It Works

Leveraging a Diffusion Transformer (DiT) architecture, LTX-2 generates synchronized audio and video streams. Its design prioritizes high fidelity, flexible performance modes (including fast inference and upscaling), and production-ready outputs, positioning it as a versatile tool for contemporary video generation challenges.

Quick Start & Requirements

Highlighted Details

  • Pipelines: Offers diverse generation modes including TI2VidTwoStagesPipeline (production, recommended), TI2VidOneStagePipeline (prototyping), DistilledPipeline (fastest inference), ICLoraPipeline (video-to-video), and KeyframeInterpolationPipeline.
  • Optimization: Supports FP8 transformers for reduced memory, xFormers/Flash Attention integration, gradient estimation for fewer inference steps, and single-stage pipelines for speed.
  • Prompting: Employs detailed, cinematographer-style prompts (max 200 words) with an optional automatic prompt enhancement feature (enhance_prompt).
  • ComfyUI Integration: Seamlessly integrates with ComfyUI via a dedicated repository.

Maintenance & Community

No specific details regarding maintainers, community channels (e.g., Discord/Slack), or roadmap were found in the provided README.

Licensing & Compatibility

The license type and any compatibility notes for commercial or closed-source use are not specified in the provided README.

Limitations & Caveats

The temporal upscaler is noted as supported but required for future pipeline implementations, indicating potential limitations in current temporal coherence features. Optimization tips like FP8 and Flash Attention may imply specific hardware dependencies (e.g., NVIDIA GPUs). The setup involves downloading numerous large model files.

Health Check
Last Commit

3 days ago

Responsiveness

Inactive

Pull Requests (30d)
6
Issues (30d)
44
Star History
1,958 stars in the last 8 days

Explore Similar Projects

Feedback? Help us improve.