riffusion-hobby  by riffusion

Library for real-time music/audio generation using stable diffusion

Created 2 years ago
3,788 stars

Top 12.8% on SourcePulse

Project Summary

Riffusion (hobby) is a library for real-time music and audio generation with Stable Diffusion, aimed at musicians, sound designers, and researchers. It generates audio from text prompts and from spectrogram images.

How It Works

Riffusion generates audio by treating spectrograms as images: a Stable Diffusion model produces a spectrogram image, which is then converted to an audio clip. The diffusion pipeline supports prompt interpolation and image conditioning, so output can transition smoothly between musical styles or sounds by interpolating between text prompts or by conditioning on an existing spectrogram.
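Prompt interpolation can be pictured as blending the encoded forms of two prompts. The sketch below is a minimal illustration, not the project's pipeline code: it assumes plain linear blending, uses toy 4-element lists in place of real text-encoder tensors, and the names lerp_embeddings and interpolation_schedule are hypothetical.

```python
from typing import List


def lerp_embeddings(start: List[float], end: List[float], alpha: float) -> List[float]:
    """Linearly blend two equal-length embedding vectors.

    alpha = 0.0 returns `start`; alpha = 1.0 returns `end`.
    """
    if len(start) != len(end):
        raise ValueError("embeddings must have the same length")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    return [(1.0 - alpha) * s + alpha * e for s, e in zip(start, end)]


def interpolation_schedule(num_clips: int) -> List[float]:
    """Evenly spaced alpha values from 0.0 to 1.0, one per generated clip."""
    if num_clips < 2:
        raise ValueError("need at least two clips to form a transition")
    return [i / (num_clips - 1) for i in range(num_clips)]


# Toy stand-ins for two encoded text prompts (assumption: real
# embeddings are much larger tensors produced by a text encoder).
prompt_a = [1.0, 0.0, 2.0, -1.0]
prompt_b = [0.0, 1.0, 0.0, 1.0]

# Each blended vector would condition one clip in the transition.
for alpha in interpolation_schedule(5):
    print(alpha, lerp_embeddings(prompt_a, prompt_b, alpha))
```

Generating one clip per alpha value and playing them in order gives a gradual move from the first prompt to the second.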

Quick Start & Requirements

  • Install dependencies: python -m pip install -r requirements.txt
  • Requires Python 3.9 or 3.10.
  • ffmpeg is required for audio formats other than WAV.
  • CUDA-enabled GPU (e.g., 3090, A10G) recommended for real-time performance. The MPS backend is supported on Apple Silicon, with possible fallbacks to CPU for audio processing.
  • Official documentation: https://www.riffusion.com/about
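
The requirements above can be checked before installing. This is a hedged sketch using only the standard library; the preflight function and its parameters are hypothetical helpers for illustration and are not part of the riffusion package.

```python
import shutil
import sys
from typing import Callable, List, Optional, Tuple

# Python versions listed in the requirements above.
SUPPORTED_PYTHON = {(3, 9), (3, 10)}


def preflight(
    version: Optional[Tuple[int, int]] = None,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return a list of human-readable warnings; an empty list means OK.

    `version` and `which` are injectable so the check can be tested
    without changing the real interpreter or PATH.
    """
    version = version or sys.version_info[:2]
    warnings = []
    if tuple(version) not in SUPPORTED_PYTHON:
        warnings.append(
            f"Python {version[0]}.{version[1]} detected; 3.9 or 3.10 is required"
        )
    if which("ffmpeg") is None:
        warnings.append("ffmpeg not found on PATH; only WAV audio will work")
    return warnings


for message in preflight():
    print("warning:", message)
```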

Highlighted Details

  • Real-time music and audio generation via Stable Diffusion.
  • Converts between spectrogram images and audio clips.
  • Includes a command-line interface and an interactive Streamlit app.
  • Provides a Flask server for API-based model inference.
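
To make the spectrogram-to-image conversion concrete, here is a minimal sketch of mapping non-negative spectrogram magnitudes to 8-bit pixel values and back. The power-law compression with exponent 0.25, the function names, and the nested-list layout are assumptions chosen for illustration; the project's own conversion code may differ. An image of this kind stores magnitudes only, so turning it back into audio also requires estimating phase (Griffin-Lim is one common approach), which is not shown here.

```python
from typing import List, Tuple

Matrix = List[List[float]]


def magnitudes_to_pixels(spec: Matrix, power: float = 0.25) -> Tuple[List[List[int]], float]:
    """Compress non-negative magnitudes into 0..255 pixel values.

    Returns the pixel grid and the peak magnitude needed to invert it.
    """
    peak = max(max(row) for row in spec)
    if peak <= 0:
        raise ValueError("spectrogram must contain a positive magnitude")
    pixels = [[round(255 * (v / peak) ** power) for v in row] for row in spec]
    return pixels, peak


def pixels_to_magnitudes(pixels: List[List[int]], peak: float, power: float = 0.25) -> Matrix:
    """Invert magnitudes_to_pixels, up to 8-bit quantization error."""
    return [[peak * (p / 255) ** (1.0 / power) for p in row] for row in pixels]


# Tiny 2x2 "spectrogram" (rows = frequency bins, columns = time frames).
spectrogram = [[0.0, 1.0], [16.0, 81.0]]
image, peak_value = magnitudes_to_pixels(spectrogram)
restored = pixels_to_magnitudes(image, peak_value)
print(image)
print(restored)
```

The power-law step keeps quiet components visible in the image; without it, most pixels would round to zero next to the loudest bin.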

Maintenance & Community

This project is no longer actively maintained.

Licensing & Compatibility

The repository does not explicitly state a license. The associated website and model checkpoints may have different licensing terms.

Limitations & Caveats

The project is explicitly marked as "no longer actively maintained." CPU inference is supported but described as "quite slow," so it is poorly suited to real-time generation. On Apple Silicon, the MPS backend may fall back to CPU for audio processing and is not deterministic.
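
The device trade-offs above suggest a simple preference order: CUDA first, then MPS, then CPU. The sketch below illustrates that order under stated assumptions; pick_device and detect_device are hypothetical names, and the project's own device handling may differ. The PyTorch calls used (torch.cuda.is_available and torch.backends.mps.is_available) exist in recent PyTorch releases, and the import is guarded so the snippet still runs without PyTorch installed.

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Choose the fastest available backend: CUDA, then MPS, then CPU."""
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"  # works, but expect generation to be slow


def detect_device() -> str:
    """Query PyTorch for available backends, defaulting to CPU without it."""
    try:
        import torch
    except ImportError:
        return "cpu"
    # Older PyTorch builds may lack the mps backend module entirely.
    mps = getattr(torch.backends, "mps", None)
    mps_ok = bool(mps is not None and mps.is_available())
    return pick_device(torch.cuda.is_available(), mps_ok)


# Example: an Apple Silicon machine with no CUDA GPU.
print(pick_device(cuda_available=False, mps_available=True))
```

On MPS, PyTorch's PYTORCH_ENABLE_MPS_FALLBACK=1 environment variable lets operations that MPS does not support run on the CPU instead, which matches the fallback behavior described above.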

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 1
  • Star history: 18 stars in the last 30 days

Explore Similar Projects

Starred by Dan Abramov (Core Contributor to React; Coauthor of Redux, Create React App), Benjamin Bolte (Cofounder of K-Scale Labs), and 3 more.

riffusion-app-hobby by riffusion

0.0%
3k
Web app for real-time music generation using stable diffusion
Created 2 years ago
Updated 1 year ago
Starred by Patrick von Platen (Author of Hugging Face Diffusers; Research Engineer at Mistral) and Omar Sanseviero (DevRel at Google DeepMind).

AudioLDM by haoheliu

0.1%
3k
Audio generation research paper using latent diffusion
Created 2 years ago
Updated 2 months ago