riffusion-app-hobby  by riffusion

Web app for real-time music generation using stable diffusion

created 2 years ago
2,665 stars

Top 18.1% on sourcepulse

GitHubView on GitHub
Project Summary

Riffusion App provides a web interface for real-time music generation using Stable Diffusion models. It targets musicians, artists, and developers interested in AI-powered audio creation, offering a user-friendly platform for exploring and generating novel musical pieces.

How It Works

The application leverages Stable Diffusion, a latent diffusion model, to generate audio spectrograms from text prompts. These spectrograms are then converted into audible music. The architecture is built with Next.js, React, and TypeScript for the frontend, utilizing three.js for 3D visualizations and Tailwind CSS for styling, deployed on Vercel.

Quick Start & Requirements

  • Install packages: npm install or yarn install
  • Run development server: npm run dev or yarn dev
  • Requires Node.js v18 or greater.
  • An inference server (Flask app) is needed for actual model output generation, requiring a GPU capable of running Stable Diffusion quickly. The inference server URL must be specified in a .env.local file.

Highlighted Details

  • Real-time music generation via text prompts.
  • Web application built with modern frontend technologies (Next.js, React, TypeScript).
  • Utilizes Stable Diffusion for audio synthesis.

Maintenance & Community

This project is no longer actively maintained.

Licensing & Compatibility

The license is not explicitly stated in the README.

Limitations & Caveats

The project is explicitly marked as "no longer actively maintained," which may indicate a lack of future updates, bug fixes, or community support. Running the full functionality requires a separate, GPU-intensive inference server.

Health Check
Last commit

1 year ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
1
Star History
23 stars in the last 90 days

Explore Similar Projects

Starred by Patrick von Platen Patrick von Platen(Core Contributor to Hugging Face Transformers and Diffusers) and Omar Sanseviero Omar Sanseviero(DevRel at Google DeepMind).

AudioLDM by haoheliu

0.3%
3k
Audio generation research paper using latent diffusion
created 2 years ago
updated 1 month ago
Feedback? Help us improve.