PyTorch library for audio processing and generation research
Audiocraft is a PyTorch library for deep learning research on audio generation, offering state-of-the-art models like MusicGen and AudioGen for high-quality audio synthesis. It targets researchers and developers in the audio AI space, providing tools for both inference and training of generative audio models.
How It Works
Audiocraft leverages a modular architecture built on PyTorch, incorporating several advanced AI models. Key components include EnCodec for efficient audio compression, MusicGen for controllable text-to-music generation, and AudioGen for text-to-sound synthesis. The library also supports diffusion models (Multi Band Diffusion) and non-autoregressive approaches (MAGNeT), enabling diverse audio generation capabilities.
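As a rough illustration of the text-to-music path described above, the following sketch loads a pretrained MusicGen checkpoint and writes one clip per prompt. It assumes the `audiocraft` package is installed and that the `facebook/musicgen-small` checkpoint can be downloaded; the `output_stem` and `generate_clips` helpers are hypothetical names introduced here for illustration and are not part of the library.

```python
import re


def output_stem(prompt: str, index: int) -> str:
    """Build a filesystem-friendly file stem from a prompt (hypothetical helper)."""
    slug = re.sub(r"[^a-z0-9]+", "_", prompt.lower()).strip("_")
    return f"{index:02d}_{slug[:40]}"


def generate_clips(prompts, duration=8, model_name="facebook/musicgen-small"):
    """Generate one clip per text prompt and write each to disk.

    Assumption: the checkpoint name and the 8-second duration are example
    values, not recommendations from the project.
    """
    # Imported lazily so this module can be loaded without audiocraft installed.
    from audiocraft.models import MusicGen
    from audiocraft.data.audio import audio_write

    prompts = list(prompts)
    model = MusicGen.get_pretrained(model_name)
    model.set_generation_params(duration=duration)  # clip length in seconds

    # One waveform per prompt, batched along the first dimension.
    wav = model.generate(prompts)

    stems = []
    for i, (prompt, one_wav) in enumerate(zip(prompts, wav)):
        stem = output_stem(prompt, i)
        # audio_write appends the file extension to the stem itself.
        audio_write(stem, one_wav.cpu(), model.sample_rate, strategy="loudness")
        stems.append(stem)
    return stems
```

Calling `generate_clips(["happy rock", "lo-fi piano"])` would download the weights on first use, so expect a delay and a sizeable download; a GPU is advisable for anything beyond short clips.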
Quick Start & Requirements
python -m pip install -U audiocraft
ffmpeg is recommended.
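Before running any generation code, it can help to confirm that both pieces are in place. The snippet below is a minimal sketch using only the Python standard library; `check_environment` is a hypothetical helper name, and the check only confirms that the package and the `ffmpeg` binary are discoverable, not that their versions are compatible.

```python
import importlib.util
import shutil


def check_environment() -> dict:
    """Report whether the audiocraft package and the ffmpeg binary can be found."""
    return {
        # find_spec returns None when a top-level package is not importable.
        "audiocraft": importlib.util.find_spec("audiocraft") is not None,
        # shutil.which returns None when the executable is not on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
    }


if __name__ == "__main__":
    for name, found in check_environment().items():
        print(f"{name}: {'found' if found else 'missing'}")
```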
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The model weights are released under a non-commercial license, restricting their use in commercial products.