openvino-plugins-ai-audacity by intel

AI plugins for Audacity audio editor, running locally

Created 2 years ago

1,873 stars

Top 22.8% on SourcePulse

2 Experts Love This Project

ggerganov

Georgi Gerganov

Author of llama.cpp, whisper.cpp

v4dkou

Vadim Smelyanskiy

Research Scientist at Google Quantum

Project Summary

This project provides AI-powered audio effects, generators, and analyzers for Audacity, enabling users to perform tasks like music separation, noise suppression, and audio transcription entirely offline. It targets Audacity users seeking to integrate advanced AI capabilities directly into their audio editing workflow without requiring internet connectivity.

How It Works

The plugins leverage Intel's OpenVINO toolkit to run various AI models efficiently on local hardware, including CPUs, GPUs, and NPUs. This approach allows for high-performance inference of complex models like Meta's MusicGen for music generation, Demucs v4 for music separation, and whisper.cpp for transcription, all optimized for local execution.

Quick Start & Requirements

Installation packages and instructions for Windows are available at the project's GitHub releases page.
Build instructions for Windows and Linux are provided.
Requires Audacity. Specific hardware accelerators (CPU, GPU, NPU) are leveraged by OpenVINO.

Highlighted Details

Music Separation: Splits audio into stems (Drums, Bass, Vocals, Other) using Meta's Demucs v4.
Music Generation: Utilizes Meta's MusicGen (Small and Small-Stereo variants) for creating or continuing music snippets.
Transcription: Employs whisper.cpp for generating transcriptions or translations of spoken audio.
Noise Suppression: Implements models like noise-suppression-denseunet-ll and DeepFilterNet2/3 for background noise removal.

Maintenance & Community

Contributions are welcomed via pull requests.
Issues for questions, bug reports, feature requests, and feedback can be submitted on the GitHub repository.
Acknowledgements are given to the Audacity development team & Muse Group.

Licensing & Compatibility

The project utilizes various open-source models and libraries, including those under permissive licenses. Specific licensing details for the plugins themselves are not explicitly stated in the README, but dependencies like Audacity and whisper.cpp have their own licenses.
The use of OpenVINO™ implies compatibility with Intel hardware accelerators.

Limitations & Caveats

The README primarily details Windows installation, with Linux build instructions also provided, suggesting potential platform-specific nuances.
The project relies on specific AI models which may have varying performance characteristics and resource requirements.

Health Check

Last Commit

3 weeks ago

Responsiveness

1 day

Pull Requests (30d)

2

Issues (30d)

7

Star History

62 stars in the last 30 days

Explore Similar Projects

UniAudio2 by yangdongchao

Audio foundation model unifies speech, sound, and music processing

Created 3 weeks ago

Updated 1 week ago

awesome-audio-plaza by metame-ai

Curated list of audio research papers, projects, and resources

Created 2 years ago

Updated 3 months ago

audio-development-tools by Yuan-ManX

Audio development tools list, covering ML, generation, processing, synthesis, and more

Created 3 years ago

Updated 7 months ago

SongGen by LiuZH-19

Text-to-song generation with an auto-regressive transformer

Created 1 year ago

Updated 3 months ago

musetree by stevenwaterman

AI music production suite

Created 6 years ago

Updated 2 years ago

WavJourney by Audio-AGI

Audio creation pipeline using LLMs for compositional generation

Created 2 years ago

Updated 2 years ago

awesome-large-audio-models by EmulationAI

Curated list of Large Language Models in Audio AI

Created 2 years ago

Updated 4 months ago

WavCraft by JinhuaLiang

AI agent for audio creation and editing

Created 1 year ago

Updated 1 year ago

ace-step-ui by fspecii

AI music generation app with a professional, local UI

Created 3 weeks ago

Updated 2 weeks ago

Starred by

Luis Capelo

Luis Capelo(Cofounder of Lightning AI).

FunMusic by FunAudioLLM

Toolkit for music, song, and audio generation

Created 1 year ago

Updated 9 months ago

SongGeneration by tencent-ailab

AI framework for high-fidelity song generation

Created 8 months ago

Updated 2 months ago

Starred by

Yaowei Zheng

Yaowei Zheng(Author of LLaMA-Factory),

Jiayi Pan

Jiayi Pan(Author of SWE-Gym; MTS at xAI), and

18 more.

audiocraft by facebookresearch

PyTorch library for audio processing and generation research

Created 2 years ago

Updated 11 months ago

Feedback? Help us improve.