diffwave by lmnt-com

Neural vocoder and waveform synthesizer

Created 5 years ago

883 stars

Top 40.6% on SourcePulse


2 Experts Love This Project

Omar Sanseviero

DevRel at Google DeepMind

Chenlin Meng

Cofounder of Pika

Project Summary

DiffWave is a neural vocoder and waveform synthesizer that generates high-quality audio from conditioning signals like Mel spectrograms. It iteratively refines Gaussian noise into speech, offering fast inference and stable training. The project is suitable for researchers and developers working on speech synthesis and audio generation.

How It Works

DiffWave is a diffusion model: it starts from random Gaussian noise and progressively denoises it over a sequence of learned steps until a waveform emerges. When a conditioning signal such as a Mel spectrogram is supplied, it guides each denoising step. The approach targets high-fidelity waveform generation and covers both conditional synthesis (vocoding) and unconditional synthesis.
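
As a rough illustration of this refinement loop, the sketch below runs a standard DDPM-style reverse process in pure Python. It is not the project's code: the names reverse_diffusion and zero_noise_predictor and the 50-step linear noise schedule are assumptions made for illustration, and the placeholder predictor stands in for the trained network, so the output is noise-like samples rather than speech.

```python
import math
import random


def reverse_diffusion(predict_noise, num_samples, betas, rng):
    """Refine Gaussian noise into a signal with a DDPM-style reverse process.

    predict_noise(x, t) must return the estimated noise for each sample in x
    at step t. In a real vocoder this would be a trained neural network.
    """
    alphas = [1.0 - b for b in betas]
    # Cumulative products of alphas (often written alpha-bar).
    alpha_bars = []
    running = 1.0
    for a in alphas:
        running *= a
        alpha_bars.append(running)

    # Start from pure Gaussian noise.
    x = [rng.gauss(0.0, 1.0) for _ in range(num_samples)]

    # Walk the noise schedule backwards, removing predicted noise at each step.
    for t in range(len(betas) - 1, -1, -1):
        eps = predict_noise(x, t)
        scale = betas[t] / math.sqrt(1.0 - alpha_bars[t])
        inv_sqrt_alpha = 1.0 / math.sqrt(alphas[t])
        x = [(xi - scale * ei) * inv_sqrt_alpha for xi, ei in zip(x, eps)]
        if t > 0:
            # Re-inject a small amount of noise on every step except the last.
            sigma = math.sqrt(
                (1.0 - alpha_bars[t - 1]) / (1.0 - alpha_bars[t]) * betas[t]
            )
            x = [xi + sigma * rng.gauss(0.0, 1.0) for xi in x]
    return x


def zero_noise_predictor(x, t):
    """Hypothetical placeholder for the trained model: predicts no noise."""
    return [0.0] * len(x)


# Assumed schedule for illustration: 50 linearly spaced steps.
num_steps = 50
betas = [1e-4 + (0.05 - 1e-4) * i / (num_steps - 1) for i in range(num_steps)]

audio = reverse_diffusion(zero_noise_predictor, 1000, betas, random.Random(0))
```

Swapping the placeholder for a trained noise-prediction network, conditioned on a spectrogram, is what would turn this loop into a vocoder.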

Quick Start & Requirements

Install with pip (pip install diffwave) or build from source.
Requires Python and PyTorch.
Pretrained models and audio samples are available.
See official documentation for detailed setup and training.

Highlighted Details

Achieves a real-time factor of 0.87 for speech synthesis using a pretrained model.
Supports fast sampling, stable training, high-quality synthesis, and mixed-precision/multi-GPU training.
Offers both command-line and programmatic inference APIs.
Includes unconditional waveform synthesis capabilities.
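
The real-time factor quoted above can be read with a small calculation. The sketch below assumes the conventional definition (synthesis time divided by the duration of the audio produced, so values below 1.0 mean faster than real time). The function name and the timing numbers are hypothetical, not measurements from the project.

```python
def real_time_factor(synthesis_seconds, audio_seconds):
    """Return synthesis time divided by audio duration (lower is faster)."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


# Hypothetical timing: 8.7 s of compute to produce 10 s of audio.
rtf = real_time_factor(8.7, 10.0)
print(f"RTF = {rtf:.2f}")  # about 0.87 under these assumed numbers
print("faster than real time" if rtf < 1.0 else "slower than real time")
```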

Maintenance & Community

The project has received contributions and pointers from the lead author of the DiffWave paper. The most recent updates noted in the README date from late 2021, and the health check below lists the repository as inactive.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README. Users should verify licensing for commercial or closed-source use.

Limitations & Caveats

The README does not specify compatibility with different operating systems or hardware beyond mentioning GPU training. The project's last update was in late 2021, so newer PyTorch versions or dependencies might require adjustments.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Star History

5 stars in the last 30 days