DeepSpeech  by mozilla

Open-source speech-to-text engine for on-device inference

Created 9 years ago
26,600 stars

Top 1.4% on SourcePulse

GitHubView on GitHub
Project Summary

DeepSpeech is an open-source, embedded speech-to-text engine designed for real-time, offline operation on a wide range of hardware, from Raspberry Pi to high-power GPUs. It targets developers and researchers needing on-device transcription capabilities.

How It Works

The engine utilizes a machine learning model based on Baidu's Deep Speech research paper, implemented using Google's TensorFlow. This approach allows for efficient, on-device processing without requiring cloud connectivity.

Quick Start & Requirements

Highlighted Details

  • Supports on-device, real-time transcription.
  • Trained using machine learning techniques.
  • Implemented with TensorFlow for easier development.

Maintenance & Community

Licensing & Compatibility

  • License: MPL 2.0.
  • Compatibility: Permissive license suitable for commercial and closed-source applications.

Limitations & Caveats

The project's README does not detail specific performance benchmarks or known limitations regarding accuracy across different languages or accents.

Health Check
Last Commit

3 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
56 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.