porcupine  by Picovoice

On-device wake word detection engine

Created 8 years ago
4,702 stars

Top 10.4% on SourcePulse

GitHubView on GitHub
Project Summary

Summary Picovoice Porcupine offers highly accurate, lightweight, on-device wake word detection using deep neural networks. It enables developers to build always-listening voice applications across diverse platforms, from microcontrollers to web browsers, providing a compact and efficient solution for voice command interfaces.

How It Works Porcupine utilizes deep neural networks trained on real-world data for high accuracy. Its design prioritizes compactness and computational efficiency, ideal for resource-constrained environments like IoT. The engine scales to detect multiple wake words concurrently without increasing runtime overhead.

Quick Start & Requirements Installation varies by platform, with SDKs available via package managers (pip, npm/yarn, dotnet) and direct integration for C, Java, Flutter, React Native, Android, and iOS. All integrations require an ACCESS_KEY from the Picovoice Console. Demos typically need a microphone. Platform-specific SDKs may have additional environment setup needs. Links to demos and documentation are embedded within the repository structure.

Highlighted Details

  • Broad Platform Support: Operates on microcontrollers (Arm Cortex-M, STM32, Arduino), SBCs (Raspberry Pi), mobile (Android, iOS), desktop (Linux, macOS, Windows), and web browsers (WebAssembly).
  • Performance: Claims 11.0x higher accuracy and 6.5x faster speed than alternatives on Raspberry Pi 3.
  • Custom Wake Words: Enables self-service training of custom wake word models via the Picovoice Console.
Health Check
Last Commit

1 week ago

Responsiveness

Inactive

Pull Requests (30d)
6
Issues (30d)
6
Star History
81 stars in the last 30 days

Explore Similar Projects

Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
1 more.

moonshine by moonshine-ai

9.0%
4k
Speech-to-text models optimized for fast, accurate ASR on edge devices
Created 1 year ago
Updated 2 days ago
Starred by Jiaming Song Jiaming Song(Chief Scientist at Luma AI), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
1 more.

RealtimeSTT by KoljaB

0.2%
9k
Speech-to-text library for realtime applications
Created 2 years ago
Updated 7 months ago
Feedback? Help us improve.