TEN-framework
Low-latency voice activity detection for real-time AI
Top 26.8% on SourcePulse
TEN VAD is a voice activity detector designed for real-time conversational AI, offering low latency and high performance. It targets developers building voice-enabled applications, providing superior accuracy and efficiency compared to common alternatives like WebRTC VAD and Silero VAD.
How It Works
TEN VAD employs a proprietary architecture optimized for temporal efficiency, enabling rapid speech-to-non-speech transition detection. This approach minimizes end-to-end latency in conversational AI systems and effectively handles short silences between speech segments, a common failure point for other VADs.
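Why fast transition detection matters can be illustrated with a small, generic sketch. It is not TEN VAD's algorithm or API: `flags_to_segments`, `hop_ms`, and `hangover_frames` are hypothetical names, and the frame flags are made-up data. The sketch assumes a detector that emits one speech/non-speech flag per frame and shows how those flags become speech segments. A detector that needs a long hangover before declaring non-speech both delays the end-of-speech decision and swallows short silences between segments.

```python
from typing import List, Sequence, Tuple


def flags_to_segments(
    flags: Sequence[int], hop_ms: int, hangover_frames: int = 0
) -> List[Tuple[int, int]]:
    """Convert per-frame speech flags (1 = speech) into (start_ms, end_ms) segments.

    A segment is closed only after more than `hangover_frames` consecutive
    non-speech frames. All names here are illustrative assumptions.
    """
    segments: List[Tuple[int, int]] = []
    start = None       # index of the first frame of the open segment
    silence_run = 0    # consecutive non-speech frames inside the open segment

    for i, flag in enumerate(flags):
        if flag:
            if start is None:
                start = i
            silence_run = 0
        elif start is not None:
            silence_run += 1
            if silence_run > hangover_frames:
                end = i - silence_run + 1  # first non-speech frame of the run
                segments.append((start * hop_ms, end * hop_ms))
                start = None
                silence_run = 0

    if start is not None:
        # Audio ended while a segment was still open; trim trailing silence.
        end = len(flags) - silence_run
        segments.append((start * hop_ms, end * hop_ms))
    return segments


# Made-up frame flags containing a two-frame silence between two utterances.
flags = [0, 1, 1, 1, 0, 0, 1, 1, 0, 0, 0, 0]

# No hangover: the short silence is detected and two segments are reported.
print(flags_to_segments(flags, hop_ms=16))                     # [(16, 64), (96, 128)]

# Three-frame hangover: the short silence is merged into one segment, and the
# end of speech is only confirmed four frames after it actually happened.
print(flags_to_segments(flags, hop_ms=16, hangover_frames=3))  # [(16, 128)]
```

With a hangover of zero the end of speech is confirmed on the first non-speech frame, which is the behavior a low-latency detector aims for. Larger hangovers trade that responsiveness for robustness against spurious frame decisions.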
Quick Start & Requirements
git clone https://github.com/TEN-framework/ten-vad.git
pip install -r requirements.txt (for examples/plotting)
pip install -U --force-reinstall -v git+https://github.com/TEN-framework/ten-vad.git (for direct use)
Requires onnxruntime >= 1.17.1. C/C++ usage requires Clang/Visual Studio/Xcode and CMake.
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats