Chinese voice assistant and smart speaker project
This project provides a modular, flexible Chinese voice dialogue robot and smart speaker system aimed at makers and hackers in China. It integrates AI services such as ChatGPT for multi-turn conversations and offers unusual features such as brain-computer interface (BCI) wake-up, so that users can build personalized smart speaker experiences.
How It Works
The system operates on a pipeline: wake-up word detection (offline or BCI), Automatic Speech Recognition (ASR) to convert speech to text, Natural Language Understanding (NLU) for parsing, skill matching to identify the appropriate plugin, plugin execution, Text-to-Speech (TTS) synthesis, and finally, audio playback. This modular design allows each component, including ASR, TTS, and dialogue engines, to be independently customized or extended, supporting various third-party integrations and online/offline dialogue models.
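The pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the `Skill` and `Pipeline` classes, their field names, and the keyword-based matching are all assumptions made for the example, and each stage is a stub callable standing in for a real wake-word, ASR, NLU, or TTS engine.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Skill:
    """Hypothetical plugin: handles an utterance if any keyword appears in it."""
    name: str
    keywords: List[str]
    handle: Callable[[str], str]

    def matches(self, text: str) -> bool:
        return any(keyword in text for keyword in self.keywords)


def default_fallback(text: str) -> str:
    """Reply used when no skill matches (a stand-in for a dialogue engine)."""
    return "Sorry, I did not understand."


@dataclass
class Pipeline:
    """Every stage is a plain callable, so each one can be swapped on its own."""
    detect_wake: Callable[[bytes], bool]   # wake-up word (or BCI) detection
    asr: Callable[[bytes], str]            # speech -> text
    nlu: Callable[[str], str]              # text -> normalized/parsed text
    tts: Callable[[str], bytes]            # reply text -> audio
    play: Callable[[bytes], None]          # audio playback
    skills: List[Skill] = field(default_factory=list)
    fallback: Callable[[str], str] = default_fallback

    def run_once(self, wake_audio: bytes, query_audio: bytes) -> Optional[str]:
        # Stay idle unless the wake-up stage fires.
        if not self.detect_wake(wake_audio):
            return None
        parsed = self.nlu(self.asr(query_audio))
        # Skill matching: the first skill whose keywords match wins.
        skill = next((s for s in self.skills if s.matches(parsed)), None)
        reply = skill.handle(parsed) if skill else self.fallback(parsed)
        self.play(self.tts(reply))
        return reply


# Stub stages for illustration only; real engines would replace each lambda.
played: List[bytes] = []
pipeline = Pipeline(
    detect_wake=lambda audio: audio == b"wake",
    asr=lambda audio: audio.decode("utf-8"),
    nlu=lambda text: text.strip().lower(),
    tts=lambda text: text.encode("utf-8"),
    play=played.append,
    skills=[Skill("time", ["time"], lambda _text: "It is noon.")],
)
```

Calling `pipeline.run_once(b"wake", b"What time is it")` would route the query to the stub `time` skill and append the synthesized reply to `played`. Because each stage is just a callable, replacing the ASR or TTS engine means passing a different function, which mirrors the independent customization the text describes.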
Quick Start & Requirements
Run: python3 wukong.py
Demo: https://bot.hahack.com (user: wukong, pass: wukong@2019)
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project explicitly states it is for personal learning and research, with no liability for any losses incurred from its use. It also notes that Intel Macs are supported, but M1 Macs are not. Users are advised to use their own API keys for services due to usage limits.