Discover and explore top open-source AI tools and projects—updated daily.
MaggotxyESP32 AI robot firmware for voice interaction and music
Top 97.8% on SourcePulse
This project provides open-source firmware for the Xiaozhi AI robot, enabling music playback with lyrics display on ESP32-based hardware. Targeting hobbyists and developers, it enhances the robot's functionality by integrating AI voice interaction with media control, offering a flexible platform for custom projects and commercial applications under the permissive MIT license.
How It Works
The firmware leverages the Xiaozhi AI Chat Robot framework, utilizing large language models (LLMs) like Qwen or DeepSeek for voice interaction via the MCP protocol. It specifically implements a self.music.play_song tool for music playback, supporting OPUS audio codec and lyrics display. The architecture follows a streaming ASR + LLM + TTS pipeline, enabling on-device hardware control (volume, GPIO) and cloud-based extensions for smart home or PC operations.
Quick Start & Requirements
xiaozhi.me server for free Qwen model access. For development, use VSCode with ESP-IDF (v5.4+) on Linux (preferred) or Windows.Highlighted Details
self.music.play_song tool with lyrics display.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
xiaozhi.me) to enable the MPC tool.2 months ago
Inactive