Realtime speech AI agents for ESP32 devices
Top 35.4% on sourcepulse
ElatoAI provides a framework for building real-time AI speech agents on ESP32 microcontrollers, targeting hobbyists and developers creating AI companions or toys. It enables extended, uninterrupted conversations by integrating OpenAI's Realtime API, Deno edge functions, and secure WebSockets.
How It Works
The system comprises a Next.js frontend for AI agent management, Deno edge functions for handling WebSocket connections and API calls, and an ESP32 client for audio processing and communication. Speech is captured by the ESP32, sent via secure WebSockets to Deno edge functions, processed by OpenAI's Realtime API, and the AI's response is streamed back to the ESP32 for playback. This architecture leverages edge computing for low latency and Opus compression for efficient audio streaming.
Quick Start & Requirements
npm install
and npm run dev
, Deno server deno run -A --env-file=.env main.ts
. ESP32 firmware upload and Wi-Fi configuration via captive portal.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
5 days ago
Inactive