Voice AI agent framework for building realtime applications
Top 7.5% on sourcepulse
This framework enables the creation of real-time voice AI agents that can see, hear, and speak, targeting developers building server-side agentic applications. It offers flexible integrations with various AI models and seamless WebRTC and telephony capabilities, reducing interruptions with semantic turn detection.
How It Works
The framework uses an Agent-AgentSession model, where Agents define instructions and tools, and AgentSessions manage interactions, orchestrating Speech-to-Text (STT), Large Language Models (LLM), Text-to-Speech (TTS), and Voice Activity Detection (VAD). It leverages LiveKit's WebRTC infrastructure for real-time communication and supports custom plugins for different model providers, allowing for modular and adaptable agent development.
Quick Start & Requirements
pip install "livekit-agents[openai,silero,deepgram,cartesia,turn-detector]~=1.0"
LIVEKIT_URL
, LIVEKIT_API_KEY
, LIVEKIT_API_SECRET
, DEEPGRAM_API_KEY
, OPENAI_API_KEY
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
3 days ago
1 day