agents by livekit

Voice AI agent framework for building realtime applications

Created 2 years ago

8,994 stars

Top 5.7% on SourcePulse

View on GitHub

9 Experts Love This Project

Russ d'Sa

Cofounder of LiveKit

Chip Huyen

Author of "AI Engineering", "Designing Machine Learning Systems"

Chris Van Pelt

Cofounder of Weights & Biases

Elvis Saravia

Founder of DAIR.AI

and 5 more!

Project Summary

This framework enables the creation of real-time voice AI agents that can see, hear, and speak, targeting developers building server-side agentic applications. It offers flexible integrations with various AI models and seamless WebRTC and telephony capabilities, reducing interruptions with semantic turn detection.

How It Works

The framework uses an Agent-AgentSession model, where Agents define instructions and tools, and AgentSessions manage interactions, orchestrating Speech-to-Text (STT), Large Language Models (LLM), Text-to-Speech (TTS), and Voice Activity Detection (VAD). It leverages LiveKit's WebRTC infrastructure for real-time communication and supports custom plugins for different model providers, allowing for modular and adaptable agent development.

Quick Start & Requirements

Install core library and plugins: pip install "livekit-agents[openai,silero,deepgram,cartesia,turn-detector]~=1.0"
Environment variables required: LIVEKIT_URL, LIVEKIT_API_KEY, LIVEKIT_API_SECRET, DEEPGRAM_API_KEY, OPENAI_API_KEY.
Documentation: https://docs.livekit.io/agents/
Playground: https://agents.livekit.io/

Highlighted Details

Supports multi-agent handoffs and complex conversational flows.
Integrates with LiveKit's telephony stack for outbound/inbound calls.
Offers semantic turn detection using transformer models.
Provides various example applications, including vision-enabled agents.

Maintenance & Community

Actively developed with a focus on rapid evolution in AI.
Contributions are welcomed via issues, PRs, and the LiveKit Slack community.

Licensing & Compatibility

Primarily Apache 2.0 licensed, allowing for commercial use and closed-source linking.

Limitations & Caveats

Requires specific API keys for integrated services (OpenAI, Deepgram).
The framework is under active development, implying potential for breaking changes.

Health Check

Last Commit

23 hours ago

Responsiveness

1 day

Pull Requests (30d)

172

Issues (30d)

145

Star History

579 stars in the last 30 days