vocode-core  by vocodedev

Open-source library for building voice-based LLM agents

created 2 years ago
3,393 stars

Top 14.7% on sourcepulse

Project Summary

Vocode is a library for building real-time, voice-based LLM applications, aimed at developers and researchers working on conversational AI. It simplifies building voice agents that interact over phone calls, Zoom, or system audio, and it is organized as modular components that work together out of the box.

How It Works

Vocode employs a streaming architecture for low-latency, real-time voice interactions. It integrates various third-party services for transcription (e.g., Deepgram, Whisper.cpp), LLM processing (e.g., OpenAI, Anthropic), and speech synthesis (e.g., Eleven Labs, Azure TTS). This modular design allows users to select and combine preferred services, abstracting away complex integration details for a streamlined development experience.
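To make the streaming, modular design concrete, here is a minimal sketch of a three-stage pipeline (transcription, LLM agent, synthesis) built from async generators. It does not use Vocode's API: every name in it (`fake_transcriber`, `fake_agent`, `fake_synthesizer`, `build_pipeline`, `microphone`) is hypothetical, and the stages are trivial stand-ins for real services such as Deepgram, OpenAI, or Azure TTS.

```python
import asyncio
from typing import AsyncIterator, Callable, List

# All names below are hypothetical stand-ins, NOT Vocode's real classes.
# Each stage consumes a stream and yields results as soon as they are ready,
# so no stage waits for the whole conversation before producing output.

async def microphone(chunks: List[bytes]) -> AsyncIterator[bytes]:
    """Pretend audio source: emits one chunk at a time."""
    for chunk in chunks:
        yield chunk
        await asyncio.sleep(0)  # hand control back to the event loop

async def fake_transcriber(audio: AsyncIterator[bytes]) -> AsyncIterator[str]:
    """Stand-in for a speech-to-text service: decodes bytes as text."""
    async for chunk in audio:
        yield chunk.decode("utf-8")

async def fake_agent(utterances: AsyncIterator[str]) -> AsyncIterator[str]:
    """Stand-in for an LLM: echoes each utterance back."""
    async for text in utterances:
        yield f"You said: {text}"

async def fake_synthesizer(replies: AsyncIterator[str]) -> AsyncIterator[bytes]:
    """Stand-in for a text-to-speech service: encodes text as bytes."""
    async for reply in replies:
        yield reply.encode("utf-8")

Stage = Callable[[AsyncIterator], AsyncIterator]

def build_pipeline(source: AsyncIterator[bytes],
                   transcriber: Stage,
                   agent: Stage,
                   synthesizer: Stage) -> AsyncIterator[bytes]:
    """Chain the stages; swapping a provider means passing a different stage."""
    return synthesizer(agent(transcriber(source)))

async def run_demo(chunks: List[bytes]) -> List[bytes]:
    pipeline = build_pipeline(
        microphone(chunks), fake_transcriber, fake_agent, fake_synthesizer
    )
    return [audio async for audio in pipeline]
```

Calling `asyncio.run(run_demo([b"hello"]))` pushes one chunk through all three stages. In a real system each stage would wrap a network service, but the composition idea is the same: a provider can be replaced without touching the other stages.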

Quick Start & Requirements

  • Install via pip: pip install vocode
  • Requires API keys for chosen LLM, transcription, and synthesis services (e.g., OpenAI, Deepgram, Azure Speech).
  • Example code demonstrates setup with system audio, Deepgram, ChatGPT, and Azure Synthesizer.
  • Official documentation and community Discord are available for further guidance.
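Since the quickstart depends on credentials for several services, a small pre-flight check can report which ones are missing before anything runs. The sketch below is illustrative only: the environment variable names are assumptions and should be checked against the official documentation, and `missing_keys` is a hypothetical helper, not part of Vocode.

```python
import os
from typing import Dict, List, Mapping

# Assumed environment variable names; confirm them in the official docs.
REQUIRED_KEYS: Dict[str, str] = {
    "OPENAI_API_KEY": "OpenAI (LLM)",
    "DEEPGRAM_API_KEY": "Deepgram (transcription)",
    "AZURE_SPEECH_KEY": "Azure Speech (synthesis)",
}

def missing_keys(env: Mapping[str, str] = os.environ) -> List[str]:
    """Return the names of required variables that are unset or empty."""
    return [name for name in REQUIRED_KEYS if not env.get(name)]

if __name__ == "__main__":
    for name in missing_keys():
        print(f"Missing {name}, needed for {REQUIRED_KEYS[name]}")
```

Passing the environment as a parameter keeps the helper easy to test with a plain dictionary instead of the real process environment.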

Highlighted Details

  • Supports real-time streaming conversations with LLMs.
  • Integrates with multiple telephony providers for phone calls, and supports joining Zoom meetings.
  • Offers out-of-the-box integrations with numerous transcription and synthesis services.
  • Provides a React SDK for building front-end interfaces.

Maintenance & Community

  • Actively seeking community maintainers.
  • Roadmap and contribution guides are available.
  • Community support via Discord.

Licensing & Compatibility

  • The project is licensed under the MIT License.
  • Permissive licensing allows for commercial use and integration into closed-source applications.

Limitations & Caveats

The provided quickstart example requires specific API keys for OpenAI, Azure, and Deepgram, which are not included. Users must obtain and configure these credentials to run the example.

Health Check

  • Last commit: 8 months ago
  • Responsiveness: 1+ week
  • Pull requests (30d): 1
  • Issues (30d): 1
  • Star history: 116 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), Andre Zayarni (Cofounder of Qdrant), and 2 more.

RealChar by Shaunwei

Real-time AI character/companion creation and interaction codebase
0.1%, 6k, created 2 years ago, updated 1 year ago