Open-source library for building voice-based LLM agents
Top 14.7% on sourcepulse
Vocode is an open-source library for building real-time, voice-based LLM applications, aimed at developers and researchers working on conversational AI. It simplifies the creation of voice agents that can interact over phone calls, Zoom, or system audio, offering a modular yet integrated solution.
How It Works
Vocode employs a streaming architecture for low-latency, real-time voice interactions. It integrates third-party services for transcription (e.g., Deepgram, whisper.cpp), LLM responses (e.g., OpenAI, Anthropic), and speech synthesis (e.g., ElevenLabs, Azure TTS). This modular design lets users pick and combine the providers they prefer while the library handles the integration details.
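To make the streaming idea concrete, here is a toy sketch in plain asyncio (not Vocode's actual API; all names are hypothetical) that chains three swappable stages the same way the library chains a transcriber, an agent, and a synthesizer:

    import asyncio

    # Hypothetical stand-ins for the three pipeline stages; in Vocode these map to
    # a transcriber (e.g. Deepgram, whisper.cpp), an agent (e.g. OpenAI, Anthropic),
    # and a synthesizer (e.g. ElevenLabs, Azure TTS).
    async def transcribe(audio_chunks):
        async for chunk in audio_chunks:
            yield f"transcript of {chunk}"

    async def respond(transcripts):
        async for text in transcripts:
            yield f"LLM reply to '{text}'"

    async def synthesize(replies):
        async for reply in replies:
            yield f"synthesized audio for '{reply}'"

    async def microphone():
        # Stand-in for a live microphone stream.
        for i in range(3):
            yield f"audio chunk {i}"

    async def main():
        # Chunks flow through all three stages as they arrive instead of waiting
        # for a full utterance, which is what keeps end-to-end latency low.
        async for audio in synthesize(respond(transcribe(microphone()))):
            print(audio)

    asyncio.run(main())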
Quick Start & Requirements
Install from PyPI:

    pip install vocode
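For a first test, the sketch below wires a microphone and speaker into a streaming conversation. It is adapted from the project's documented quickstart, so treat the exact import paths, class names, and helper signatures as assumptions to verify against the docs for your installed version.

    # Sketch adapted from the streaming quickstart; import paths, class names, and
    # config helpers reflect an older release and may differ in newer versions.
    # Assumes OpenAI, Deepgram, and Azure Speech credentials are already configured
    # (see Limitations & Caveats below).
    import asyncio

    from vocode.helpers import create_streaming_microphone_input_and_speaker_output
    from vocode.streaming.agent.chat_gpt_agent import ChatGPTAgent
    from vocode.streaming.models.agent import ChatGPTAgentConfig
    from vocode.streaming.models.message import BaseMessage
    from vocode.streaming.models.synthesizer import AzureSynthesizerConfig
    from vocode.streaming.models.transcriber import (
        DeepgramTranscriberConfig,
        PunctuationEndpointingConfig,
    )
    from vocode.streaming.streaming_conversation import StreamingConversation
    from vocode.streaming.synthesizer.azure_synthesizer import AzureSynthesizer
    from vocode.streaming.transcriber.deepgram_transcriber import DeepgramTranscriber


    async def main():
        # Capture microphone audio and play synthesized speech on the default devices.
        microphone_input, speaker_output = create_streaming_microphone_input_and_speaker_output(
            use_default_devices=True,
        )

        conversation = StreamingConversation(
            output_device=speaker_output,
            transcriber=DeepgramTranscriber(
                DeepgramTranscriberConfig.from_input_device(
                    microphone_input,
                    endpointing_config=PunctuationEndpointingConfig(),
                )
            ),
            agent=ChatGPTAgent(
                ChatGPTAgentConfig(
                    initial_message=BaseMessage(text="Hello!"),
                    prompt_preamble="Have a friendly conversation.",
                )
            ),
            synthesizer=AzureSynthesizer(
                AzureSynthesizerConfig.from_output_device(speaker_output)
            ),
        )

        await conversation.start()
        print("Conversation started; press Ctrl+C to stop.")
        while conversation.is_active():
            chunk = await microphone_input.get_audio()
            conversation.receive_audio(chunk)


    if __name__ == "__main__":
        asyncio.run(main())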
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Vocode is distributed under the MIT license.
Limitations & Caveats
The quickstart example requires API keys for OpenAI, Azure Speech, and Deepgram; these are not included, so users must obtain the credentials themselves and configure them before running it.
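A common approach is to supply the credentials as environment variables. The variable names below follow the quickstart docs for an older release, so treat them as assumptions; a small pre-flight check like this catches a missing key before the conversation starts.

    import os

    # Credential variable names assumed from the quickstart docs for an older
    # release; verify them against the documentation for your installed version.
    REQUIRED_KEYS = [
        "OPENAI_API_KEY",
        "DEEPGRAM_API_KEY",
        "AZURE_SPEECH_KEY",
        "AZURE_SPEECH_REGION",
    ]

    missing = [key for key in REQUIRED_KEYS if not os.environ.get(key)]
    if missing:
        raise RuntimeError(f"Missing required credentials: {', '.join(missing)}")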