bolna by voxos-ai

Open-source platform for building voice-driven multimodal agents

Created 2 years ago

432 stars

Top 68.1% on SourcePulse

Project Summary

Bolna is an end-to-end, open-source framework for building voice-first, multimodal conversational AI agents. It targets developers and researchers who want to build production-ready voice applications quickly, and it supports initiating phone calls, real-time transcription, LLM-driven conversations, and text-to-speech synthesis.

How It Works

Bolna orchestrates a pipeline of specialized components for voice interactions. Each stage is handled by an external provider: telephony (e.g., Twilio), automatic speech recognition (ASR, e.g., Deepgram), large language models (LLMs, e.g., OpenAI or Mistral via LiteLLM), and text-to-speech (TTS, e.g., ElevenLabs or AWS Polly). Agents are configured in JSON, which defines the task flow, the toolchain (parallel or sequential processing), and the settings for each provider, so components can be swapped without changing agent logic.
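
To make the pipeline idea concrete, here is a minimal Python sketch of a JSON-configured chain that passes a caller's audio through a transcriber, an LLM, and a synthesizer in order. The config keys (toolchain, execution, pipeline, tools_config), the agent name, and the stub stage functions are all illustrative assumptions, not Bolna's actual schema or API; real stages would call the providers named above.

```python
import json

# Hypothetical agent config; key names are illustrative, not Bolna's real schema.
AGENT_CONFIG_JSON = """
{
  "agent_name": "demo-agent",
  "toolchain": {
    "execution": "sequential",
    "pipeline": ["transcriber", "llm", "synthesizer"]
  },
  "tools_config": {
    "transcriber": {"provider": "deepgram"},
    "llm": {"provider": "openai"},
    "synthesizer": {"provider": "elevenlabs"}
  }
}
"""


# Stub stages standing in for real ASR, LLM, and TTS provider calls.
def transcriber(payload, cfg):
    return f"text({payload})"


def llm(payload, cfg):
    return f"reply({payload})"


def synthesizer(payload, cfg):
    return f"audio({payload})"


STAGES = {"transcriber": transcriber, "llm": llm, "synthesizer": synthesizer}


def load_config(raw):
    """Parse the JSON config and check that every stage is known and configured."""
    config = json.loads(raw)
    toolchain = config["toolchain"]
    # The source also mentions parallel execution; this sketch handles only
    # the sequential case.
    if toolchain["execution"] != "sequential":
        raise ValueError(f"unsupported execution mode: {toolchain['execution']}")
    for name in toolchain["pipeline"]:
        if name not in STAGES:
            raise ValueError(f"unknown stage: {name}")
        if name not in config["tools_config"]:
            raise ValueError(f"missing provider config for stage: {name}")
    return config


def run_pipeline(config, payload):
    """Pass the payload through each configured stage in order."""
    for name in config["toolchain"]["pipeline"]:
        payload = STAGES[name](payload, config["tools_config"][name])
    return payload


config = load_config(AGENT_CONFIG_JSON)
result = run_pipeline(config, "caller-audio")
print(result)
```

Keeping the stage order in the config rather than in code is what lets a provider or an entire stage be changed by editing JSON alone.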

Quick Start & Requirements

Install/Run: Local setup uses Docker. Build images with docker-compose build --no-cache <twilio-app | plivo-app> and run with docker-compose up <twilio-app | plivo-app>.
Prerequisites: Requires Docker, a .env file with provider API keys (Twilio/Plivo, Deepgram, LLM provider, TTS provider), and ngrok for tunneling.
Resources: Local setup involves four Docker containers (telephony server, Bolna server, ngrok, redis).
Docs: https://github.com/bolna-ai/bolna
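
Since the setup depends on several provider keys in a .env file, a small preflight check can catch missing values before the containers are built. The following is a sketch only: the variable names in REQUIRED_KEYS are hypothetical placeholders, and the real names should be taken from the project's own documentation.

```python
# Hypothetical variable names; the project's documentation defines the real ones.
REQUIRED_KEYS = [
    "TWILIO_ACCOUNT_SID",
    "TWILIO_AUTH_TOKEN",
    "DEEPGRAM_API_KEY",
    "OPENAI_API_KEY",
    "ELEVENLABS_API_KEY",
]


def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(text, required=REQUIRED_KEYS):
    """Return the required keys that are absent or empty, in declaration order."""
    values = parse_env(text)
    return [key for key in required if not values.get(key)]


# Inline sample standing in for the contents of a real .env file.
sample = """
# provider credentials
TWILIO_ACCOUNT_SID=AC123
TWILIO_AUTH_TOKEN=
OPENAI_API_KEY="sk-test"
"""

missing = missing_keys(sample)
print(missing)
```

In practice the text would come from the file itself, for example pathlib.Path(".env").read_text(), and a non-empty result would stop the setup before docker-compose runs.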

Highlighted Details

Supports multiple telephony providers (Twilio, Plivo) for initiating calls.
Integrates with various ASR, LLM, and TTS providers through a unified interface; LLM access is routed through LiteLLM.
Agent behavior and conversation flow are defined declaratively using JSON configurations.
Offers extensibility for adding new telephony providers by implementing custom input/output handlers and a dedicated server.
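
The extensibility point about custom input/output handlers can be pictured with a small Python sketch. The class names, method names, and the relay helper below are hypothetical and do not reflect Bolna's actual base classes; the sketch only shows the general shape of pairing an input handler with an output handler for a new telephony provider.

```python
from abc import ABC, abstractmethod


class InputHandler(ABC):
    """Hypothetical interface: yields audio chunks arriving from a provider."""

    @abstractmethod
    def receive(self):
        """Return the next audio chunk, or None when the call has ended."""


class OutputHandler(ABC):
    """Hypothetical interface: sends synthesized audio back to the provider."""

    @abstractmethod
    def send(self, audio):
        """Deliver one audio chunk to the caller."""


class LoopbackInput(InputHandler):
    """In-memory stand-in for a provider's inbound audio stream."""

    def __init__(self, chunks):
        self._chunks = list(chunks)

    def receive(self):
        return self._chunks.pop(0) if self._chunks else None


class LoopbackOutput(OutputHandler):
    """In-memory stand-in that records what would be sent to the caller."""

    def __init__(self):
        self.sent = []

    def send(self, audio):
        self.sent.append(audio)


def relay(inp, out, process):
    """Read chunks until the input is exhausted, process each, and send it on."""
    while (chunk := inp.receive()) is not None:
        out.send(process(chunk))


inp = LoopbackInput([b"hello", b"world"])
out = LoopbackOutput()
relay(inp, out, bytes.upper)  # bytes.upper stands in for the agent pipeline
print(out.sent)
```

A real provider integration would replace the loopback classes with ones that read from and write to the provider's media stream, served by the dedicated server the list above mentions.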

Maintenance & Community

Community support is available through Discord, and project documentation is provided.
Contributions are welcomed via issues and pull requests.

Licensing & Compatibility

The repository is described as open-source, but the README does not name a specific license. The README also mentions managed hosted offerings.

Limitations & Caveats

Because the README does not name a license, suitability for commercial use or closed-source linking is unclear.
Local setup requires configuring multiple external service API keys and using ngrok for external access.

Health Check

Last Commit: 1 year ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 0

Star History

2 stars in the last 30 days

