agents  by videosdk-live

Real-time multimodal conversational AI agents framework

Created 3 months ago
439 stars

Top 68.0% on SourcePulse

GitHubView on GitHub
Project Summary

This framework enables the development of real-time, multimodal conversational AI agents that can join video conferencing rooms. It targets developers building AI-powered assistants for voice and media interactions, offering seamless integration with various AI models and communication platforms.

How It Works

The SDK acts as a bridge, connecting backend systems to the VideoSDK platform, allowing AI agents to participate in real-time audio and video conversations. It supports a cascading pipeline architecture for integrating different Speech-to-Text (STT), Large Language Model (LLM), and Text-to-Speech (TTS) providers, along with features like turn detection, virtual avatars, and function tools for extended capabilities.

Quick Start & Requirements

Highlighted Details

  • Real-time Audio/Video communication with agents.
  • SIP and Telephony integration for PSTN access.
  • Support for multiple AI models (OpenAI, Gemini, AWS NovaSonic).
  • Virtual avatar integration via Simli.
  • Function tools for extending agent actions (e.g., event scheduling).
  • Agent-to-Agent (A2A) and Model Context Protocol (MCP) integration.

Maintenance & Community

  • Actively developed with contributions welcomed.
  • Community support available via VideoSDK's Discord server.
  • Links to contributing guides and plugin development resources are provided.

Licensing & Compatibility

  • The specific license is not explicitly stated in the README, but it is presented as an open-source framework. Further clarification on licensing terms would be beneficial for commercial use.

Limitations & Caveats

  • Requires specific VideoSDK authentication tokens and meeting IDs.
  • Integration with third-party AI models necessitates obtaining and configuring their respective API keys.
  • The README mentions "playground=True" for meeting options, which might indicate a development or testing focus rather than production-ready deployment without further configuration.
Health Check
Last Commit

20 hours ago

Responsiveness

Inactive

Pull Requests (30d)
15
Issues (30d)
0
Star History
142 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Andre Zayarni Andre Zayarni(Cofounder of Qdrant), and
6 more.

RealChar by Shaunwei

0.1%
6k
Real-time AI character/companion creation and interaction codebase
Created 2 years ago
Updated 1 year ago
Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify), Joe Walnes Joe Walnes(Head of Experimental Projects at Stripe), and
12 more.

LibreChat by danny-avila

0.7%
30k
Enhanced ChatGPT clone for self-hosting
Created 2 years ago
Updated 1 day ago
Feedback? Help us improve.