agents-js by livekit

Node.js framework for building realtime multimodal AI agents

created 1 year ago
499 stars

Top 63.1% on sourcepulse

Project Summary

This project is a Node.js framework for building realtime, multimodal AI agents, such as conversational voice agents that process audio and text input. It targets developers who want to add AI capabilities to realtime communication platforms, and agents built with it run as server-side participants.

How It Works

The framework utilizes a worker-based architecture where server-side processes (workers) manage and orchestrate AI agents. Agents are defined as functions that can compose various plugins for specific tasks like Speech-to-Text (STT), Large Language Models (LLM), and Text-to-Speech (TTS). This modular design allows for flexible integration with different AI service providers. A new phrase endpointing model is available for improved turn detection.
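The "agent as a function that composes plugins" idea can be sketched in a few lines of TypeScript. This is a minimal illustration under stated assumptions, not the framework's API: the interfaces `SpeechToText`, `LanguageModel`, and `TextToSpeech`, the `createVoiceAgent` helper, and the stub providers are all hypothetical names invented for this example.

```typescript
// Hypothetical plugin interfaces for illustration only.
// They are NOT the real @livekit/agents types.
interface SpeechToText {
  transcribe(audio: Uint8Array): Promise<string>;
}
interface LanguageModel {
  complete(prompt: string): Promise<string>;
}
interface TextToSpeech {
  synthesize(text: string): Promise<Uint8Array>;
}
interface Plugins {
  stt: SpeechToText;
  llm: LanguageModel;
  tts: TextToSpeech;
}

// An agent is a function: one chunk of user audio in, reply audio out.
type VoiceAgent = (audio: Uint8Array) => Promise<Uint8Array>;

// Compose the three plugins into a single STT -> LLM -> TTS pipeline.
function createVoiceAgent(plugins: Plugins): VoiceAgent {
  return async (audio: Uint8Array): Promise<Uint8Array> => {
    const userText = await plugins.stt.transcribe(audio);
    const replyText = await plugins.llm.complete(userText);
    return plugins.tts.synthesize(replyText);
  };
}

// Stub providers so the sketch runs without network access or API keys.
const stubPlugins: Plugins = {
  stt: { transcribe: async (audio) => `heard ${audio.length} bytes` },
  llm: { complete: async (prompt) => `echo: ${prompt}` },
  // Fake "audio": one zero byte per character of reply text.
  tts: { synthesize: async (text) => new Uint8Array(text.length) },
};

const agent: VoiceAgent = createVoiceAgent(stubPlugins);
```

Because the agent depends only on the three interfaces, swapping one provider for another means replacing a single field of `stubPlugins`; the pipeline itself stays unchanged. Calling `agent(someAudioChunk)` returns a promise that resolves to the synthesized reply.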

Quick Start & Requirements

  • Install core library: pnpm install @livekit/agents
  • Install plugins: pnpm install @livekit/agents-plugin-openai (example)
  • Environment variables required: LIVEKIT_URL, LIVEKIT_API_KEY, LIVEKIT_API_SECRET, and provider-specific keys (e.g., OPENAI_API_KEY).
  • Run agent: node my_agent.js start
  • Documentation: LiveKit Agents Docs
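Since the agent depends on several environment variables, it can help to check them before starting. The following is a small sketch, not part of the framework: `findMissingVars` and `REQUIRED_VARS` are hypothetical names, and `OPENAI_API_KEY` is included only on the assumption that the OpenAI plugin is in use.

```typescript
// Variable names come from the Quick Start list; OPENAI_API_KEY is an
// assumption that applies only when the OpenAI plugin is installed.
const REQUIRED_VARS: readonly string[] = [
  "LIVEKIT_URL",
  "LIVEKIT_API_KEY",
  "LIVEKIT_API_SECRET",
  "OPENAI_API_KEY",
];

type Env = Record<string, string | undefined>;

// Return the names of required variables that are unset or blank.
function findMissingVars(
  env: Env,
  required: readonly string[] = REQUIRED_VARS,
): string[] {
  return required.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}
```

In a Node.js entry point this could be called as `findMissingVars(process.env)` before the agent starts, exiting with a clear message if the returned list is not empty.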

Highlighted Details

  • Supports multiple AI service providers via plugins (OpenAI, Deepgram, ElevenLabs, etc.).
  • Includes a "playground" web frontend for agent development and testing.
  • Offers a new, CPU-optimized phrase endpointing model for better conversational flow.
  • Agents are stateful and managed by workers, designed for production resilience.

Maintenance & Community

  • The project is in beta, with the Python version recommended for production.
  • Community support is available via LiveKit Community Slack.
  • Feedback and contributions are welcomed.

Licensing & Compatibility

  • Licensed under Apache-2.0; the repository is REUSE-3.2 compliant.
  • Permissive license suitable for commercial use and integration into closed-source applications.

Limitations & Caveats

This SDK is in beta, meaning APIs may change and bugs may be present. The Python version of the framework is considered more mature and recommended for production use.

Health Check

  • Last commit: 3 days ago
  • Responsiveness: 1 day
  • Pull requests (30d): 77
  • Issues (30d): 8

Star History

115 stars in the last 90 days
