agents-js by livekit

Node.js framework for building realtime multimodal AI agents

Created 1 year ago

715 stars

Top 48.1% on SourcePulse

View on GitHub

2 Experts Love This Project

Project Summary

This project provides a Node.js framework for building real-time, multimodal AI agents, enabling developers to create conversational voice agents that can process audio and text inputs. It's targeted at developers looking to integrate advanced AI capabilities into real-time communication platforms, offering a server-side participant framework.

How It Works

The framework utilizes a worker-based architecture where server-side processes (workers) manage and orchestrate AI agents. Agents are defined as functions that can compose various plugins for specific tasks like Speech-to-Text (STT), Large Language Models (LLM), and Text-to-Speech (TTS). This modular design allows for flexible integration with different AI service providers. A new phrase endpointing model is available for improved turn detection.

Quick Start & Requirements

Install core library: pnpm install @livekit/agents
Install plugins: pnpm install @livekit/agents-plugin-openai (example)
Environment variables required: LIVEKIT_URL, LIVEKIT_API_KEY, LIVEKIT_API_SECRET, and provider-specific keys (e.g., OPENAI_API_KEY).
Run agent: node my_agent.js start
Documentation: LiveKit Agents Docs

Highlighted Details

Supports multiple AI service providers via plugins (OpenAI, Deepgram, ElevenLabs, etc.).
Includes a "playground" web frontend for agent development and testing.
Offers a new, CPU-optimized phrase endpointing model for better conversational flow.
Agents are stateful and managed by workers, designed for production resilience.

Maintenance & Community

The project is in beta, with the Python version recommended for production.
Community support is available via LiveKit Community Slack.
Feedback and contributions are welcomed.

Licensing & Compatibility

Licensed under Apache-2.0 and REUSE-3.2 compliant.
Permissive license suitable for commercial use and integration into closed-source applications.

Limitations & Caveats

This SDK is in beta, meaning APIs may change and bugs may be present. The Python version of the framework is considered more mature and recommended for production use.

Health Check

Last Commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

33 stars in the last 30 days