Discover and explore top open-source AI tools and projects—updated daily.
Joinly is an open-source, self-hosted middleware designed to enable AI agents to actively participate in video calls across platforms like Google Meet, Zoom, and Microsoft Teams. It provides AI agents with real-time interaction capabilities through voice and chat, facilitating natural conversational flows and task execution within meetings.
How It Works
Joinly operates as a connector middleware, utilizing an MCP (Meeting Control Protocol) server to expose essential meeting tools and resources to AI agents. It supports a modular design for Speech-to-Text (STT) and Text-to-Speech (TTS) services, allowing users to choose providers like Whisper, Deepgram, Kokoro, and ElevenLabs. The system is built to handle interruptions and multi-speaker interactions, ensuring a seamless conversational experience.
Quick Start & Requirements
.env
file with LLM API keys (e.g., OpenAI, Anthropic, Ollama).docker pull ghcr.io/joinly-ai/joinly:latest
followed by docker run --env-file .env ghcr.io/joinly-ai/joinly:latest --client <MeetingURL>
.ghcr.io/joinly-ai/joinly:latest-cuda
and --gpus all
.Highlighted Details
Maintenance & Community
The project is actively maintained with a roadmap outlining future features like camera integration, screen sharing, and improved client memory. Community support is available via Discord and GitHub Discussions.
Licensing & Compatibility
Licensed under the MIT License, permitting commercial use and integration with closed-source applications.
Limitations & Caveats
The Docker image is substantial (~2.3GB) due to bundled browser and models. GPU support requires specific CUDA versions and NVIDIA drivers. Some roadmap features, such as camera integration and improved client memory, are still under development.
1 week ago
Inactive