Discover and explore top open-source AI tools and projects—updated daily.
dsd2077Digital human agents with real-time video interaction
Top 44.1% on SourcePulse
CyberVerse provides an open-source platform for creating interactive digital human agents capable of real-time, face-to-face video conversations. It allows users to bring AI characters to life from a single photograph, offering a unique way to interact with AI that goes beyond traditional avatars. The platform is designed for users seeking advanced AI companions or developers looking to integrate lifelike AI agents into their applications.
How It Works
The system leverages WebRTC for real-time, peer-to-peer video streaming with low latency, incorporating embedded TURN/NAT traversal for connectivity. Digital humans are animated using state-of-the-art models like FlashHead and LiveAct, which generate facial expressions, lip-sync, and subtle movements from a single input photo without requiring 3D modeling or motion capture. CyberVerse features a modular, plugin-based architecture, allowing users to swap components such as Large Language Models (LLMs), Text-to-Speech (TTS) engines, Automatic Speech Recognition (ASR) models, and avatar backends via YAML configuration.
Quick Start & Requirements
.env), download model weights using huggingface-cli, update cyberverse_config.yaml, run make setup, and then start services with make inference, make server, and make frontend in separate terminals.Highlighted Details
Maintenance & Community
The provided README does not detail specific community channels (like Discord/Slack), notable contributors, sponsorships, or a public roadmap beyond planned features.
Licensing & Compatibility
Limitations & Caveats
The platform requires substantial GPU resources for real-time operation, with specific hardware benchmarks provided. Several advanced features, such as user-side camera input for gesture recognition, knowledge import for RAG, embeddability, and agent memory/tool use, are listed as planned but not yet implemented. The setup process involves multiple dependencies and configuration steps, and optional components like SageAttention and FlashAttention may require source compilation.
1 day ago
Inactive