Discover and explore top open-source AI tools and projects—updated daily.
Digital human system for Unreal Engine 5.3
Top 54.8% on SourcePulse
GMTalker is an immersive digital human system designed for Unreal Engine 5.3, targeting applications in scientific research, education, and virtual human development. It integrates speech recognition, synthesis, natural language understanding, lip-sync animation, and 3D rendering, offering a complete commercial digital human pipeline with local deployment capabilities.
How It Works
GMTalker employs a modular architecture with a UE5 client, a backend AI Digital Human system, and core AI services. The backend handles application logic, interacting with GPT, TTS, ASR, and Player services. These services leverage external APIs and models like OpenAI, FunASR, GPT-SoVITS, and Audio2Face. The system supports real-time interaction, including voice interruption and RAG for personalized Q&A, with a focus on natural language processing and animation synchronization.
Quick Start & Requirements
webui.bat
or ./webui.ps1
) for a one-click start.http://127.0.0.1:5002
, Web UI at http://127.0.0.1:7860
.install.md
, WebUI guide at webui.md
.Highlighted Details
Maintenance & Community
The project is developed by the Media Intelligence Team of Light Intelligence Lab. Contact information for project collaboration is provided via email (mafei@gml.ac.cn
, xuhongbo@gml.ac.cn
) and the Guangming Laboratory Official Site.
Licensing & Compatibility
Licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). This permits use, modification, and sharing for non-commercial purposes with attribution. Commercial use is restricted.
Limitations & Caveats
Audio2Face requires downloading character models via VPN and may have slow initial loading; version 2023.1.1 is recommended. The license restricts commercial use.
1 day ago
Inactive