Audio2Face-3D by NVIDIA

Real-time 3D facial animation synthesis from audio

Created 9 months ago

328 stars

Top 82.9% on SourcePulse

Project Summary

Summary

NVIDIA Audio2Face-3D generates high-fidelity 3D facial animation from audio, producing accurate lip-sync and nuanced emotional expressions. It supports real-time and pre-recorded audio, driving animation via mesh deformations, joint transformations, or blend shapes. This technology enables lifelike, synchronized facial performances for games, film, and interactive applications.

How It Works

The system analyzes vocal data to synthesize detailed facial articulation, including jaw, tongue, and eye motion, alongside subtle skin deformations. It infers emotional states from speech tone and uses phonetic analysis for precise lip synchronization. This approach ensures lifelike facial performances precisely matching source audio.

Quick Start & Requirements

Components are distributed via NVIDIA repos and Hugging Face Hub.

SDK: C++ library for on-device/cloud animation. MIT License. Target: C++ developers, pipeline engineers. A2X SDK on Github
Training Framework: Python/Docker for custom models. Apache License. Target: ML researchers, animators. Training Framework on Github
Maya ACE Plugin: Autodesk Maya integration. MIT License. Target: 3D animators. Maya ACE on Github
Unreal Engine 5 Plugin: UE 5 integration. MIT License. Target: Game developers. UE5 Plugin Docs
NIM: Scalable deployment microservice. NVIDIA Software License. Target: DevOps. A2F NIM All leverage NVIDIA GPUs; CPU fallbacks exist.

Highlighted Details

Accurate lip-sync via phonetic analysis; nuanced emotional expression from speech tone.
Animation output via mesh deformations, joint transformations, or blend shapes.
Offers pre-trained models and custom model training.
Includes Maya and UE5 plugins, plus a scalable deployment microservice (NIM).

Maintenance & Community

Source code, scripts, and docs are in NVIDIA repos. Models and datasets are on Hugging Face Hub. GitHub links provided for SDK, Training Framework, Maya ACE, and UE plugins. NIM is on build.nvidia.com.

Licensing & Compatibility

Core components (SDK, Maya ACE, UE Plugin) are MIT. Training Framework is Apache. Models are "Open Weights" or "Custom (evaluation only)". NIM uses NVIDIA Software License Agreement and Product Specific Terms for AI Products, potentially restricting commercial use.

Limitations & Caveats

NIM license may restrict commercial deployment. Sample datasets are "evaluation only". UE5 plugin requires specific engine versions (5.4-5.6). Some models are experimental (v3.0).

Health Check

Last Commit

9 months ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

12 stars in the last 30 days