Audio2Face model for real-time 2D chat avatar generation
LiteAvatar is a real-time, audio-driven 2D chat avatar system that runs at 30 fps on CPU alone, with no GPU acceleration required. It targets developers and researchers building interactive virtual agents for applications such as video conferencing and virtual assistants, offering a lightweight, efficient solution for voice-synchronized facial animation.
How It Works
The system employs an efficient automatic speech recognition (ASR) model for audio feature extraction, a mouth-parameter prediction model that generates synchronized mouth movements from those features, and a lightweight 2D face generator that renders the final frames, enabling real-time inference even on mobile devices. The pipeline prioritizes efficiency and CPU-bound operation.
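To make the three-stage flow concrete, here is a minimal, runnable sketch in which stub models stand in for LiteAvatar's actual ASR, mouth-parameter, and face-generation models. Every class name, feature shape, and rendering detail below is an illustrative assumption, not the project's API.

```python
import numpy as np

# Stub stand-ins for the three stages described above; all details are
# assumptions for illustration, not LiteAvatar's actual models or API.

class StubASR:
    def extract_features(self, audio_chunk: np.ndarray) -> np.ndarray:
        # Real system: an efficient ASR model yields per-frame audio features.
        return np.array([np.abs(audio_chunk).mean()])

class StubMouthModel:
    def predict(self, features: np.ndarray) -> np.ndarray:
        # Real system: a small predictor maps audio features to mouth parameters.
        return np.tanh(features)

class StubFaceGenerator:
    def render(self, mouth_params: np.ndarray) -> np.ndarray:
        # Real system: a lightweight 2D generator renders a full face frame.
        h = w = 256
        frame = np.zeros((h, w, 3), dtype=np.uint8)
        opening = int(10 + 40 * abs(float(mouth_params[0])))  # mouth height, px
        frame[h // 2 : h // 2 + opening, w // 3 : 2 * w // 3] = 255
        return frame

def run_pipeline(audio: np.ndarray, sample_rate: int = 16000, fps: int = 30):
    asr, mouth, face = StubASR(), StubMouthModel(), StubFaceGenerator()
    samples_per_frame = sample_rate // fps  # audio samples per video frame
    frames = []
    for i in range(0, len(audio) - samples_per_frame + 1, samples_per_frame):
        chunk = audio[i : i + samples_per_frame]
        features = asr.extract_features(chunk)  # stage 1: ASR audio features
        params = mouth.predict(features)        # stage 2: mouth parameters
        frames.append(face.render(params))      # stage 3: rendered 2D frame
    return frames

if __name__ == "__main__":
    one_second = np.random.randn(16000).astype(np.float32)
    print(f"rendered {len(run_pipeline(one_second))} frames")  # ~30 frames per second
```

The chunked loop mirrors why the design stays real-time on CPU: each video frame depends only on a short window of audio, so work is bounded per frame rather than per utterance.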
Quick Start & Requirements
pip install -r requirements.txt
python lite_avatar.py --data_dir /path/to/sample_data --audio_file /path/to/audio.wav --result_dir /path/to/result
Sample data is provided at ./data/sample_data.zip
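For processing several clips, one option is to call the same CLI from Python in a loop. The directory paths below are placeholders; only the flags shown in the quick-start command above are assumed.

```python
import subprocess
from pathlib import Path

audio_dir = Path("/path/to/audio_clips")  # placeholder directory of .wav files
for wav in sorted(audio_dir.glob("*.wav")):
    subprocess.run(
        [
            "python", "lite_avatar.py",
            "--data_dir", "/path/to/sample_data",
            "--audio_file", str(wav),
            "--result_dir", f"/path/to/result/{wav.stem}",
        ],
        check=True,  # raise if avatar generation fails for a clip
    )
```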
Highlighted Details
Maintenance & Community
The project acknowledges contributions from Paraformer and FunASR. A related paper is available for citation.
Licensing & Compatibility
The license is not explicitly stated in the README. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The README does not specify a license, which may hinder commercial adoption. The project also pins specific Python and CUDA versions, even though CPU-only operation is a headline feature.