video-retalking by OpenTalker

Lip-sync system for talking-head video editing (research paper)

created 2 years ago
7,114 stars

Top 7.4% on sourcepulse

Project Summary

VideoReTalking edits talking-head videos so that the speaker's lips follow a new input audio track, and it can also modify the facial expression. It targets researchers and developers working on video editing and synthesis, and aims to produce high-quality, lip-synced output even when the target emotion differs from the one in the source video.

How It Works

The system employs a three-stage pipeline: first, an expression editing network modifies facial expressions to a canonical form. Second, a lip-sync network synchronizes the video frames with input audio. Finally, an identity-aware face enhancement network and post-processing steps improve photo-realism. This sequential, learning-based approach allows for end-to-end processing without manual intervention.
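The sequential structure described above can be sketched as a chain of stage functions. This is only a toy illustration, not the project's code: `Clip`, the three stage functions, and `run_pipeline` are hypothetical names, and each stage merely records that it ran instead of invoking a neural network.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Clip:
    """Toy stand-in for a video: frame labels plus a log of applied stages."""
    frames: List[str]
    history: List[str] = field(default_factory=list)


# Hypothetical stage type: takes the clip and the audio path, returns a new clip.
Stage = Callable[[Clip, str], Clip]


def edit_expression(clip: Clip, audio: str) -> Clip:
    # Stage 1 (placeholder): bring every frame to a canonical expression.
    return Clip(clip.frames, clip.history + ["expression_editing"])


def sync_lips(clip: Clip, audio: str) -> Clip:
    # Stage 2 (placeholder): the only stage that is driven by the audio.
    return Clip(clip.frames, clip.history + [f"lip_sync:{audio}"])


def enhance_face(clip: Clip, audio: str) -> Clip:
    # Stage 3 (placeholder): identity-aware enhancement and post-processing.
    return Clip(clip.frames, clip.history + ["face_enhancement"])


STAGES: List[Stage] = [edit_expression, sync_lips, enhance_face]


def run_pipeline(clip: Clip, audio: str, stages: List[Stage]) -> Clip:
    """Apply the stages in order, feeding each stage's output to the next."""
    for stage in stages:
        clip = stage(clip, audio)
    return clip
```

Calling `run_pipeline(Clip(["f0", "f1"]), "speech.wav", STAGES)` returns a clip whose `history` lists the three stages in the order they ran, mirroring the fixed ordering of the real pipeline.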

Quick Start & Requirements

  • Install via conda and pip. Requires Python 3.8 and CUDA 11.1.
  • PyTorch version 1.9.0+cu111 and torchvision 0.10.0+cu111 are specified.
  • Pretrained models must be downloaded and placed in ./checkpoints.
  • Inference command: python3 inference.py --face <input_video> --audio <input_audio> --outfile <output_video>
  • Official demo notebook: quick_demo.ipynb
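As a rough sketch, the inference command above could be assembled from Python before being handed to a process runner. Only the script name and the `--face`, `--audio`, and `--outfile` flags come from the summary; the helper names `build_inference_command` and `missing_inputs` are hypothetical, and the check on `./checkpoints` only verifies that the directory exists, not that the right model files are in it.

```python
from pathlib import Path
from typing import List


def build_inference_command(face: str, audio: str, outfile: str,
                            python: str = "python3") -> List[str]:
    """Build the argument list for the documented inference command."""
    return [
        python, "inference.py",
        "--face", face,        # input video containing the talking head
        "--audio", audio,      # audio track that should drive the lips
        "--outfile", outfile,  # where the edited video is written
    ]


def missing_inputs(face: str, audio: str,
                   checkpoints: str = "./checkpoints") -> List[str]:
    """Return those of the given paths that do not exist on disk."""
    return [p for p in (face, audio, checkpoints) if not Path(p).exists()]
```

From the repository root, the resulting list could be passed to `subprocess.run(cmd, check=True)` once `missing_inputs(...)` returns an empty list.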

Highlighted Details

  • SIGGRAPH Asia 2022 Conference Track publication.
  • Supports editing expression (e.g., smile, surprise, angry) via parameters.
  • Can handle various talking face videos without manual alignment.
  • Acknowledges contributions from Wav2Lip, PIRenderer, GFP-GAN, GPEN, ganimation_replicate, and STIT.

Maintenance & Community

The project lists authors from Xidian University and Tencent AI Lab. No specific community channels (Discord/Slack) or roadmap are provided in the README.

Licensing & Compatibility

The README states compliance with an "open-source license" and "intellectual property declaration" but does not specify the license type. It includes a disclaimer that it is not an official Tencent product and prohibits using Tencent names/logos without permission.

Limitations & Caveats

The DNet module (the expression editing network) cannot handle extreme head poses. The project's disclaimer also warns against using the code for harmful activities or misrepresentation.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 123 stars in the last 90 days
