champ by fudan-generative-vision

Human image animation research paper using 3D parametric guidance

created 1 year ago
4,231 stars

Top 11.8% on sourcepulse

Project Summary

Champ addresses controllable and consistent human image animation by leveraging 3D parametric guidance, targeting researchers and developers in computer vision and graphics. Given a reference image and a driving motion sequence, it animates the depicted human subject, offering a novel approach to character animation.

How It Works

Champ uses a diffusion model guided by 3D human pose and shape parameters from SMPL (the Skinned Multi-Person Linear model), derived from driving motion sequences. Conditioning generation on these explicit 3D representations of human movement enables fine-grained control while keeping the animation consistent and realistic, and the SMPL parameterization provides a robust, interpretable way to capture and transfer human motion.
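
Below is a minimal sketch of the 3D parametric side of this idea, assuming the smplx package and a locally downloaded SMPL model file; the model path is hypothetical and this is not Champ's actual pipeline code.

    # Minimal sketch (assumes the smplx package and a downloaded SMPL model).
    # Shows how per-frame SMPL pose/shape parameters yield a 3D mesh from which
    # guidance maps (depth, normals, semantics) could then be rendered.
    import torch
    import smplx

    model = smplx.create("models/", model_type="smpl")  # hypothetical model path

    def mesh_for_frame(betas, body_pose, global_orient):
        """Return the (6890, 3) SMPL vertex array for one motion frame."""
        out = model(betas=betas, body_pose=body_pose, global_orient=global_orient)
        return out.vertices[0].detach()

    # One frame of a driving motion: shared shape (betas), per-frame pose.
    betas = torch.zeros(1, 10)         # body shape coefficients
    body_pose = torch.zeros(1, 69)     # 23 joints x 3 axis-angle values
    global_orient = torch.zeros(1, 3)  # root orientation

    verts = mesh_for_frame(betas, body_pose, global_orient)
    # Each frame's mesh would be rasterized into depth/normal/semantic maps that
    # condition the diffusion model alongside DWPose keypoints.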

Quick Start & Requirements

  • Install: pip install -r requirements.txt or poetry install --no-root
  • Prerequisites: Ubuntu 20.04 or Windows 11, CUDA 12.1, Python 3.10. Tested GPUs: A100, RTX 3090.
  • Setup: download the pretrained models and prepare guidance motions (SMPL & Rendering); a download sketch follows this list. Inference on a 250-frame motion requires ~20 GB of VRAM.
  • Links: Docs, Demo, ComfyUI Wrapper
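
A minimal sketch for the model-download step, assuming the huggingface_hub package; the repo id below is an assumption about where the weights are hosted and should be verified against the official docs.

    # Minimal sketch (assumes huggingface_hub is installed).
    from huggingface_hub import snapshot_download

    snapshot_download(
        repo_id="fudan-generative-ai/champ",  # assumed hub location; verify in docs
        local_dir="pretrained_models",
    )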

Highlighted Details

  • ECCV 2024 paper.
  • Supports animation driven by multiple guidance signals, including depth, DWPose, normal maps, and semantic maps (a toy stacking sketch follows this list).
  • Offers scripts for SMPL & Rendering and Blender add-ons for motion processing.
  • Training code and sample datasets are released.
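
Since several guidance signals are supported, the toy sketch below shows one plausible way to stack per-frame guidance maps into a single conditioning tensor; shapes and names are illustrative, not Champ's actual code.

    # Toy sketch: stack per-frame guidance maps (depth, normal, semantic, DWPose)
    # into one conditioning tensor. Shapes and channel counts are illustrative.
    import torch

    def stack_guidance(depth, normal, semantic, dwpose):
        """Each input: (frames, channels, H, W); returns (frames, C_total, H, W)."""
        return torch.cat([depth, normal, semantic, dwpose], dim=1)

    frames, h, w = 16, 64, 64
    cond = stack_guidance(
        torch.rand(frames, 1, h, w),  # depth: one channel
        torch.rand(frames, 3, h, w),  # surface normals encoded as xyz
        torch.rand(frames, 3, h, w),  # semantic map rendered as RGB
        torch.rand(frames, 3, h, w),  # DWPose skeleton rendered as RGB
    )
    assert cond.shape == (frames, 10, h, w)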

Maintenance & Community

  • Active development with recent releases of training code and sample data.
  • Community contributions include a ComfyUI wrapper and a Replicate demo.
  • Roadmap available for future developments.
  • Contact: siyuzhu@fudan.edu.cn for research opportunities.

Licensing & Compatibility

  • The repository does not explicitly state a license. The description credits runwayml and stabilityai for the Stable Diffusion models, implying potential licensing obligations inherited from those sources.

Limitations & Caveats

  • High VRAM requirement (~20 GB) for longer sequences; motions can be segmented to fit lower-VRAM GPUs (see the chunking sketch after this list).
  • Training requires custom data processing into SMPL & DWPose format.
  • The Gradio demo is listed as TBD on the roadmap.
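
For the VRAM caveat above, here is a minimal sketch of segmenting a long motion into overlapping chunks so each inference pass fits a smaller GPU; the segment length and overlap are assumptions to tune per GPU.

    # Minimal sketch: split a long driving motion into fixed-size segments.
    # seg_len and overlap are assumptions; tune them to your GPU (the README
    # cites ~20 GB of VRAM for a 250-frame motion).
    def segment_motion(num_frames: int, seg_len: int = 64, overlap: int = 4):
        """Yield (start, end) frame ranges with a small overlap for blending."""
        start = 0
        while start < num_frames:
            end = min(start + seg_len, num_frames)
            yield start, end
            if end == num_frames:
                break
            start = end - overlap

    print(list(segment_motion(250)))
    # [(0, 64), (60, 124), (120, 184), (180, 244), (240, 250)]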

Health Check

  • Last commit: 1 year ago
  • Responsiveness: 1 day
  • Pull requests (30d): 0
  • Issues (30d): 1
  • Star history: 48 stars in the last 90 days
