OmniControl by neu-vi

Human motion generation research paper

created 1 year ago
338 stars

Top 82.6% on sourcepulse

Project Summary

OmniControl addresses the challenge of fine-grained control over human motion generation, enabling users to specify desired joint positions and orientations at any point in time. It targets researchers and developers in computer graphics, animation, and robotics who need precise control over synthesized human movements, offering a flexible framework for creating realistic, controllable character animations.

How It Works

OmniControl builds on a diffusion model conditioned on both text and spatial control signals. Rather than restricting control to a single joint or a global trajectory, it integrates spatial constraints on arbitrary joints directly into the diffusion process. This enables precise manipulation of specific body parts without sacrificing the naturalness and coherence of the generated motion, an advantage over methods that offer only global or coarse control.
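
As a rough illustration of guidance-based spatial control in a diffusion sampler, the minimal sketch below nudges each denoising step toward user-specified joint positions via a gradient on the model's clean-motion prediction. All names here (guided_denoise_step, denoiser, guidance_scale) are hypothetical and do not reflect OmniControl's actual code or API.

    import torch

    def guided_denoise_step(denoiser, x_t, t, text_emb, joint_targets,
                            joint_mask, guidance_scale=0.1):
        # x_t:           noisy motion, shape (frames, joints, 3)
        # joint_targets: desired joint positions, same shape as x_t
        # joint_mask:    1.0 where a joint is constrained, else 0.0
        x_t = x_t.detach().requires_grad_(True)
        x0_pred = denoiser(x_t, t, text_emb)  # predicted clean motion
        # Squared error measured only on the constrained joints/frames.
        err = ((x0_pred - joint_targets) * joint_mask).pow(2).sum()
        # Classifier-guidance-style correction: move the noisy sample so the
        # prediction drifts toward the spatial constraints.
        grad = torch.autograd.grad(err, x_t)[0]
        return (x_t - guidance_scale * grad).detach()

    # Toy usage with a stand-in denoiser, just to exercise the shapes.
    frames, joints = 60, 22
    dummy_denoiser = lambda x, t, c: x
    x = torch.randn(frames, joints, 3)
    targets = torch.zeros(frames, joints, 3)
    mask = torch.zeros(frames, joints, 3)
    mask[0, 0] = 1.0  # pin the pelvis at frame 0
    x = guided_denoise_step(dummy_denoiser, x, t=50, text_emb=None,
                            joint_targets=targets, joint_mask=mask)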

Quick Start & Requirements

  • Install: Set up the conda environment (conda env create -f environment.yml), activate it (conda activate omnicontrol), install the spaCy model (python -m spacy download en_core_web_sm), and install CLIP (pip install git+https://github.com/openai/CLIP.git); the full command sequence is collected after this list.
  • Prerequisites: Python 3.7, CUDA-capable GPU, ffmpeg. Download SMPL files, GloVe, and T2M evaluators via provided scripts.
  • Data: Requires HumanML3D and KIT-ML datasets.
  • Models: Download pretrained models from Google Drive.
  • Links: Project Page, Paper.
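
For convenience, the install commands above in order (exactly as listed; the dataset and checkpoint downloads are separate scripts described in the repository):

    conda env create -f environment.yml
    conda activate omnicontrol
    python -m spacy download en_core_web_sm
    pip install git+https://github.com/openai/CLIP.git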

Highlighted Details

  • Code for training and inference is available.
  • Pretrained model on HumanML3D is provided.
  • Evaluation code and metrics are included.
  • Motion generation can be controlled via text prompts and manually defined spatial signals (a hypothetical example of building such a signal follows this list).
  • Generates .mp4 animations and optionally .obj meshes via SMPLify.
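
To make the control-signal point concrete, here is a small, hypothetical helper for assembling sparse joint keyframes into a dense target array plus mask; the actual input format expected by OmniControl may differ, so treat this purely as a sketch.

    import numpy as np

    def make_control_signal(num_frames, num_joints, keyframes):
        # keyframes: iterable of (frame_idx, joint_idx, (x, y, z)) tuples.
        # Returns dense targets plus a mask marking which entries are set.
        targets = np.zeros((num_frames, num_joints, 3), dtype=np.float32)
        mask = np.zeros((num_frames, num_joints, 1), dtype=np.float32)
        for frame, joint, xyz in keyframes:
            targets[frame, joint] = xyz
            mask[frame, joint] = 1.0
        return targets, mask

    # Example: pin the pelvis (joint 0) at the start and end of a 120-frame clip.
    targets, mask = make_control_signal(
        num_frames=120, num_joints=22,
        keyframes=[(0, 0, (0.0, 0.9, 0.0)), (119, 0, (2.0, 0.9, 0.0))],
    )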

Maintenance & Community

The project accompanies an ICLR 2024 paper. The code builds on MDM, MLD, and TEMOS. No community channels or active-maintenance signals are mentioned in the README.

Licensing & Compatibility

MIT License. However, the project depends on libraries like CLIP, SMPL, SMPL-X, and PyTorch3D, which have their own licenses that must also be followed. Commercial use may be restricted by these underlying dependencies.

Limitations & Caveats

Evaluation on HumanML3D takes approximately 45 hours on a single GPU. The motion rendering script requires a GPU and is borrowed from MDM. The exported .obj meshes do not preserve vertex order, so the saved SMPL parameters should be used for animation instead.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 22 stars in the last 90 days
