DAVIAN-Robotics
Egocentric video generation from exocentric input
Top 51.7% on SourcePulse
EgoX is a framework for generating egocentric (first-person) videos from a single exocentric (third-person) video input. It targets realistic viewpoint transformation while maintaining temporal consistency and scene structure. Aimed at researchers in egocentric video synthesis, EgoX produces first-person views by combining the external observations with egocentric priors.
How It Works
The framework builds upon large-scale video diffusion models trained on the Ego-Exo4D dataset. EgoX employs a unified conditioning strategy that integrates spatial and channel information within latent representations to achieve realistic viewpoint transformation. A key advantage is its lightweight adaptation mechanism: LoRA-based fine-tuning significantly reduces the computational cost of customization.
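To make the LoRA idea concrete, here is a minimal NumPy sketch of a low-rank adapter on one linear layer. It is not taken from the EgoX code base: the class name `LoRALinear`, the layer sizes, and the `rank` and `alpha` values are all hypothetical, chosen only to show why the number of trainable parameters drops.

```python
import numpy as np


class LoRALinear:
    """Illustrative only: frozen weight W plus a trainable low-rank update B @ A."""

    def __init__(self, weight, rank=4, alpha=8.0, seed=0):
        rng = np.random.default_rng(seed)
        out_features, in_features = weight.shape
        self.weight = weight  # pretrained weight, kept frozen
        # A starts small and random, B starts at zero, so the adapter
        # contributes nothing until training updates B.
        self.A = rng.normal(0.0, 0.01, size=(rank, in_features))
        self.B = np.zeros((out_features, rank))
        self.scale = alpha / rank

    def __call__(self, x):
        base = x @ self.weight.T               # frozen path
        update = (x @ self.A.T) @ self.B.T     # low-rank path
        return base + self.scale * update

    def trainable_params(self):
        # Only A and B would receive gradients during fine-tuning.
        return self.A.size + self.B.size


# Hypothetical sizes: a 128 -> 64 projection adapted with rank 4.
weight = np.random.default_rng(1).normal(size=(64, 128))
layer = LoRALinear(weight, rank=4)
x = np.ones((2, 128))
out = layer(x)

print("full weight parameters:", weight.size)
print("LoRA trainable parameters:", layer.trainable_params())
```

In this toy setting the adapter trains 768 parameters instead of the 8,192 in the full weight matrix, which is the kind of saving the summary attributes to LoRA-based fine-tuning. The actual ranks and target layers used by EgoX are not stated here.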
Quick Start & Requirements
scripts/infer_itw.sh, scripts/infer_ego4d.sh). Custom data inference requires specific directory structures and metadata preparation.