Discover and explore top open-source AI tools and projects—updated daily.
Open-X-HumanoidHumanoid robots executing skilled manipulation
New!
Top 62.5% on SourcePulse
Summary HEX is a whole-body vision-language-action (VLA) framework designed for full-sized humanoid robots, enabling cross-embodiment whole-body manipulation. It addresses the challenge of transferring learned policies across different humanoid platforms and performing long-horizon tasks by aligning heterogeneous states and learning predictive body dynamics.
How It Works HEX integrates a Qwen-VL backbone for perception and language understanding with a novel flow-matching action head. Its core innovation is aligning diverse humanoid states into shared body-part representations, facilitating the learning of predictive body dynamics from cross-embodiment data. This allows the framework to generalize across different humanoid embodiments. During deployment, HEX predicts high-level actions for arms, hands, and waist, coordinating with a separate low-level controller for leg movements to achieve stable manipulation.
Quick Start & Requirements
python=3.10), install system libs (libegl1-mesa-dev, libglu1-mesa), install Python requirements (pip install -r requirements.txt), install FlashAttention2 (manual wheel download may be needed), and install HEX (pip install -e .).libegl1-mesa-dev, libglu1-mesa.notebooks/eval_model.ipynb.Highlighted Details
Maintenance & Community The provided README does not detail specific maintenance schedules, notable contributors, sponsorships, or community channels (e.g., Discord, Slack).
Licensing & Compatibility The core software license is not explicitly stated. However, the README notes commercial restrictions on the data collection pipeline for Tienkung series robots and that the associated low-level whole-body controller is not open-sourced. This suggests potential limitations for commercial use or integration without further clarification.
Limitations & Caveats The data collection pipeline and low-level controller for Tienkung robots are proprietary. Users may encounter installation challenges with FlashAttention2 requiring manual wheel selection based on PyTorch/CUDA versions. The framework necessitates downloading external Qwen3-VL models.
2 weeks ago
Inactive
NVIDIA