Discover and explore top open-source AI tools and projects—updated daily.
dreamzero0World action models for zero-shot task generalization
New!
Top 41.9% on SourcePulse
DreamZero provides code for a World Action Model that excels at zero-shot task generalization by jointly predicting actions and videos. It targets researchers and engineers in robotics and embodied AI, offering a pre-trained model, a distributed inference server, and tools for simulation and real-world evaluations, significantly accelerating development and experimentation.
How It Works
The core of DreamZero is its ability to predict future actions and corresponding video frames simultaneously. This release focuses on the DROID model, utilizing Diffusion Transformer (DiT) caching for highly optimized inference. This approach allows for robust zero-shot performance on novel tasks without explicit task-specific training, integrating perception, action, and generation.
Quick Start & Requirements
conda create -n dreamzero python=3.11, conda activate dreamzero), followed by pip install -e . --extra-index-url https://download.pytorch.org/whl/cu129. Additional dependencies like flash-attn and Transformer Engine may be required based on hardware.torch.distributed.run with specified model path and port.sim-evals repository (URL not provided) and running eval_utils/run_sim_eval.py after obtaining API access.Highlighted Details
Maintenance & Community
No specific community channels (e.g., Discord, Slack) or detailed maintenance signals (e.g., active contributors, sponsorships) are explicitly mentioned in the provided README.
Licensing & Compatibility
Licensed under the Apache License 2.0. This license is generally permissive for commercial use and integration into closed-source projects.
Limitations & Caveats
Future enhancements include support for new embodiments (e.g., YAM), PolaRiS, and Genie 3.0 simulation environments. The simulation evaluation process requires a separate setup and API access request.
4 days ago
Inactive
bytedance