Robot policy for generalist manipulation, trained on 800k trajectories
Top 31.3% on sourcepulse
Octo provides a transformer-based generalist robotic policy (GRP) trained on 800k robot trajectories, enabling zero-shot control via language or goal images across diverse robot setups. It's designed for researchers and practitioners in robotics and AI who need a versatile, adaptable policy for various manipulation tasks.
How It Works
Octo employs a modular attention structure within its transformer backbone. This design allows it to process multimodal inputs (RGB cameras, language, goal images) and output robot actions. The modularity facilitates efficient finetuning on new robot morphologies, sensory inputs, and action spaces with minimal data and compute.
Quick Start & Requirements
pip install -e .
and pip install -r requirements.txt
.jax[cuda11_pip]
(version 0.4.20 specified).jax[tpu]
(version 0.4.20 specified).python scripts/finetune.py --config.pretrained_path=hf://rail-berkeley/octo-small-1.5 --debug
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 year ago
1 day