diffusion_policy  by real-stanford

Visuomotor policy learning via action diffusion (research paper)

created 2 years ago
2,857 stars

Top 17.0% on sourcepulse

GitHubView on GitHub
Project Summary

Diffusion Policy provides a framework for learning visuomotor policies using diffusion models, targeting researchers and engineers in robotics and reinforcement learning. It enables efficient training and evaluation of policies on both simulated and real-world robotic tasks, offering a structured approach to policy learning with state or image-based observations.

How It Works

The core of Diffusion Policy lies in its action-centric diffusion model, which learns to generate a sequence of actions conditioned on a history of observations. It employs a unified interface for tasks and methods, allowing for modularity and extensibility. The framework handles data normalization, policy inference, and training/evaluation orchestration through distinct components like Datasets, Policies, and Workspaces, abstracting away environment-specific details.

Quick Start & Requirements

  • Installation: Install via Conda using conda env create -f conda_environment.yaml or mamba env create -f conda_environment.yaml.
  • Prerequisites: Linux with NVIDIA GPU, Ubuntu 20.04, Mujoco dependencies (libosmesa6-dev libgl1-mesa-glx libglfw3 patchelf), RealSense SDK, Spacemouse dependencies (libspnav-dev spacenavd).
  • Demo: Interactive Colab notebooks are available for state-based and vision-based environments.
  • Documentation: Project page and paper links are provided.

Highlighted Details

  • Supports both low-dimensional state inputs and high-dimensional image inputs.
  • Includes implementations for simulation benchmarks and real robot hardware (UR5).
  • Provides scripts for reproducing simulation results, training on real robot data, and evaluating pre-trained checkpoints.
  • Features a modular codebase structure for easily adding new tasks and methods.

Maintenance & Community

The project is associated with Columbia University and Toyota Research Institute. Links to experiment logs and further details are available on their website.

Licensing & Compatibility

  • License: MIT License.
  • Compatibility: Permissive license suitable for commercial use and integration with closed-source projects.

Limitations & Caveats

The macOS environment setup (conda_environment_macos.yaml) is noted as having incomplete support for benchmarks. The codebase structure, while flexible, involves code repetition between tasks and methods.

Health Check
Last commit

7 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
450 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.