diffusion_policy  by real-stanford

Visuomotor policy learning via action diffusion (research paper)

Created 2 years ago
3,274 stars

Top 14.7% on SourcePulse

Project Summary

Diffusion Policy provides a framework for learning visuomotor policies using diffusion models, targeting researchers and engineers in robotics and reinforcement learning. It supports training and evaluating policies on both simulated and real-world robotic tasks, with either low-dimensional state observations or image observations.

How It Works

The core of Diffusion Policy lies in its action-centric diffusion model, which learns to generate a sequence of actions conditioned on a history of observations. It employs a unified interface for tasks and methods, allowing for modularity and extensibility. The framework handles data normalization, policy inference, and training/evaluation orchestration through distinct components like Datasets, Policies, and Workspaces, abstracting away environment-specific details.
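The idea of generating an action sequence by iterative denoising, conditioned on recent observations, can be illustrated with a minimal NumPy sketch of DDPM-style sampling. This is not the repository's code: `make_schedule`, `toy_noise_predictor`, and `sample_actions` are hypothetical names, the schedule values are arbitrary, and the trained network is replaced by a closed-form noise predictor that assumes the only valid action sequence is "hold the latest observed position".

```python
import numpy as np

def make_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linear DDPM noise schedule: betas, alphas, and cumulative alpha products."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    alphas = 1.0 - betas
    return betas, alphas, np.cumprod(alphas)

def toy_noise_predictor(noisy_actions, alpha_bar_t, obs_history):
    """Hypothetical stand-in for a trained noise-prediction network.

    Assumes the data distribution is a single action sequence that repeats
    the latest observation, so the noise has a closed-form expression.
    """
    target = np.tile(obs_history[-1], (noisy_actions.shape[0], 1))
    return (noisy_actions - np.sqrt(alpha_bar_t) * target) / np.sqrt(1.0 - alpha_bar_t)

def sample_actions(obs_history, horizon, num_steps=50, seed=0):
    """Start from Gaussian noise and denoise into a (horizon, action_dim) sequence."""
    rng = np.random.default_rng(seed)
    betas, alphas, alpha_bars = make_schedule(num_steps)
    actions = rng.standard_normal((horizon, obs_history.shape[1]))
    for t in reversed(range(num_steps)):
        eps = toy_noise_predictor(actions, alpha_bars[t], obs_history)
        # Posterior mean of the previous (less noisy) sample given the predicted noise.
        mean = (actions - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps) / np.sqrt(alphas[t])
        # No noise is added on the final step.
        noise = rng.standard_normal(actions.shape) if t > 0 else 0.0
        actions = mean + np.sqrt(betas[t]) * noise
    return actions

# Two past observations (2-D positions) condition an 8-step action sequence.
obs_history = np.array([[0.0, 0.0], [0.1, 0.2]])
actions = sample_actions(obs_history, horizon=8)
```

In a real policy the noise predictor would be a learned network that takes the observation history as conditioning input, and typically only the first few actions of the sampled sequence would be executed before re-planning.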

Quick Start & Requirements

  • Installation: Install via Conda using conda env create -f conda_environment.yaml or mamba env create -f conda_environment.yaml.
  • Prerequisites: Linux (Ubuntu 20.04) with an NVIDIA GPU, MuJoCo dependencies (libosmesa6-dev libgl1-mesa-glx libglfw3 patchelf), RealSense SDK, SpaceMouse dependencies (libspnav-dev spacenavd).
  • Demo: Interactive Colab notebooks are available for state-based and vision-based environments.
  • Documentation: Project page and paper links are provided.

Highlighted Details

  • Supports both low-dimensional state inputs and high-dimensional image inputs.
  • Includes implementations for simulation benchmarks and real robot hardware (UR5).
  • Provides scripts for reproducing simulation results, training on real robot data, and evaluating pre-trained checkpoints.
  • Features a modular codebase structure for easily adding new tasks and methods.
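To make the modular split between datasets, policies, and workspaces concrete, here is a toy sketch of how such components might fit together. All class and method names (`MinMaxNormalizer`, `ToyDataset`, `ToyPolicy`, `ToyWorkspace`) are assumptions made for illustration and were not checked against the repository; the "model" is a placeholder, not a diffusion network.

```python
import numpy as np

class MinMaxNormalizer:
    """Hypothetical helper: maps each feature to [-1, 1] and back."""
    def __init__(self, data):
        self.low = data.min(axis=0)
        self.scale = np.maximum(data.max(axis=0) - self.low, 1e-8)

    def normalize(self, x):
        return 2.0 * (x - self.low) / self.scale - 1.0

    def unnormalize(self, x):
        return (x + 1.0) / 2.0 * self.scale + self.low

class ToyDataset:
    """Owns the data and the statistics needed to normalize it."""
    def __init__(self, obs, actions):
        self.obs, self.actions = obs, actions

    def __len__(self):
        return len(self.obs)

    def __getitem__(self, i):
        return {"obs": self.obs[i], "action": self.actions[i]}

    def get_normalizer(self):
        return {"obs": MinMaxNormalizer(self.obs),
                "action": MinMaxNormalizer(self.actions)}

class ToyPolicy:
    """Handles normalization internally so callers only see raw units."""
    def set_normalizer(self, normalizers):
        self.normalizers = normalizers

    def predict_action(self, obs):
        n_obs = self.normalizers["obs"].normalize(obs)
        n_action = -n_obs  # placeholder for a learned model
        return self.normalizers["action"].unnormalize(n_action)

class ToyWorkspace:
    """Wires a dataset and a policy together for one experiment."""
    def __init__(self, dataset, policy):
        self.dataset, self.policy = dataset, policy

    def run(self):
        self.policy.set_normalizer(self.dataset.get_normalizer())
        return np.stack([self.policy.predict_action(self.dataset[i]["obs"])
                         for i in range(len(self.dataset))])

dataset = ToyDataset(obs=np.array([[0.0, 0.0], [1.0, 2.0]]),
                     actions=np.array([[10.0, 20.0], [30.0, 40.0]]))
predicted = ToyWorkspace(dataset, ToyPolicy()).run()
```

Under this kind of split, adding a new task would mostly mean writing a new dataset class, and adding a new method would mostly mean writing a new policy class, with the workspace left as glue.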

Maintenance & Community

The project is associated with Columbia University and Toyota Research Institute. Links to experiment logs and further details are available on the project website.

Licensing & Compatibility

  • License: MIT License.
  • Compatibility: Permissive license suitable for commercial use and integration with closed-source projects.

Limitations & Caveats

The macOS environment setup (conda_environment_macos.yaml) does not fully support all benchmarks. The codebase structure, while flexible, involves code repetition between tasks and methods.

Health Check

  • Last Commit: 10 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 1
  • Issues (30d): 2
  • Star History: 134 stars in the last 30 days
