scalingup by real-stanford

Framework for language-guided robot skill learning

Created 2 years ago

404 stars

Top 71.9% on SourcePulse

Project Summary

This repository provides a framework for language-guided robot skill acquisition, enabling robots to learn new tasks from natural language instructions without expert demonstrations or manual supervision. It is designed for researchers and engineers in robotics and AI interested in efficient, scalable robot learning.

How It Works

The framework employs a data generation pipeline that leverages language models to create diverse, labeled robot trajectories. It utilizes a hierarchical approach with nested trajectories and exploration task trees to manage complexity. Seeded variation and language model queries are used to generate rich data, which is then used to train language-conditioned diffusion policies.

Quick Start & Requirements

Install: The README does not provide a specific installation command but mentions the use of Hydra for configuration.
Prerequisites: Ubuntu 18.04, 20.04, or 22.04; NVIDIA GPUs (tested on GTX 1080, RTX A6000, RTX 3080, RTX 3090).
Resources: Requires significant computational resources for data generation and policy training.
Links: Project Page, Arxiv, Video

Highlighted Details

Language-guided data generation and diffusion policy training.
No expert demonstrations, manual reward supervision, or manual language annotation required.
Hierarchical actions and policies, exploration task trees, and seeded variation for data diversity.
Utilizes language model queries for data labeling and control.

Maintenance & Community

Supported by Google Research Award, NSF Awards #2143601 and #2132519.
Mentions contributions from various individuals and projects, indicating active development and community engagement.
Contact: huy [at] cs [dot] columbia [dot] edu for questions.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README snippet. Further investigation into the repository's files is required.

Limitations & Caveats

The README does not detail specific limitations or known issues. The framework's complexity and reliance on language models may introduce challenges in robustness and interpretability.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

2 stars in the last 30 days

Explore Similar Projects

awesome-in-context-rl by dunnolab

Advancing reinforcement learning through in-context learning paradigms

Created 1 year ago

Updated 4 months ago

Starred by

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera).

vla0 by NVlabs

State-of-the-art Vision-Language-Action models via text-based action representation

Created 2 months ago

Updated 3 days ago

DexGraspVLA by Psi-Robot

Vision-language-action framework for dexterous grasping

Created 10 months ago

Updated 5 months ago

RoboFlamingo by RoboFlamingo

Robotics learning framework for language-conditioned robot skills via fine-tuning

Created 2 years ago

Updated 1 year ago

CogACT by microsoft

Vision-language-action model for robotic manipulation

Created 1 year ago

Updated 2 months ago

UniVLA by OpenDriveLab

Vision-language-action framework for cross-environment policy learning

Created 8 months ago

Updated 1 month ago

Starred by

Eric Jang

Eric Jang(VP AI at 1X).

peract by peract

Robotics agent for language-conditioned manipulation tasks

Created 3 years ago

Updated 1 year ago

cliport by cliport

Robotic manipulation via imitation learning using language-conditioned policies

Created 4 years ago

Updated 2 years ago

Starred by

Alex Yu

Alex Yu(Research Scientist at OpenAI; Cofounder of Luma AI) and

Zhou Xian

Zhou Xian(Cofounder of Genesis AI).

RoboGen by Genesis-Embodied-AI

Generative robotic agent for automated robot learning via generative simulation

Created 2 years ago

Updated 1 year ago

Starred by

Benjamin Bolte

Benjamin Bolte(Cofounder of K-Scale Labs).

Awesome-Robotics-Foundation-Models by robotics-survey

Robotics survey paper resources

Created 2 years ago

Updated 1 year ago

Starred by

Aravind Srinivas

Aravind Srinivas(Cofounder of Perplexity) and

Deepak Pathak

Deepak Pathak(Cofounder of Skild AI; Professor at CMU).

RLBench by stepjam

Robot learning benchmark for vision-guided manipulation research

Created 6 years ago

Updated 11 months ago

diffusion_policy by real-stanford

Visuomotor policy learning via action diffusion (research paper)

Created 2 years ago

Updated 1 year ago

Feedback? Help us improve.