1xgpt by 1x-technologies

World model challenge for humanoid robots

Created 1 year ago

552 stars

Top 57.9% on SourcePulse

1 Expert Loves This Project

0hq

Coauthor of Sora

Project Summary

This repository hosts the 1X World Model Challenge, aimed at accelerating progress in learned simulators for general-purpose robotics. It provides a dataset of over 100 hours of first-person EVE Android robot observations and actions, along with baseline models and evaluation tools, targeting researchers and engineers in robotics and AI.

How It Works

The core of the challenge involves predicting future robot observations using learned world models. The approach utilizes a GENIE-style spatio-temporal transformer and a MAGVIT2 autoencoder to compress images into discrete tokens. This tokenized representation allows for efficient modeling of sequential data, enabling the prediction of future states within a learned simulation environment. The advantage lies in creating end-to-end learned simulators that can significantly speed up robot policy development.

Quick Start & Requirements

Install dependencies and download data: ./build.sh
Activate Python environment: source venv/bin/activate
Requires Python 3.10 or later.
Official quick-start and dataset details are available on Huggingface.

Highlighted Details

Three challenges: Compression (predicting tokens), Sampling (generating plausible videos), and Evaluation (ranking policies within a world model).
Cash prizes are offered for intermediate goals in the Compression and Sampling challenges.
The dataset consists of 16 first-person images at 2Hz, totaling 8 seconds per sequence.
Pre-trained GENIE models are provided for baseline comparison.

Maintenance & Community

The project is associated with 1X Technologies.
A Discord server is available for community discussion.
Updates on Phase 2 of the challenge are expected.

Licensing & Compatibility

The dataset is licensed under Apache 2.0.
The repository itself does not explicitly state a license, but the dataset license suggests commercial use is permitted.

Limitations & Caveats

Prizes are not available to individuals in U.S. sanctioned countries.
The challenge organizers reserve the right to withhold prizes if the spirit of the challenge is violated.
The loss metric is non-standard due to a factorized probability mass function for image tokens.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

1

Issues (30d)

0

Star History

6 stars in the last 30 days

Explore Similar Projects

Starred by

Alberto Taiuti

Alberto Taiuti(Cofounder of Luma AI) and

Alex Yu

Alex Yu(Research Scientist at OpenAI; Cofounder of Luma AI).

GR-1 by bytedance

GPT-style model for visual robot manipulation research

Created 1 year ago

Updated 1 year ago

Awesome-Human-Motion by Foruck

Curated research on AI-driven human motion understanding

Created 2 years ago

Updated 1 day ago

TokenHMR by saidwivedi

Research paper advancing human mesh recovery via tokenization

Created 1 year ago

Updated 4 months ago

priorMDM by priorMDM

PyTorch code for human motion diffusion as a generative prior

Created 3 years ago

Updated 1 year ago

arctic by zc-alexfan

Dataset for dexterous bimanual hand-object manipulation research

Created 2 years ago

Updated 2 weeks ago

RoboFlamingo by RoboFlamingo

Robotics learning framework for language-conditioned robot skills via fine-tuning

Created 2 years ago

Updated 1 year ago

Starred by

Phil Wang

Phil Wang(Prolific Research Paper Implementer).

GR00T-Dreams by NVIDIA

Synthetic data generation for robot learning

Created 8 months ago

Updated 4 months ago

RynnVLA-002 by alibaba-damo-academy

Autoregressive action world model for robotics

Created 8 months ago

Updated 2 months ago

Starred by

Amanpreet Singh

Amanpreet Singh(Cofounder of Contextual AI) and

Deshraj Yadav

Deshraj Yadav(Cofounder of Mem0).

habitat-challenge by facebookresearch

Starter code for embodied AI Habitat Challenge

Created 7 years ago

Updated 2 years ago

VIMA by vimalabs

Robot manipulation via multimodal prompts (ICML'23 paper)

Created 3 years ago

Updated 1 year ago

Starred by

Chenlin Meng

Chenlin Meng(Cofounder of Pika),

Ben Firshman

Ben Firshman(Cofounder of Replicate), and

1 more.

Best_AI_paper_2020 by louisfb01

AI paper list with video explanations and code from 2020

Created 5 years ago

Updated 4 years ago

Starred by

Forrest Iandola

Forrest Iandola(Author of SqueezeNet; Research Scientist at Meta),

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind), and

2 more.

Isaac-GR00T by NVIDIA

Open foundation model for humanoid robot reasoning and skills

Created 11 months ago

Updated 2 days ago

Feedback? Help us improve.