gym-sokoban by mpSchrader

Gym environment for Sokoban puzzles, suitable for RL research

Created 7 years ago

390 stars

Top 73.6% on SourcePulse

2 Experts Love This Project

truell20

Cofounder of Cursor

aravindsrinivas

Aravind Srinivas

Cofounder of Perplexity

Project Summary

This repository provides a Sokoban environment for OpenAI Gym, designed for reinforcement learning research. Sokoban is hard for RL agents because some moves are irreversible and can leave a puzzle unsolvable. To supply training levels, the environment generates random rooms that are guaranteed to be solvable, using reverse play and a heuristic scoring system.

How It Works

The environment generates random, solvable Sokoban levels in three phases: topology generation via random walks, element placement (player, boxes, targets), and a reverse-play phase that uses depth-first search to ensure solvability and assign a difficulty score. Training on randomly generated rooms reduces the risk that an RL agent overfits to a fixed set of levels.
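The three phases can be illustrated with a small, self-contained sketch. This is not the repository's code: the function names (`carve_topology`, `place_elements`, `reverse_play`) and all parameters are hypothetical, and the third phase uses a plain random reverse walk instead of the depth-first search with difficulty scoring described above. It only shows why reverse play yields solvable rooms: every pull made from the solved state can be undone by a push.

```python
import random

WALL, FLOOR = 1, 0
DIRS = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def carve_topology(height, width, steps, rng):
    """Phase 1 (hypothetical): carve floor out of solid wall with a random walk."""
    grid = [[WALL] * width for _ in range(height)]
    r, c = height // 2, width // 2
    grid[r][c] = FLOOR
    for _ in range(steps):
        dr, dc = rng.choice(DIRS)
        nr, nc = r + dr, c + dc
        # Stay strictly inside the outer ring so the room keeps a wall border.
        if 1 <= nr < height - 1 and 1 <= nc < width - 1:
            r, c = nr, nc
            grid[r][c] = FLOOR
    return grid


def place_elements(grid, num_boxes, rng):
    """Phase 2 (hypothetical): choose distinct floor cells for player and targets."""
    floor = [(r, c) for r, row in enumerate(grid)
             for c, cell in enumerate(row) if cell == FLOOR]
    if len(floor) < num_boxes + 1:
        raise ValueError("not enough floor cells for the player and targets")
    cells = rng.sample(floor, num_boxes + 1)
    return cells[0], cells[1:]


def reverse_play(grid, player, targets, moves, rng):
    """Phase 3 (simplified): start solved, then walk backwards pulling boxes.

    Boxes begin on their targets. When the player steps away from a box that
    sits directly behind it, the box may be pulled into the cell the player
    just left. Replaying the walk in reverse turns each pull into a legal push.
    """
    boxes = set(targets)
    for _ in range(moves):
        dr, dc = rng.choice(DIRS)
        new_player = (player[0] + dr, player[1] + dc)
        if grid[new_player[0]][new_player[1]] == WALL or new_player in boxes:
            continue  # blocked: skip this step
        behind = (player[0] - dr, player[1] - dc)
        if behind in boxes and rng.random() < 0.5:
            boxes.remove(behind)
            boxes.add(player)  # box follows into the vacated cell
        player = new_player
    return player, boxes


rng = random.Random(0)
grid = carve_topology(8, 8, 200, rng)
player, targets = place_elements(grid, 2, rng)
start_player, start_boxes = reverse_play(grid, player, targets, 200, rng)
```

The resulting `start_player` and `start_boxes` describe a starting position, and `targets` marks where the boxes must end up. A fuller generator would also score candidate states and keep the hardest one, which is the role the difficulty score plays in the description above.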

Quick Start & Requirements

Install via pip: pip install gym-sokoban
Requires Python.
Official docs: https://github.com/mpSchrader/gym-sokoban
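After installing, a random-agent loop is a quick way to check that the environment runs. The sketch below assumes the package is imported as `gym_sokoban` and that importing it registers the `Sokoban-v0` id with Gym; the helper names `make_sokoban_env` and `run_random_episode` are hypothetical. Because Gym's `step` return value changed between versions, the loop accepts both the 4-tuple and 5-tuple forms.

```python
def make_sokoban_env(env_id="Sokoban-v0"):
    """Create the environment. Requires gym and gym-sokoban to be installed."""
    import gym
    import gym_sokoban  # noqa: F401  (assumed to register the Sokoban-* ids)
    return gym.make(env_id)


def run_random_episode(env, max_steps=100):
    """Step the environment with random actions and return the total reward."""
    env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        result = env.step(env.action_space.sample())
        reward = result[1]
        # Older Gym: (obs, reward, done, info).
        # Newer Gym: (obs, reward, terminated, truncated, info).
        done = result[2] if len(result) == 4 else (result[2] or result[3])
        total_reward += reward
        if done:
            break
    return total_reward


# Usage (only works with gym and gym-sokoban installed):
#   env = make_sokoban_env()
#   print(run_random_episode(env))
```

Keeping the imports inside `make_sokoban_env` means the episode loop can be exercised against any object that exposes `reset`, `step`, and `action_space.sample`.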

Highlighted Details

Implements 9 actions: Move and Push in each of 4 directions, plus No-Op.
Offers multiple rendering modes: rgb_array, human, tiny_rgb_array, tiny_human.
Includes various room configurations (e.g., Sokoban-v0 to Sokoban-huge-v0) with different sizes and box counts.
Supports variations like Fixed Targets, Multiple Player, Push&Pull, and Boxoban (DeepMind puzzles).
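The difference between the Move and Push actions can be shown on a one-dimensional corridor. This is a toy model, not the environment's implementation: it assumes Move only steps into a free cell, while Push additionally shifts an adjacent box when the cell behind the box is free. The function `apply_action` and the cell symbols are hypothetical, and the real environment's rules may differ in detail.

```python
FREE, WALL, BOX = ".", "#", "$"


def apply_action(row, player, step, push):
    """Apply one action along a corridor and return the new player index.

    row    -- list of cells (FREE, WALL or BOX); updated in place
    player -- current index of the player
    step   -- -1 for left, +1 for right
    push   -- True for a Push action, False for a Move action
    """
    target = player + step
    if not 0 <= target < len(row) or row[target] == WALL:
        return player  # blocked by a wall or the edge
    if row[target] == BOX:
        beyond = target + step
        can_push = push and 0 <= beyond < len(row) and row[beyond] == FREE
        if not can_push:
            return player  # Move never shifts a box; Push needs free space
        row[beyond], row[target] = BOX, FREE
    return target


row = list("#..$.#")
after_move = apply_action(row, 2, +1, push=False)  # box blocks a plain Move
after_push = apply_action(row, 2, +1, push=True)   # Push shifts the box right
```

In this model a box pushed against a wall cannot be recovered from that side, which is the kind of irreversible move that makes Sokoban difficult.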

Maintenance & Community

Project initiated by Max-Philipp B. Schrader.
Open for contributions via issues or pull requests.

Licensing & Compatibility

License not explicitly stated in the README.

Limitations & Caveats

Larger room configurations may take significant time to generate, especially on laptops.
Because the README does not state a license, the terms for commercial use are unclear.

Health Check

Last Commit: 2 years ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 0

Star History

1 star in the last 30 days
