StarCraft II environment for LLM agents, with benchmarks and summarization
Top 92.8% on sourcepulse
This repository provides TextStarCraft II, a pure language environment for Large Language Models (LLMs) to play StarCraft II. It addresses the limitations of previous AI agents in long-term strategic planning and interpretability by leveraging LLMs and a novel Chain of Summarization (CoS) approach. The project is targeted at AI researchers and developers interested in evaluating LLM capabilities in complex real-time strategy games.
How It Works
The core innovation is the Chain of Summarization (CoS) method, which processes raw game observations through single-frame summarization and multi-frame summarization. This allows LLMs to analyze game state, generate command recommendations, and make strategic decisions. This approach aims to provide greater strategy interpretability and expandability compared to traditional RL or SL methods.
Quick Start & Requirements
pip install -r requirements.txt
. ChromaDB should be installed before burnysc2
.StarCraft II\Maps
directory.test_the_env.py
for single-process testing or multiprocess_test.py
for parallel execution. Key parameters include player_race
(currently Protoss only), difficulty
, LLM_model_name
, and API keys.Highlighted Details
Maintenance & Community
No specific community links (Discord/Slack) or notable contributors are mentioned in the README.
Licensing & Compatibility
The repository does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The environment is currently Windows-only due to Blizzard's lack of Linux support for the latest SC2 version. Only the Protoss race is supported; Zerg and Terran are under development. A single game can take approximately 7 hours to run using LLMs.
3 months ago
1 day