Qwen-AgentWorld by QwenLM

Language world models for general agents

Created 4 days ago

New!

507 stars

Top 60.7% on SourcePulse

1 Expert Loves This Project

pgarbacki

Cofounder of Fireworks AI

Project Summary

Qwen-AgentWorld: Language World Models for General Agents

Qwen-AgentWorld introduces native language world models (LWMs) for general agents, simulating complex environments across seven unified domains. It offers a generalizable, scalable, and controllable simulator, benefiting researchers and developers by enabling robust agent training and evaluation with zero-shot out-of-distribution (OOD) generalization capabilities.

How It Works

This project pioneers a "native world model" approach, integrating environment modeling from the initial CPT stage through SFT and RL training pipelines, rather than as a post-hoc addition. This core design allows for superior zero-shot generalization to unseen environments and controllable simulation. The model unifies seven distinct agent interaction domains—MCP, Search, Terminal, SWE, Android, Web, and OS—into a single, cohesive architecture trained on over 10 million real-world interaction trajectories.

Quick Start & Requirements

Primary install/run: Supports SGLang (python -m sglang.launch_server ...), vLLM (vllm serve ...), and Hugging Face Transformers (AutoModelForCausalLM.from_pretrained).
Prerequisites: GPU acceleration is implied for inference. pip install openai is required for evaluation. An OpenAI API key is needed for LLM judge scoring.
Resources: Long context lengths (256K) are supported.
Links: Technical Report, Blog, Hugging Face, ModelScope.

Highlighted Details

Qwen-AgentWorld-397B-A17B achieves the highest overall score (58.71) on the AgentWorldBench, outperforming proprietary models like GPT-5.4 (58.25).
Demonstrates significant performance gains (+4.3 to +12.3) in out-of-distribution environments and controllable simulation tasks through Sim RL.
Functions as an "Agent Foundation Model," where LWM RL warm-up effectively transfers to multi-turn, tool-calling agentic tasks across diverse benchmarks.

Maintenance & Community

Community interaction and support are available via Discord and WeChat groups, with links not directly provided in the README.

Licensing & Compatibility

Models and AgentWorldBench are licensed under Apache 2.0, permitting commercial use and integration without explicit copyleft restrictions.

Limitations & Caveats

The provided documentation does not explicitly detail known limitations, alpha status, or specific unsupported platforms or features.

Health Check

Last Commit

1 day ago

Responsiveness

Inactive

Pull Requests (30d)

1

Issues (30d)

0

Star History

509 stars in the last 4 days

Explore Similar Projects

AgentOrientedTUI by zhangweijp

Agent-oriented interface and runtime model for AI software

Created 4 months ago

Updated 2 months ago

agents-last-exam by rdi-berkeley

Evaluating AI agents on complex, real-world tasks

Created 1 month ago

Updated 16 hours ago

Starred by

Binyuan Hui

Binyuan Hui(Research Scientist at Alibaba Qwen).

awesome-computer-use by ranpox

Resources for GUI computer-use agents

Created 1 year ago

Updated 2 months ago

Starred by

Yaowei Zheng

Yaowei Zheng(Author of LLaMA-Factory).

uni-agent by verl-project

Scalable framework for building, running, and training general AI agents

Created 5 months ago

Updated 2 days ago

Awesome-Papers-Autonomous-Agent by lafmdp

Paper list for autonomous agent research

Created 2 years ago

Updated 2 months ago

Starred by

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect) and

Lewis Tunstall

Lewis Tunstall(Research Engineer at Hugging Face).

meta-agents-research-environments by facebookresearch

Platform for evaluating AI agents in dynamic, realistic scenarios

Created 9 months ago

Updated 6 days ago

AgentTorch by AgentTorch

AI platform for large population modeling, simulating millions of interacting agents

Created 3 years ago

Updated 1 month ago

Starred by

Johannes Hagemann

Johannes Hagemann(Cofounder of Prime Intellect),

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect), and

2 more.

AgentTuning by THUDM

Agent tuning for generalized LLM agent abilities

Created 2 years ago

Updated 2 years ago

Starred by

Will Brown

Will Brown(Research Lead at Prime Intellect).

AgentGym by WooooDyy

Agent framework for LLM-based agent development and evaluation

Created 2 years ago

Updated 3 weeks ago

intellagent by plurai-ai

Framework for agent diagnosis and optimization using simulated interactions

Created 1 year ago

Updated 1 month ago

Starred by

Deshraj Yadav

Deshraj Yadav(Cofounder of Mem0),

Gregor Zunic

Gregor Zunic(Cofounder of Browser Use), and

9 more.

camel by camel-ai

Multi-agent framework for studying agent scaling laws

Created 3 years ago

Updated 10 hours ago

Starred by

Tobi Lutke

Tobi Lutke(Cofounder of Shopify),

Alex Yu

Alex Yu(Research Scientist at OpenAI; Cofounder of Luma AI), and

18 more.

generative_agents by joonspk-research

Research paper code for interactive human behavior simulation using generative agents

Created 2 years ago

Updated 1 year ago

Feedback? Help us improve.