Discover and explore top open-source AI tools and projects—updated daily.
meituan-longcatPowerful Large Reasoning Model for agentic tasks
Top 98.5% on SourcePulse
LongCat-Flash-Thinking-2601 is a 560 billion parameter Large Reasoning Model (LRM) employing a Mixture-of-Experts (MoE) architecture. It significantly enhances agentic reasoning capabilities through a novel training pipeline combining environment scaling, multi-environment reinforcement learning, and robust training against environmental noise. The model offers top-tier performance on agentic benchmarks, improved generalization to out-of-distribution scenarios, and a specialized "Heavy Thinking Mode" for extremely challenging tasks.
How It Works
The model leverages a domain-parallel training recipe and an innovative MoE architecture (560B total, 27B activated parameters). Its agentic capabilities are strengthened via:
Quick Start & Requirements
Basic adaptations exist for deployment with SGLang and vLLM; detailed instructions are available in a separate Deployment Guide. Example usage in the README utilizes the transformers library. Specific local setup prerequisites (e.g., GPU, CUDA versions, Python versions) are not detailed. A chat interface is available at: https://longcat.ai.
Highlighted Details
Maintenance & Community
Direct contact is available via longcat-team@meituan.com. A WeChat Group is also mentioned for community interaction. The model weights are actively used on the Longcat AI platform.
Licensing & Compatibility
Model weights are released under the MIT License. This license explicitly does not grant rights to use Meituan trademarks or patents. Standard LLM usage considerations apply regarding accuracy, safety, fairness, and compliance with applicable laws and regulations for downstream applications.
Limitations & Caveats
The model has not been comprehensively evaluated for every possible downstream application and may exhibit performance variations across languages. Developers must carefully assess accuracy, safety, and fairness before deployment in sensitive scenarios. Maintaining database consistency can be challenging when environments contain a large number of tools, potentially leading to unverifiable tasks. Compliance with all applicable laws and regulations is the responsibility of the user.
2 weeks ago
Inactive