Discover and explore top open-source AI tools and projects—updated daily.
Multimodal LLMs for physical common sense and embodied decisions
Top 50.7% on SourcePulse
Cosmos-Reason1 is a suite of multimodal LLMs, ontologies, and benchmarks designed to imbue AI with physical common sense and embodied reasoning capabilities. Targeting researchers and developers in AI, robotics, and embodied agents, it enables models to generate physically grounded responses through long chain-of-thought reasoning.
How It Works
The models, Cosmos-Reason1-8B and Cosmos-Reason1-56B, undergo a four-stage training process: vision pre-training, general Supervised Fine-Tuning (SFT), Physical AI SFT, and Physical AI reinforcement learning. This approach leverages ontologies for physical common sense and embodied reasoning, coupled with custom benchmarks, to specifically enhance the physical reasoning abilities of multimodal LLMs.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The README lacks specific instructions for installation, usage, or direct model access, requiring users to consult external resources. Model family details are marked as "Coming Soon."
2 days ago
Inactive