Discover and explore top open-source AI tools and projects—updated daily.
OpenSenseNovaScaling multimodal models to achieve advanced spatial intelligence
Top 96.5% on SourcePulse
SenseNova-SI: Scaling Spatial Intelligence with Multimodal Foundation Models
This project addresses the limitations of current multimodal foundation models in spatial intelligence by introducing the SenseNova-SI family. It offers researchers and practitioners enhanced capabilities in understanding and generating spatial information, leveraging large-scale, curated datasets and established multimodal architectures. The primary benefit is achieving state-of-the-art performance on diverse spatial intelligence benchmarks while maintaining general multimodal understanding.
How It Works
SenseNova-SI scales existing multimodal foundation models, such as Qwen3-VL, InternVL3, and Bagel, by training them on a meticulously curated dataset named SenseNova-SI-8M. This dataset comprises approximately 8.16 million diverse samples derived from 151 sources, systematically covering a broad taxonomy of spatial capabilities. This data-centric approach aims to cultivate robust spatial reasoning and generalization.
Quick Start & Requirements
uv for environment synchronization: uv sync --extra cu124 (or other CUDA versions like cu118, cu121, cu126, etc.).uv sync), Python (3.10+ recommended via conda environments).uv installation: https://docs.astral.sh/uv/getting-started/installation/#installing-uvexample.py, example_bagel.pyHighlighted Details
Maintenance & Community
SenseNova-SI is described as an "ongoing project" with continuous updates planned. Newly trained models are publicly released. Future integration with larger in-house models is anticipated. Specific community links (Discord, Slack) or a public roadmap are not detailed in the README.
Licensing & Compatibility
The project incorporates code from BAGEL, InternVL, and lmms-engine, and directs users to consult the original repositories for licensing details. No explicit license is stated for the SenseNova-SI project itself. Compatibility for commercial use or closed-source linking would depend on the licenses of the underlying base models.
Limitations & Caveats
As an ongoing project, SenseNova-SI is subject to continuous development and updates. The reliance on external base model licenses means users must verify compatibility for their specific use cases. Future integration plans suggest current models may evolve or be superseded.
1 week ago
Inactive