Discover and explore top open-source AI tools and projects—updated daily.
OschAIAI-powered annotation tool for accelerating computer vision workflows
Top 75.9% on SourcePulse
VisioFirm is an open-source, AI-assisted annotation tool designed to accelerate computer vision dataset labeling. It targets researchers, data scientists, and ML engineers working with large image and video datasets, offering significant time savings (up to 80%) through semi-automated pre-annotation and label propagation. The tool streamlines workflows with an intuitive web interface, powerful backend, and support for various annotation types and popular model formats.
How It Works
VisioFirm leverages state-of-the-art AI models for pre-annotation, including OpenAI CLIP for classification, SAM2 for segmentation, YOLO (v5-v12) for detection, and Grounding DINO for zero-shot object grounding. For video annotation, it offers advanced label propagation via a SAM2-powered "SmartPropagator" or various OpenCV trackers, enabling frame-to-frame consistency. The system supports cross-domain annotation, allowing detection models to generate segmentation masks or vice-versa. Its backend is migrated to FastAPI for improved performance, and it includes a Python API for pipeline integration.
Quick Start & Requirements
pip install -U visiofirmgit clone https://github.com/OschAI/VisioFirm.git), navigate to the directory, and run pip install -e .visiofirm in the terminal.~/.cache/visiofirm_cache, ~/Library/Caches/visiofirm_cache, or %LOCALAPPDATA%\visiofirm_cache) before first run to avoid conflicts.Highlighted Details
Maintenance & Community
The project is maintained by Safouane El Ghazouali. Bug reports and feature requests can be submitted via the GitHub Issues page. A Discord community and a documentation website are planned for the future.
Licensing & Compatibility
VisioFirm itself is licensed under the Apache 2.0 license. However, it integrates third-party models with different licenses: Ultralytics YOLO uses AGPL-3.0, while SAM2 and GroundingDINO use Apache 2.0 and BSD 3-Clause. The AGPL-3.0 license for Ultralytics YOLO may impose copyleft restrictions on derivative works or linked applications, potentially impacting closed-source commercial use.
Limitations & Caveats
Official documentation and community support (Discord) are listed as "SOON". The v1 release requires manual cache directory management to ensure proper initialization. The AGPL-3.0 license of a key dependency (Ultralytics YOLO) may present compatibility challenges for certain commercial or closed-source applications.
1 month ago
Inactive