Discover and explore top open-source AI tools and projects—updated daily.
longzw1997Third-party GroundingDINO implementation for open-set object detection
Top 47.2% on SourcePulse
This repository provides a third-party implementation of the Grounding DINO paper, enabling open-set object detection and grounding. It is designed for researchers and practitioners looking to fine-tune or pre-train models for custom datasets, offering capabilities beyond the official release.
How It Works
The project leverages the DINO architecture, enhanced with grounded pre-training. It supports a custom odvg data format for training, which unifies object detection (OD) and visual grounding (VG) datasets. This approach allows for flexible data integration and training on diverse datasets, including large-scale ones like GRIT-20M.
Quick Start & Requirements
pip install -r requirements.txt, and compile custom ops (cd models/GroundingDINO/ops && python setup.py build install).data_format.md.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is a third-party implementation and may not perfectly replicate all features or performance of the official Grounding DINO. Training support for object detection data was initially marked as '✖' in the README's feature table, though the text indicates training is supported. Evaluation on custom test sets requires careful configuration of use_coco_eval and label_list.
3 months ago
1 day
IDEA-Research
mlfoundations