Third-party GroundingDINO implementation for open-set object detection
Top 51.9% on sourcepulse
This repository provides a third-party implementation of the Grounding DINO paper, enabling open-set object detection and grounding. It is designed for researchers and practitioners looking to fine-tune or pre-train models for custom datasets, offering capabilities beyond the official release.
How It Works
The project leverages the DINO architecture, enhanced with grounded pre-training. It supports a custom odvg
data format for training, which unifies object detection (OD) and visual grounding (VG) datasets. This approach allows for flexible data integration and training on diverse datasets, including large-scale ones like GRIT-20M.
Quick Start & Requirements
pip install -r requirements.txt
, and compile custom ops (cd models/GroundingDINO/ops && python setup.py build install
).data_format.md
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is a third-party implementation and may not perfectly replicate all features or performance of the official Grounding DINO. Training support for object detection data was initially marked as '✖' in the README's feature table, though the text indicates training is supported. Evaluation on custom test sets requires careful configuration of use_coco_eval
and label_list
.
6 days ago
Inactive