Object detection research paper enhancing bounding box regression
Top 83.0% on sourcepulse
This repository provides implementations of Distance-IoU (DIoU) and Complete-IoU (CIoU) losses, along with Cluster-NMS (Non-Maximum Suppression) for improved object detection and instance segmentation. It targets researchers and practitioners in computer vision seeking to enhance bounding box regression and object detection performance. The primary benefit is faster convergence and better accuracy in object detection tasks.
How It Works
The project introduces DIoU and CIoU losses, which incorporate geometric factors like center point distance and aspect ratio into the bounding box regression loss function. This approach leads to more direct convergence compared to traditional IoU-based losses. Cluster-NMS is a novel NMS algorithm that groups detections by class and applies NMS within clusters, or by adding class offsets to coordinates to ensure zero IoU between different classes, improving efficiency and accuracy.
Quick Start & Requirements
environment.yml
(Conda) or manually with pip
.cd external/DCNv2 && python setup.py build develop
).sh data/scripts/COCO.sh
.Highlighted Details
Maintenance & Community
The project is associated with author Zhaohui Zheng. Further details on community or roadmap are not explicitly provided in the README.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The project is based on YOLACT++, which requires compiling DCNv2, adding a build step. Some NMS implementations might be slower than highly optimized versions like Torchvision's NMS without further engineering. The README mentions that some experimental NMS variants (SPM + Distance + Weighted Cluster-NMS) were not included in the paper.
1 year ago
1 day