Discover and explore top open-source AI tools and projects—updated daily.
Object detector research paper for densely packed scenes
Top 43.4% on SourcePulse
This repository provides a dataset and codebase for precise object detection in densely packed scenes, targeting researchers and practitioners in computer vision. It addresses the challenge of accurately identifying and localizing numerous, often similar, objects in close proximity, a common issue in retail and urban environments.
How It Works
The core innovation lies in a novel Soft-IoU layer integrated into an object detector. This layer estimates the Jaccard index between detected bounding boxes and ground truth, providing a quality score. These detections, along with their Soft-IoU scores, are then modeled as a Mixture of Gaussians. An Expectation-Maximization (EM) based merger unit clusters these Gaussians to resolve overlapping detections, leading to more precise results in crowded scenes.
Quick Start & Requirements
keras-retinanet
as a base.Highlighted Details
keras-retinanet
framework.Maintenance & Community
The project is associated with the CVPR 2019 paper "Precise Detection in Densely Packed Scenes." Contributions are welcomed.
Licensing & Compatibility
The dataset is provided for academic and non-commercial use only. The codebase license is not explicitly stated but is built on keras-retinanet
.
Limitations & Caveats
The codebase is noted as being under testing with potential glitches. The EM-merger is a stable but not time-optimized version. The dataset license restricts commercial use.
2 years ago
Inactive