PyTorch implementation for few-shot object detection research
Top 71.6% on SourcePulse
Meta-DETR is a PyTorch implementation for few-shot object detection, addressing the challenge of generalizing from base classes to novel classes with limited data. It targets researchers and practitioners in computer vision and deep learning who require state-of-the-art performance in few-shot detection scenarios, offering improved generalization by exploiting inter-class correlations.
How It Works
Meta-DETR employs an image-level meta-learning approach, bypassing the proposal quality gap common in R-CNN-based methods. It performs meta-learning across multiple support classes simultaneously, enabling effective exploitation of inter-class correlations to enhance generalization. This approach leads to superior performance compared to traditional few-shot object detectors.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is associated with authors from academia, including Eric P. Xing. Further community engagement details are not explicitly provided in the README.
Licensing & Compatibility
Released under the MIT license. Users are responsible for ensuring compliance with all license requirements, including those of prior works. Commercial use is permitted under MIT license terms.
Limitations & Caveats
The implementation is tested on specific older versions of Ubuntu, CUDA, and PyTorch, recommending exact setups. While broader compatibility is suggested, users may encounter setup challenges with different environments. The project relies on custom CUDA operators for Deformable Attention, requiring compilation.
3 years ago
Inactive