BCNet by lkeab

Research paper on occlusion-aware instance segmentation

Created 5 years ago

566 stars

Top 56.9% on SourcePulse

Project Summary

BCNet addresses the challenge of instance segmentation in scenes with occluded objects by explicitly modeling occlusion relationships. It targets researchers and practitioners in computer vision seeking state-of-the-art performance in complex scenes, offering improved accuracy through its novel bilayer decoupling approach.

How It Works

BCNet introduces a novel mask head that models image formation as a composition of two overlapping layers: an occluder layer and an occludee layer. This "bilayer decouple" approach explicitly separates the object boundary and mask predictions for both occluding and occluded instances within the same region of interest. This allows for more accurate segmentation of overlapping objects by considering their interaction and disentangling occluder and occludee boundaries, leading to improved performance on standard detectors like Faster R-CNN and FCOS.

Quick Start & Requirements

Install: Requires conda for environment setup, followed by pip install for dependencies and the BCNet package.
Prerequisites: PyTorch 1.4.0, torchvision 0.5.0, CUDA toolkit 10.1, Python 3.7, ninja, yacs, cython, matplotlib, tqdm, opencv-python==4.4.0.40, scikit-image, and pycocotools.
Dataset: COCO 2017 dataset with converted mask annotations for bilayer decoupling training.
Resources: Multi-GPU training is supported.
Links: Official Implementation, Paper, COCO-OCC Split

Highlighted Details

Achieves state-of-the-art performance on COCO test-dev, with mAP(mask) scores up to 41.2 (FCOS, Res-X101 FPN).
Explicitly models occlusion using a bilayer structure with two GCN layers for occluder and occludee.
Visualizations show the network's ability to handle multiple occluders within a single ROI.
Integrates seamlessly with both anchor-based (Faster R-CNN) and anchor-free (FCOS) detectors.

Maintenance & Community

Developed by Lei Ke, Yu-Wing Tai, and Chi-Keung Tang (CVPR 2021).
Related works include Mask Transfiner (CVPR 2022), VOIN (ICCV 2021).
Contact: lkeab@cse.ust.hk or GitHub issues.

Licensing & Compatibility

MIT License.
Compatible with commercial use and closed-source linking.

Limitations & Caveats

Requires specific older versions of PyTorch (1.4.0) and OpenCV (4.4.0.40), which may pose compatibility challenges with newer environments.
The installation process involves manual cloning and building of pycocotools and requires specific dataset annotation conversions.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

1 stars in the last 30 days

Explore Similar Projects

PixelLM by MaverickRen

LMM for pixel-level image reasoning and segmentation

Created 2 years ago

Updated 1 year ago

no-time-to-train by miquel-espinosa

Training-free instance segmentation via reference images

Created 7 months ago

Updated 5 days ago

LOST by valeoai

Unsupervised object discovery and detection framework

Created 4 years ago

Updated 2 years ago

CIoU by Zzh-tju

Object detection research paper enhancing bounding box regression

Created 5 years ago

Updated 2 years ago

efficientdet-pytorch by bubbliiiing

PyTorch code for EfficientDet object detection

Created 5 years ago

Updated 2 years ago

ComfyUI-YoloWorld-EfficientSAM by ZHO-ZHO-ZHO

ComfyUI nodes for object detection and segmentation workflows

Created 2 years ago

Updated 1 year ago

D2Det by JialeCao001

Object detection & instance segmentation research paper

Created 5 years ago

Updated 5 years ago

yolov8-pytorch by bubbliiiing

PyTorch implementation for YOLOv8 object detection

Created 3 years ago

Updated 2 years ago

yolov7-pytorch by bubbliiiing

PyTorch implementation for YOLOv7 object detection

Created 3 years ago

Updated 2 years ago

yolox-pytorch by bubbliiiing

PyTorch implementation for the YOLOX object detection model

Created 4 years ago

Updated 2 years ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and

Kevin Hou

Kevin Hou(Head of Product Engineering at Windsurf).

ImageAI by OlafenwaMoses

Python library for computer vision tasks

Created 8 years ago

Updated 1 year ago

Starred by

Anastasios Angelopoulos

Anastasios Angelopoulos(Cofounder of LMArena),

Chenlin Meng

Chenlin Meng(Cofounder of Pika), and

1 more.

Pytorch-UNet by milesial

PyTorch implementation for image semantic segmentation

Created 8 years ago

Updated 1 year ago

Feedback? Help us improve.