InSPyReNet by plemeri

PyTorch implementation for high-resolution salient object detection

Created 4 years ago

701 stars

Top 48.7% on SourcePulse

Project Summary

This repository provides the official PyTorch implementation of InSPyReNet, a novel framework for high-resolution salient object detection (HR-SOD). It addresses the challenge of HR-SOD without requiring HR datasets by employing an image pyramid structure and a unique pyramid blending method to overcome receptive field discrepancies. The target audience includes researchers and practitioners in computer vision focused on image segmentation and object detection.

How It Works

InSPyReNet utilizes an image pyramid structure to generate saliency maps at multiple resolutions. A key innovation is its pyramid blending method, which synthesizes results from LR and HR image scales. This approach is designed to mitigate the effective receptive field (ERF) discrepancy between different resolutions, enabling accurate HR prediction without direct HR training data.

Quick Start & Requirements

Install: pip install transparent-background
Prerequisites: PyTorch. Specific backbone requirements (e.g., Res2Net, Swin Transformer) are used in provided models.
Data Download: python utils/download.py --extra --dest [DEST]
Resources: Official documentation for training, testing, and inference is available at getting_started.md. Model Zoo and pre-computed results are detailed in model_zoo.md. A web demo is available via HuggingFace.

Highlighted Details

Achieves state-of-the-art performance on various SOD metrics and boundary accuracy for HR images.
Offers a command-line tool and Python API via the transparent-background package.
Extended for lane segmentation in driving scenes (LaneSOD repository).
Supports multiple backbones including Res2Net and Swin Transformer.

Maintenance & Community

The project was presented at ACCV2022. A web demo is available on HuggingFace, provided by TasksWithCode.

Licensing & Compatibility

The repository does not explicitly state a license in the README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The README does not specify a license, which may impact commercial adoption. Compatibility details for closed-source integration are also absent.

Health Check

Last Commit

5 months ago

Responsiveness

1 day

Pull Requests (30d)

0

Issues (30d)

1

Star History

16 stars in the last 30 days

Explore Similar Projects

PixelOE by KohakuBlueleaf

Python library for detail-oriented pixel art generation from images

Created 1 year ago

Updated 2 months ago

UniWorld by PKU-YuanGroup

Unified framework for visual tasks

Created 11 months ago

Updated 1 week ago

cross-image-attention by garibida

Research paper implementation for zero-shot appearance transfer

Created 2 years ago

Updated 1 year ago

mvits_for_class_agnostic_od by mmaaz60

Research paper for class-agnostic object detection

Created 4 years ago

Updated 2 years ago

krita-vision-tools by Acly

AI-powered image masking and editing tools for Krita

Created 2 years ago

Updated 3 weeks ago

Starred by

Jesse Clark

Jesse Clark(Cofounder of Marqo).

ZenCtrl by FotographerAI

GenAI framework for subject-driven image generation

Created 7 months ago

Updated 4 months ago

ComfyUI_LayerStyle_Advance by chflame163

ComfyUI nodes for advanced image layer styling and manipulation

Created 11 months ago

Updated 1 month ago

ComfyUI-RMBG by 1038lab

ComfyUI node for image segmentation and background removal

Created 11 months ago

Updated 1 month ago

BCNet by lkeab

Research paper on occlusion-aware instance segmentation

Created 4 years ago

Updated 2 years ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and

Kevin Hou

Kevin Hou(Head of Product Engineering at Windsurf).

ImageAI by OlafenwaMoses

Python library for computer vision tasks

Created 7 years ago

Updated 1 year ago

Starred by

Anastasios Angelopoulos

Anastasios Angelopoulos(Cofounder of LMArena),

Chenlin Meng

Chenlin Meng(Cofounder of Pika), and

1 more.

Pytorch-UNet by milesial

PyTorch implementation for image semantic segmentation

Created 8 years ago

Updated 1 year ago

Starred by

Alexandr Wang

Alexandr Wang(Chief AI Officer at Meta; Cofounder of Scale AI),

Boris Cherny

Boris Cherny(Creator of Claude Code; MTS at Anthropic), and

8 more.

awesome-deep-vision by kjw0612

Curated list of deep learning resources for computer vision

Created 10 years ago

Updated 2 years ago

Feedback? Help us improve.