RSPrompter  by KyanChen

PyTorch code for remote sensing instance segmentation via visual foundation models

created 2 years ago
613 stars

Top 54.5% on sourcepulse

GitHubView on GitHub
Project Summary

This repository provides a PyTorch implementation for RSPrompter, a method for remote sensing instance segmentation using visual foundation models. It targets researchers and practitioners in remote sensing and computer vision, offering a framework to leverage large foundation models for improved segmentation accuracy.

How It Works

RSPrompter builds upon the MMDetection framework, integrating Segment Anything Model (SAM) capabilities for instance segmentation. It introduces novel prompting techniques to adapt SAM for remote sensing data, allowing for efficient fine-tuning with methods like LoRA and variable input image sizes to manage memory usage.

Quick Start & Requirements

  • Installation: Clone the repository and install dependencies via pip and mim. Recommended environment: Python 3.10, PyTorch 2.1.x, CUDA 12.1, MMCV 2.1.x.
  • Prerequisites: Linux or Windows, Miniconda, PyTorch, MMCV, transformers, wandb, and other listed Python packages. DeepSpeed is optional for accelerated training.
  • Setup: Follow the detailed installation steps in the README.
  • Resources: Official documentation and Hugging Face Spaces for models are linked.

Highlighted Details

  • Consistent API and usage with MMDetection.
  • Open-source SAM-seg, SAM-det, and RSPrompter models.
  • Supports AMP, DeepSpeed for training.
  • Variable input image size and LoRA for memory efficiency.

Maintenance & Community

The project is actively developed, with recent updates in late 2023. Users can seek help via GitHub Issues.

Licensing & Compatibility

Licensed under Apache 2.0, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

DeepSpeed support is noted as imperfect on Windows. The README suggests that low-resolution inputs reduce memory but have not been performance-verified. Some configurations may require significant GPU memory (e.g., 20.9 GB for RSPrompter-query with 1024x1024 input on a single RTX 4090).

Health Check
Last commit

1 year ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
2
Star History
18 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
10 more.

open-r1 by huggingface

0.2%
25k
SDK for reproducing DeepSeek-R1
created 6 months ago
updated 4 days ago
Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake), and
12 more.

DeepSpeed by deepspeedai

0.2%
40k
Deep learning optimization library for distributed training and inference
created 5 years ago
updated 14 hours ago
Feedback? Help us improve.