Survey of Remote Sensing Multimodal LLMs
This repository is a comprehensive survey and curated collection of resources for Multimodal Large Language Models (MLLMs) applied to remote sensing (RS-MLLMs). Aimed at researchers and practitioners in the field, it offers a centralized hub for the latest advancements, datasets, benchmarks, and intelligent agents, with the goal of accelerating development and understanding in this specialized domain.
How It Works
The project functions as an "awesome list"-style repository, gathering and categorizing papers, datasets, and benchmarks related to RS-MLLMs. Coverage spans vision-language pre-training models, intelligent agents for remote sensing tasks, and evaluation benchmarks for specific applications such as image captioning, visual question answering, and image-text retrieval. This organization provides a structured overview of the rapidly evolving RS-MLLM landscape.
Quick Start & Requirements
This repository is a curated list of research papers and resources, not a runnable software package, so no installation or execution commands apply. The only prerequisite is an interest in remote sensing and multimodal large language models.
Maintenance & Community
The project is maintained by ZhanYang-nwpu and is updated in real time to track the state of the art in RS-MLLMs. Contact is available via zhanyangnwpu@gmail.com.
Licensing & Compatibility
The repository itself is a collection of links and information; licensing depends on the individual linked research papers and their associated codebases, and is not specified here.
Limitations & Caveats
This repository is a survey and does not provide executable code or models. Per the "latest updates" section, the accompanying review manuscript was first submitted for review in May 2024, a reminder that the field is young and evolving rapidly.