awesome-remote-sensing-vision-language-models
github.com/lzw-lzw/awesome-remote-sensing-vision-language-models ↗Awesome-Remote-Sensing-Vision-Language-Models
195
GitHub Stars
91
Curated Resources
16
Categories
22 hours ago
Last Refreshed
PretrainingImage CaptioningText-based Image GenerationImage-text RetrievalVisual Question AnsweringVisual GroundingScene ClassificationObject DetectionSemantic SegmentationImage Captioning DatasetText-based Image Retrieval DatasetVisual Question Answering DatasetVisual Grounding DatasetScene Classification DatasetObject Detection DatasetSemantic Segmentation Dataset
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me image-text retrieval resources from awesome-remote-sensing-vision-language-models"
Installation instructions →What's inside
Image-text Retrieval
- A deep semantic alignment network for the cross-modal image-text retrieval in remote sensing
J-STARS 2021
- A lightweight multi-scale crossmodal text-image retrieval method in remote sensing
TGRS 2021
- Contrasting dual transformer architectures for multi-modal remote sensing image retrieval
Applied Sciences 2023
- Deep unsupervised embedding for remote sensing image retrieval using textual cues
Applied Sciences 2020
- Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval
arxiv 2023
- Multilanguage transformer for improved text to remote sensing image retrieval
J-STARS 2022
Scene Classification
- A distance-constrained semantic autoencoder for zero-shot remote sensing scene classification
J-STARS 2021
- APPLeNet: Visual Attention Parameterized Prompt Learning for Few-Shot Remote Sensing Image Generalization using CLIP
CVPR 2023
- Fine-grained object recognition and zero-shot learning in remote sensing imagery
TGRS 2017
- Generative adversarial networks for zero-shot remote sensing scene classification
Applied Sciences 2022
- Learning deep crossmodal embedding networks for zero-shot remote sensing image scene classification
TGRS 2021
- Structural alignment based zero-shot classification for remote sensing scenes
ICECE 2018
Scene Classification Dataset
- AID
Home
- NWPU-RESISC45
Home
- SATIN
Home
- UC Merced Land-Use(UCM)
Home
Image Captioning
- A multi-level attention model for remote sensing image captions
Remote Sensing 2020
- A novel SVM-based decoder for remote sensing image captioning
TGRS 2021
- Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning
arxiv 2023
- Can a Machine Generate Humanlike Language Descriptions for a Remote Sensing Image?
TGRS 2017
- Changes to Captions: An Attentive Network for Remote Sensing Change Captioning
arxiv 2023
- Description Generation for Remote Sensing Images Using Attribute Attention Mechanism
Remote Sensing 2019
Visual Question Answering
- A spatial hierarchical reasoning network for remote sensing visual question answering
TGRS 2023
- Bi-modal transformer-based approach for visual question answering in remote sensing imagery
TGRS 2022
- Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs
arXiv 2023
- Cross-Modal Visual Question Answering for Remote Sensing Data: The International Conference on Digital Image Computing: Techniques and Applications
DICTA 2021
- From easy to hard: Learning language-guided curriculum for visual question answering on remote sensing data
TGRS 2022
- How to find a good image-text embedding for remote sensing visual question answering?
ECML-PKDD 2021
Resources
- Brain-inspired Remote Sensing Foundation Models and Open Problems: A Comprehensive Survey
JSTARG 2023
- GeoChat: Grounded Large Vision-Language Model for Remote Sensing
arxiv 2023
- RSGPT: A Remote Sensing Vision Language Model and Benchmark
arxiv 2023
- The Potential of Visual ChatGPT For Remote Sensing
arxiv 2023
- Towards Automatic Satellite Images Captions Generation Using Large Language Models
arxiv 2023
- Tree-GPT: Modular Large Language Model Expert System for Forest Remote Sensing Image Understanding and Interactive Analysis
arxiv 2023
Semantic Segmentation
- CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting
arxiv 2023
- Few-shot segmentation of remote sensing images using deep metric learning
GRSL 2022.
- Language-aware domain generalization network for cross-scene hyperspectral image classification
TGRS 2023
- RRSIS: Referring Remote Sensing Image Segmentation
arxiv 2023
- RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model
arxiv 2023
- Semi-supervised contrastive learning for few-shot segmentation of remote sensing images
Remote Sensing 2022
Showing a sample of 91 resources. View the full list on GitHub →