zhangda1018/Awesome-Remote-Sensing-Open-Vocabulary-Learning

Awesome-Remote-Sensing-Open-Vocabulary-Learning

🔥🔥🔥 Towards Open-Vocabulary Learning in Remote Sensing: A Survey

✨ The first survey for Open-Vocabulary Learning for Remote Sensing (RS-OV).

✨✨✨ Behold our meticulously curated trove of RS-OV resources!!!

🎉🚀💡 The website will be updated in real-time to track the latest state of RS-OV!!!

Please give this project a STAR ⭐ if you find it helpful

📢 Latest Updates

  • The list will be continuously updated 🔥🔥

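Table of Contents

  • Open Vocabulary Object Detection for Remote Sensing
  • Open Vocabulary Semantic Segmentation for Remote Sensing
  • Open Vocabulary Instance Segmentation for Remote Sensing
  • Open Vocabulary Change Detection for Remote Sensing
  • Open Vocabulary Visual Grounding for Remote Sensing
  • Other
  • Foundation Model & Datasets
  • Contact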

Open Vocabulary Object Detection for Remote Sensing

Title Venue Date Code Note
Unveiling the Unknown: A SAM Guided Open World Object Detection Method for Remote Sensing
Mingtao Hu, Wenxin Yin, Wenhui Diao, Xin Gao, Xian Sun
Arxiv 2026 - -
A Training-Free Guess What Vision Language Model from Snippets to Open-Vocabulary Object Detection
Guiying Zhu, Bowen Yang, Yin Zhuang, Tong Zhang, Guanqun Wang, Zhihao Che, He Chen, Lianlin Li
Arxiv 2026 - -
Open-vocabulary object detection for high-resolution remote sensing images
HuaDong Li.
CVIU 2025 - -
LLaMA-Unidetector: An LLaMA-Based Universal Framework for Open-Vocabulary Object Detection in Remote Sensing Imagery
J Xie, G Wang, T Zhang, Y Sun, H Chen, Y. Zhang.
TGRS 2025 Github -
RT-OVAD: Real-Time Open-Vocabulary Aerial Object Detection via Image-Text Collaboration
G Wei, X Yuan, Y Liu, Z Shang, X Xue, P Wang, K Yao, C Zhao, H Zhang, R Xiao.
Arxiv 2025 Github -
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images
Z Huang, Y Feng, S Yang, Z Liu, Q Liu, Y Wang.
ICCV 2025 Github -
Cross-View Open-Vocabulary Object Detection in Aerial Imagery
J Kini, R Gupta, M Shah.
Arxiv 2025 - -
FASE: Feature-Aligned Scene Encoding for Open-Vocabulary Object Detection in Remote Sensing
H Hwang, SS Woo.
CIKM 2025 - -
Mask-Guided Teacher–Student Learning for Open-Vocabulary Object Detection in Remote Sensing Images
J S Wang, Y Song, J Xiang, Y Chen, P Zhong, R Fu.
Remote Sensing 2025 - -
Locate anything on earth: Advancing open-vocabulary object detection for remote sensing community
J Pan, Y Liu, Y Fu, M Ma, J Li, DP Paudel, L Van Gool, X Huang.
AAAI 2025 Github -
Toward open vocabulary aerial object detection with clip-activated student-teacher learning
Y Li, W Guo, X Yang, N Liao, D He, J Zhou, W Yu.
ECCV 2024 Github -
CLIP-guided source-free object detection in aerial images
N Liu, X Xu, Y Su, C Liu, P Gong, HC Li.
IGARSS 2024 Github -

Open Vocabulary Semantic Segmentation for Remote Sensing

Title Venue Date Code Note
MovSeg: Efficient Adaptation of Vision-Language Models for Multispectral Open-Vocabulary Segmentation
Yingrui Ji, Chenhao Wang, Jiansheng Chen, Jingbo Chen, Anzhi Yue, Yu Meng.
ISPRS 2026 - -
AerOSeg++: Scale-Aware and Texture-Guided Open-Vocabulary Segmentation with SAM Features for Remote Sensing Images
Saikat Dutta, Akhil Vasim, Hamid Rezatofighi, Biplab Banerjee.
RS 2026 - -
HG-RSOVSSeg: Hierarchical Guidance Open-Vocabulary Semantic Segmentation Framework of High-Resolution Remote Sensing Images
Wubiao Huang, Fei Deng, Huchen Li, Jing Yang.
RS 2026 - -
SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images
Kaiyu Li, Shengqi Zhang, Yupeng Deng, Zhi Wang, Deyu Meng, Xiangyong Cao.
Arxiv 2025 Github -
DGL-RSIS: Decoupling Global Spatial Context and Local Class Semantics for Training-Free Remote Sensing Image Segmentation
Boyi Li, Ce Zhang, Richard M. Timmerman, Wenxuan Bao.
Arxiv 2025 - Accepted to JAG 2026
Reducing semantic ambiguity in open-vocabulary remote sensing image segmentation via knowledge graph-enhanced class representations
Wubiao Huang, Huchen Li, Shuai Zhang, Fei Deng.
ISPRS 2026 Github -
Learning transferable land cover semantics for open vocabulary interactions with remote sensing images
Valérie Zermatten, Javiera Castillo-Navarro, Diego Marcos, Devis Tuia.
ISPRS 2025 Github -
CitySeg: A 3D Open Vocabulary Semantic Segmentation Foundation Model in City-scale Scenarios
Jialei Xu, Zizhuang Wei, Weikang You, Linyun Li, Weijian Sun.
Arxiv 2025 - -
Soft-Guided Open-Vocabulary Semantic Segmentation of Remote Sensing Images
K An, Y Wang, L Chen.
TGRS 2025 - -
Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing
B Li, H Dong, D Zhang, Z Zhao, J Gao, X Li.
AAAI 2026 Github -
Open-Vocabulary Semantic Segmentation for Remote Sensing Imagery via Dual-Stream Feature Extraction and Category-Adaptive Refinement
S Yuan, B He.
Arxiv 2025 - -
AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images
Saikat Dutta, Akhil Vasim, Siddhant Gole, Hamid Rezatofighi, Biplab Banerjee.
CVPR workshop 2025 - -
Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation
C Ye, Y Zhuge, P Zhang.
AAAI 2025 Github -
Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images
K Li, X Cao, R Liu, S Wang, Z Jiang, Z Wang, D Meng.
Arxiv 2025 Github -
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
K Li, R Liu, X Cao, X Bai, F Zhou, D Meng, Z Wang.
CVPR 2025 Github -
Open-Vocabulary High-Resolution Remote Sensing Image Semantic Segmentation
Q Cao, Y Chen, C Ma, X Yang.
TGRS 2025 Github -
FreeMix: Open-Vocabulary Domain Generalization of Remote-Sensing Images for Semantic Segmentation
J Wu, J Shi, Z Zhao, Z Liu, R Zhi.
TGRS 2025 Github -
Large multimodal model for open vocabulary semantic segmentation of remote sensing images
B Liu, X Chen, A Yu, F Feng, J Yue, X Yu.
EJRS 2025 - -
Expanding Open-Vocabulary Understanding for UAV Aerial Imagery: A Vision–Language Framework to Semantic Segmentation
B Huang, J Li, W Luan, J Tan, C Li, L Huang.
Drones 2025 - -
RSCLIP for Training-Free Open-Vocabulary Remote Sensing Image Semantic Segmentation
S Wang, X Sun, J Han, XX Zhu.
TechRxiv 2025 - -
AerialCLIP: Lightweight Open-Vocabulary Segmentation for UAV-Based Aerial Images
P Jia, Y Gao, W Li, Q Gao, F Pan.
CCC 2025 - -
Toward High-Resolution UAV Imagery Open-Vocabulary Semantic Segmentation
Z Chen, Y Xie, Y Wei.
Drones 2025 - -

Open Vocabulary Instance Segmentation for Remote Sensing

Title Venue Date Code Note
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation
S Huang, S He, H Qin, B Wen.
ICCV 2025 Github -
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation
Shiqi Huang, Shuting He, Bihan Wen.
AAAI 2025 Github -

Open Vocabulary Change Detection for Remote Sensing

Title Venue Date Code Note
DynamicEarth: How Far are We from Open-Vocabulary Change Detection?
K Li, X Cao, Y Deng, C Pang, Z Xin, D Meng, Z Wang.
Arxiv 2025 Github Accepted by AAAI 2026
UniVCD: A New Method for Unsupervised Change Detection in the Open-Vocabulary Era
Ziqiang Zhu, Bowei Yang.
Arxiv 2025 Github -
Semantic-cd: Remote sensing image semantic change detection towards open-vocabulary setting
Y Zhu, L Li, K Chen, C Liu, F Zhou, Z Shi.
Arxiv 2025 - -
Open-vocabulary generative vision-language models for creating a large-scale remote sensing change detection dataset
Y Zan, S Ji, S Chao, M Luo.
ISPRS 2025 Github -
OmniOVCD: Streamlining Open-Vocabulary Change Detection with SAM 3
Xu Zhang, Danyang Li, Yingjie Xia, Xiaohang Dong, Hualong Yu, Jianye Wang, Qicheng Li.
Arxiv 2026 - -

Open Vocabulary Visual Grounding for Remote Sensing

Title Venue Date Code Note
RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images
K Li, D Wang, T Wang, F Dong, Y Zhang, L Zhang, X Wang, S Li, Q Wang.
AAAI 2026 - -
On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration
Yehonathan Refael, Amit Aides, Aviad Barzilai, George Leifman, Genady Beryozkin, Vered Silverman, Bolous Jaber, Tomer Shekel.
Arxiv 2025 - -

Other

Title Venue Date Code Note
FarSLIP: Discovering Effective CLIP Adaptation for Fine-Grained Remote Sensing Understanding
Zhenshi Li, Weikang Yu, Dilxat Muhtar, Xueliang Zhang, Pengfeng Xiao, Pedram Ghamisi, Xiaoxiang Zhu.
Arxiv 2025 Github -
Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding
Yuhang Zhang, Haosheng Yu, Jiaping Xiao, Mir Feroskhan.
Arxiv 2025 Github -
Enhanced Grounding DINO: Efficient Cross-Modality Block for Open-Set Object Detection in Remote Sensing
Z Hu, K Gao, J Wang, Z Yang, Z Zhang, H Chen.
JSTAR 2025 - -
Advancing Open-Set Object Detection in Remote Sensing Using Multimodal Large Language Model
Nandini Saini, Ashudeep Dubey, Debasis Das, Chiranjoy Chattopadhyay.
WACV Workshop 2025 - -
GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing
Zilun Zhang, Haozhan Shen, Tiancheng Zhao, Bin Chen, Zian Guan, Yuhao Wang, Xu Jia, Yuxiang Cai, Yongheng Shang, Jianwei Yin.
Arxiv 2025 - -
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition
Y Zheng, W Wu, Q Li, X Wang, X Zhou, A Ren, J Shen, L Zhao, G Li, X Yang.
NeurIPS 2025 Github -
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling
Q Zhu, J Lao, D Ji, J Luo, K Wu, Y Zhang, L Ru, J Wang, J Chen, M Yang, D Liu, F Zhao.
CVPR 2025 Github -
DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark
H Li, X Zhang, H Qu.
Remote Sensing 2025 Github -
SPECIAL: zero-shot hyperspectral image classification with CLIP
L Pang, J Yao, K Li, X Cao.
Arxiv 2025 Github -
RS-CLIP: Zero shot remote sensing scene classification via contrastive vision-language supervision
X Li, C Wen, Y Hu, N Zhou.
Arxiv 2024 Github -
Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment
Utkarsh Mall, Cheng Perng Phoo, Meilin Kelsey Liu, Carl Vondrick, Bharath Hariharan, Kavita Bala.
ICLR 2024 Github -

Foundation Model & Datasets

Name | Paper | Github | Year
AetherVision-Bench | AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives | - | 2025
Falcon | Falcon: A Remote Sensing Vision-Language Foundation Model | Link | 2025
RemoteSAM | Remotesam: Towards segment anything for earth observation | Link | 2025
SkyEyeGPT | SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model | Link | 2025
OS-W2S | OS-W2S: An Automatic Labeling Engine for Language-Guided Open-Set Aerial Object Detection | Link | 2025
OVRSISBench | Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing | Link | 2025
Git-10M | Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model | Link | 2025
LAE-1M | Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community | Link | 2024
RemoteCLIP | RemoteCLIP: A Vision Language Foundation Model for Remote Sensing | Link | 2024
DDFAV | DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark | Link | 2024
LandDiscover50K | Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation | Link | 2024
SkyScript | SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing | Link | 2023
RS5M | RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing | Link | 2023

🤖 Contact

If you have any questions about this project, please feel free to contact us.

About

Towards Open-Vocabulary Learning for Remote Sensing: A Survey
