Skip to content
Change the repository type filter

All

    Repositories list

    • Turning Every Citation into Explainable Impact
      Python
      Other
      17100Updated Mar 13, 2026Mar 13, 2026
    • EvoTok

      Public
      Code repo for "EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation"
      0700Updated Mar 13, 2026Mar 13, 2026
    • GRADE

      Public
      Python
      02100Updated Mar 13, 2026Mar 13, 2026
    • Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation
      Python
      Other
      02100Updated Mar 13, 2026Mar 13, 2026
    • The official repo of CrossEarth-SAR, a sar-centric and billion-scale geospatial foundation model for cross-domain semantic segmentation
      Python
      01400Updated Mar 12, 2026Mar 12, 2026
    • RISE-Video: Can Video Generators Decode Implicit World Rules?
      Python
      02220Updated Mar 11, 2026Mar 11, 2026
    • PWOOD

      Public
      [CVPR'26] Partial Weakly-Supervised Oriented Object Detection
      Python
      0800Updated Mar 4, 2026Mar 4, 2026
    • Point2RBox-v3

      Public
      [ICLR'26] Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization
      Python
      11200Updated Feb 28, 2026Feb 28, 2026
    • Awesome Remote Sensing Vision-Language Datasets
      MIT License
      2581280Updated Feb 24, 2026Feb 24, 2026
    • LRS-VQA

      Public
      [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning
      Python
      14810Updated Feb 16, 2026Feb 16, 2026
    • SPWOOD

      Public
      [ICLR'26] SPWOOD: Sparse Partial Weakly-Supervised Oriented Object Detection
      Jupyter Notebook
      0110Updated Feb 15, 2026Feb 15, 2026
    • RSCoVLM

      Public
      [Remote Sensing 2026] Co-Training Vision Language Models for Remote Sensing Multi-task Learning
      Jupyter Notebook
      02500Updated Feb 12, 2026Feb 12, 2026
    • [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.
      Jupyter Notebook
      69201Updated Feb 12, 2026Feb 12, 2026
    • OF-Diff

      Public
      [ICLR'26] OF-Diff: Object Fidelity Diffusion for Remote Sensing Image Generation
      Python
      02730Updated Feb 6, 2026Feb 6, 2026
    • VisionXLab_LaTeX_Template

      Public
      TeX
      0700Updated Feb 3, 2026Feb 3, 2026
    • SpaCE-10

      Public
      [ICLR 2026] SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence
      Python
      21810Updated Jan 26, 2026Jan 26, 2026
    • DVGBench

      Public
      [ISPRS2026] DVGBench: Implicit-to-Explicit Visual Grounding Benchmark in UAV Imagery with Large Vision-Language Models
      02030Updated Jan 14, 2026Jan 14, 2026
    • [TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval
      Python
      12910Updated Jan 6, 2026Jan 6, 2026
    • avi-math

      Public
      [ISPRS'25] Multimodal Mathematical Reasoning Embedded in Aerial Vehicle Imagery: Benchmarking, Analysis, and Exploration
      Python
      11300Updated Jan 4, 2026Jan 4, 2026
    • CastDet

      Public
      [ECCV'24/IJCV'26] Code repo for "Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning"
      Python
      Apache License 2.0
      47670Updated Jan 1, 2026Jan 1, 2026
    • [TPAMI 2025] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation
      Python
      MIT License
      918030Updated Dec 21, 2025Dec 21, 2025
    • ProCLIP

      Public
      Official PyTorch implementation of ProCLIP: Progressive Vision-Language Alignment via LLM-based Embedder
      Python
      22210Updated Dec 4, 2025Dec 4, 2025
    • [IJCV] PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection
      Python
      MIT License
      24010Updated Sep 25, 2025Sep 25, 2025
    • [CVPR'25] Official repo of "Point2RBox-v2:Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances"
      Python
      44000Updated Jul 25, 2025Jul 25, 2025
    • AdapTok

      Public
      [CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
      Python
      MIT License
      12230Updated Jun 5, 2025Jun 5, 2025
    • [AAAI 26] Official PyTorch implementation of Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation
      Python
      GNU General Public License v3.0
      15710Updated May 29, 2025May 29, 2025
    • GeoGround

      Public
      GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding
      28050Updated May 10, 2025May 10, 2025
    • [ICLR'25] Official repo of "PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection"
      Python
      Apache License 2.0
      53850Updated Mar 27, 2025Mar 27, 2025
    • [TPAMI] Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection
      Python
      Apache License 2.0
      0300Updated Feb 14, 2025Feb 14, 2025
    • [TPAMI] Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection
      Jupyter Notebook
      Apache License 2.0
      01110Updated Feb 14, 2025Feb 14, 2025