SAM_ChangeCaptioning

SAM Guided Semantic and Motion Changed Region Mining for Remote Sensing Change Captioning, Futian Wang, Mengqi Wang, Xiao Wang*, Haowen Wang, Jin Tang, arXiv:2511.21420

News

Getting Started

Installation

1. Install requirements

Install requirements using pip:

pip install -r requirements.txt

2. Prepare dataset

The data structure of LEVIR-CC is organized as follows:

├─/root/Data/LEVIR_CC/
        ├─LevirCCcaptions.json
        ├─images
             ├─train
             │  ├─A
             │  ├─B
             ├─val
             │  ├─A
             │  ├─B
             ├─test
             │  ├─A
             │  ├─B
             ├─fine_features
             ├─semantic_features
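
Before running the preprocessing script, it can help to confirm that the dataset is laid out as shown above. The following is a minimal sketch, not part of the repository: the helper name `missing_paths` is hypothetical, and it checks only the caption file and the `A`/`B` folders of each split. The `fine_features` and `semantic_features` folders are left out because the tree does not say whether they exist before feature extraction.

```python
from pathlib import Path

# Expected layout, copied from the directory tree above.
# `missing_paths` is an illustrative helper, not a function from this repo.
SPLITS = ("train", "val", "test")
PHASES = ("A", "B")


def missing_paths(root):
    """Return the expected LEVIR-CC paths that do not exist under `root`."""
    root = Path(root)
    expected = [root / "LevirCCcaptions.json"]
    for split in SPLITS:
        for phase in PHASES:
            expected.append(root / "images" / split / phase)
    return [str(p) for p in expected if not p.exists()]


if __name__ == "__main__":
    # Dataset root as given in the tree above; adjust to your own location.
    missing = missing_paths("/root/Data/LEVIR_CC")
    if missing:
        print("Missing paths:")
        for p in missing:
            print("  " + p)
    else:
        print("Dataset layout looks complete.")
```

If the script reports missing paths, fix the folder structure first, since the later steps presumably read from these locations.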

Then extract the change-description text files for each image pair in LEVIR-CC:

python preprocess_data.py

Training

Ensure that the data preparation steps above are completed before proceeding to train the model:

python train.py

Testing

To test the model, run:

python test.py

Note: The complete code and model weights will be uploaded later. The full pipeline includes the following steps:

Step 1: Use the modified SAM to extract motion-level features of the ROI.

Step 2: Use SAM combined with GroundingDINO to extract semantic-level features of the ROI.

Step 3: Feature processing.

Step 4: Graph extraction.

Step 5: Graph information encoding.

Citation

@misc{wang2025samChangeCap,
      title={SAM Guided Semantic and Motion Changed Region Mining for Remote Sensing Change Captioning}, 
      author={Futian Wang and Mengqi Wang and Xiao Wang and Haowen Wang and Jin Tang},
      year={2025},
      eprint={2511.21420},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2511.21420}, 
}
