diff --git a/README.md b/README.md
index 32d5d56..c056614 100644
--- a/README.md
+++ b/README.md
@@ -113,6 +113,7 @@ This is the first work to correct hallucination in multimodal large language mod
## Multimodal Instruction Tuning
| Title | Venue | Date | Code | Demo |
|:--------|:--------:|:--------:|:--------:|:--------:|
+| 
[**Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning**](https://arxiv.org/pdf/2503.07591)
| CVPR | 2025-03-10 | [Github](https://github.com/bardisafa/PreSel) | [Demo](https://bardisafa.github.io/PreSel/) |
| 
[**Qwen2.5-Omni Technical Report**](https://github.com/QwenLM/Qwen2.5-Omni/blob/main/assets/Qwen2.5_Omni.pdf)
| Qwen | 2025-03-26 | [Github](https://github.com/QwenLM/Qwen2.5-Omni) | [Demo](https://huggingface.co/spaces/Qwen/Qwen2.5-Omni-7B-Demo) |
| [**Addendum to GPT-4o System Card: Native image generation**](https://cdn.openai.com/11998be9-5319-4302-bfbf-1167e093f1fb/Native_Image_Generation_System_Card.pdf) | OpenAI | 2025-03-25 | - | - |
| 
[**Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation**](https://arxiv.org/pdf/2411.19951)
| arXiv | 2025-03-17 | [Github](https://github.com/VITA-MLLM/Sparrow) | - |