Qwen-Doc

An Open-Source Collection of Projects on Document Understanding, Parsing, and Agents

📖 Introduction

Qwen-Doc is an open-source repository dedicated to Document AI, developed and maintained by the Tongyi-Zhiwen team.

This repository aims to bring together a series of explorations and practices centered on cutting-edge technologies such as long-context understanding, document parsing, and document-based intelligent agents. We are committed to enhancing the capabilities of Large Language Models in processing and comprehending complex documents, and we open-source our models, data, and methodologies to foster community growth.

🎉 News

Dec 15, 2025: 🔥 We released the QwenLong-L1.5 project! It provides a complete post-training recipe for long-context reasoning and memory management. The corresponding model and technical report have also been released.
Dec 15, 2025: 🔥 We released the code implementation of SPELL, which is a self-play reinforcement learning framework designed to improve long-context reasoning abilities in LLMs.
May 28, 2025: 🔥 The QwenLong-L1 project released QwenLong-L1-32B-AWQ, a version processed with AWQ int4 quantization.
May 26, 2025: 🔥 We officially open-sourced the QwenLong-L1 project, the industry's first large model trained for long-context reasoning using reinforcement learning. We also released the accompanying QwenLong-L1-32B model and the DocQA-RL-1.6K training dataset.

📂 Project List

This repository currently includes the following projects:

1. QwenLong-L1

Description: A framework designed to generalize Large Models from short-context proficiency to robust long-context reasoning capabilities using Reinforcement Learning. This project explores mechanisms like curriculum learning and difficulty-aware sampling, and releases the QwenLong-L1-32B model trained on this framework, which has achieved state-of-the-art performance on multiple long-context document question answering (DocQA) benchmarks.

2. QwenLong-L1.5

Description: A complete "Post-Training Recipe" for long-context reasoning and memory management. This project features three core contributions: a synthesis pipeline for generating complex reasoning data, the Adaptive Entropy-Controlled Policy Optimization (AEPO) algorithm optimized for long-context training, and a memory management framework that extends operation beyond the model's physical context window. Based on this recipe, we introduce the QwenLong-L1.5-30B-A3B model.

3. SPELL

Description: A self-play reinforcement learning framework designed to improve long-context reasoning abilities in LLMs. SPELL cycles a single LLM through three roles—questioner, responder, and verifier—to autonomously generate training data and rewards, without requiring external supervision. Extensive experiments across 12 models and 6 benchmarks demonstrate consistent improvements. Notably, SPELL provides a potential path for elevating the performance ceiling of models surpassing human performance.

⭐ Star History

📝 Citation

If you find our work helpful in your research, please consider citing our papers:

@article{wan2025qwenlongl1,
  title={QwenLong-L1: : Towards Long-Context Large Reasoning Models with Reinforcement Learning},
  author={Fanqi Wan, Weizhou Shen, Shengyi Liao, Yingcheng Shi, Chenliang Li, Ziyi Yang, Ji Zhang, Fei Huang, Jingren Zhou, Ming Yan},
  journal={arXiv preprint arXiv:2505.17667},
  year={2025}
}

@article{shen2025qwenlongl15,
      title={QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management}, 
      author={Weizhou Shen and Ziyi Yang and Chenliang Li and Zhiyuan Lu and Miao Peng and Huashan Sun and Yingcheng Shi and Shengyi Liao and Shaopeng Lai and Bo Zhang and Dayiheng Liu and Fei Huang and Jingren Zhou and Ming Yan},
      journal={arXiv preprint arXiv:2512.12967},
      year={2025}
}

@article{yang2025spell,
    title={SPELL: Self-Play Reinforcement Learning for evolving Long-Context Language Models},
    author={Ziyi Yang, Weizhou Shen, Ruijun Chen, Chenliang Li, Fanqi Wan, Ming Yan, Xiaojun Quan, Fei Huang},
    journal={arXiv preprint arXiv:2509.23863},
    year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
QwenLong-L1.5		QwenLong-L1.5
QwenLong-L1		QwenLong-L1
SPELL		SPELL
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Qwen-Doc

📖 Introduction

🎉 News

📂 Project List

1. QwenLong-L1

2. QwenLong-L1.5

3. SPELL

⭐ Star History

📝 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Qwen-Doc

📖 Introduction

🎉 News

📂 Project List

1. QwenLong-L1

2. QwenLong-L1.5

3. SPELL

⭐ Star History

📝 Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages