@@ -116,23 +116,53 @@ APRIL not only improves training efficiency but also achieves:
116116
117117### System Design
118118
119- ```
120- ┌─────────────────────────────────────────────────────┐
121- │ Training Pipeline │
122- ├─────────────────────────────────────────────────────┤
123- │ │
124- │ ┌──────────────┐ ┌──────────────┐ │
125- │ │ Rollout │───▶│ Buffer │ │
126- │ │ Engine │ │ Management │ │
127- │ │ (SGLang) │ └──────────────┘ │
128- │ └──────────────┘ │ │
129- │ ▲ ▼ │
130- │ │ ┌──────────────┐ │
131- │ │ │ Training │ │
132- │ └────────────│ Engine │ │
133- │ │ (Megatron/FSDP)│ │
134- │ └──────────────┘ │
135- └─────────────────────────────────────────────────────┘
119+ ``` mermaid
120+ graph TB
121+ subgraph Pipeline["🎯 APRIL Training Pipeline"]
122+ subgraph Rollout["📊 Rollout Phase"]
123+ R1[("🎲 Over-provision<br/>N' > N requests")]
124+ R2[("⚡ SGLang<br/>Inference Engine")]
125+ R3[("🛑 Active<br/>Interruption")]
126+ R1 --> R2
127+ R2 --> R3
128+ end
129+
130+ subgraph Buffer["💾 Buffer Management"]
131+ B1[("📦 Partial<br/>Rollouts")]
132+ B2[("♻️ Resume<br/>Queue")]
133+ B3[("✅ Complete<br/>Samples")]
134+ B1 --> B2
135+ R3 --> B1
136+ R3 --> B3
137+ end
138+
139+ subgraph Training["🧠 Training Phase"]
140+ T1[("🔄 Policy<br/>Update")]
141+ T2[("📈 Loss<br/>Computation")]
142+ T3[("⚙️ Megatron/<br/>FSDP Backend")]
143+ B3 --> T2
144+ T2 --> T1
145+ T1 --> T3
146+ end
147+
148+ B2 -.->|Next Iteration| R1
149+ T3 -.->|Updated Model| R2
150+ end
151+
152+ style Pipeline fill:#f9f9ff,stroke:#4a5568,stroke-width:2px
153+ style Rollout fill:#e6f7ff,stroke:#1890ff,stroke-width:1px
154+ style Buffer fill:#fff7e6,stroke:#fa8c16,stroke-width:1px
155+ style Training fill:#f0f5ff,stroke:#597ef7,stroke-width:1px
156+
157+ style R1 fill:#e6f7ff,stroke:#40a9ff
158+ style R2 fill:#e6f7ff,stroke:#40a9ff
159+ style R3 fill:#e6f7ff,stroke:#40a9ff
160+ style B1 fill:#fff7e6,stroke:#ffa940
161+ style B2 fill:#fff7e6,stroke:#ffa940
162+ style B3 fill:#fff7e6,stroke:#ffa940
163+ style T1 fill:#f0f5ff,stroke:#85a5ff
164+ style T2 fill:#f0f5ff,stroke:#85a5ff
165+ style T3 fill:#f0f5ff,stroke:#85a5ff
136166```
137167
138168### Core Components
0 commit comments