Commit 5c527ff
Support overlap scheduling for speculative decoding
Co-authored-by: Nathan Wang <nathan.r.wang@gmail.com>1 parent 6b39f9c commit 5c527ff
File tree
12 files changed
+2605
-47
lines changed- python/sglang/srt
- disaggregation
- layers
- attention
- managers
- speculative
12 files changed
+2605
-47
lines changedLines changed: 25 additions & 7 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
3 | 3 | | |
| 4 | + | |
4 | 5 | | |
5 | 6 | | |
6 | 7 | | |
| |||
146 | 147 | | |
147 | 148 | | |
148 | 149 | | |
149 | | - | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
150 | 174 | | |
151 | | - | |
152 | | - | |
153 | | - | |
154 | | - | |
155 | | - | |
156 | | - | |
157 | 175 | | |
158 | 176 | | |
159 | 177 | | |
Lines changed: 13 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
| 3 | + | |
3 | 4 | | |
4 | 5 | | |
5 | 6 | | |
| |||
13 | 14 | | |
14 | 15 | | |
15 | 16 | | |
16 | | - | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
17 | 25 | | |
18 | 26 | | |
19 | 27 | | |
| |||
1894 | 1902 | | |
1895 | 1903 | | |
1896 | 1904 | | |
1897 | | - | |
| 1905 | + | |
| 1906 | + | |
| 1907 | + | |
| 1908 | + | |
1898 | 1909 | | |
1899 | 1910 | | |
1900 | 1911 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
59 | 59 | | |
60 | 60 | | |
61 | 61 | | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
62 | 65 | | |
63 | 66 | | |
64 | 67 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
108 | 108 | | |
109 | 109 | | |
110 | 110 | | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
111 | 114 | | |
112 | 115 | | |
113 | 116 | | |
| |||
903 | 906 | | |
904 | 907 | | |
905 | 908 | | |
| 909 | + | |
| 910 | + | |
906 | 911 | | |
907 | 912 | | |
908 | 913 | | |
| |||
1540 | 1545 | | |
1541 | 1546 | | |
1542 | 1547 | | |
1543 | | - | |
| 1548 | + | |
| 1549 | + | |
| 1550 | + | |
| 1551 | + | |
| 1552 | + | |
| 1553 | + | |
| 1554 | + | |
| 1555 | + | |
| 1556 | + | |
| 1557 | + | |
| 1558 | + | |
| 1559 | + | |
| 1560 | + | |
| 1561 | + | |
| 1562 | + | |
| 1563 | + | |
1544 | 1564 | | |
1545 | 1565 | | |
1546 | 1566 | | |
| |||
1648 | 1668 | | |
1649 | 1669 | | |
1650 | 1670 | | |
| 1671 | + | |
| 1672 | + | |
| 1673 | + | |
| 1674 | + | |
| 1675 | + | |
| 1676 | + | |
| 1677 | + | |
| 1678 | + | |
1651 | 1679 | | |
1652 | 1680 | | |
1653 | 1681 | | |
1654 | 1682 | | |
1655 | | - | |
| 1683 | + | |
| 1684 | + | |
1656 | 1685 | | |
1657 | 1686 | | |
1658 | 1687 | | |
| |||
1766 | 1795 | | |
1767 | 1796 | | |
1768 | 1797 | | |
| 1798 | + | |
1769 | 1799 | | |
1770 | 1800 | | |
1771 | 1801 | | |
| |||
1918 | 1948 | | |
1919 | 1949 | | |
1920 | 1950 | | |
| 1951 | + | |
| 1952 | + | |
1921 | 1953 | | |
1922 | 1954 | | |
1923 | 1955 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
317 | 317 | | |
318 | 318 | | |
319 | 319 | | |
320 | | - | |
| 320 | + | |
321 | 321 | | |
322 | 322 | | |
323 | 323 | | |
| |||
334 | 334 | | |
335 | 335 | | |
336 | 336 | | |
337 | | - | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
| 344 | + | |
338 | 345 | | |
339 | | - | |
| 346 | + | |
340 | 347 | | |
341 | 348 | | |
342 | 349 | | |
| |||
820 | 827 | | |
821 | 828 | | |
822 | 829 | | |
823 | | - | |
| 830 | + | |
| 831 | + | |
| 832 | + | |
| 833 | + | |
| 834 | + | |
824 | 835 | | |
825 | 836 | | |
826 | 837 | | |
827 | 838 | | |
828 | 839 | | |
829 | 840 | | |
830 | 841 | | |
831 | | - | |
| 842 | + | |
| 843 | + | |
| 844 | + | |
| 845 | + | |
| 846 | + | |
| 847 | + | |
| 848 | + | |
832 | 849 | | |
833 | 850 | | |
834 | 851 | | |
| |||
1789 | 1806 | | |
1790 | 1807 | | |
1791 | 1808 | | |
| 1809 | + | |
| 1810 | + | |
| 1811 | + | |
| 1812 | + | |
| 1813 | + | |
| 1814 | + | |
| 1815 | + | |
| 1816 | + | |
| 1817 | + | |
| 1818 | + | |
| 1819 | + | |
| 1820 | + | |
| 1821 | + | |
| 1822 | + | |
| 1823 | + | |
| 1824 | + | |
| 1825 | + | |
| 1826 | + | |
| 1827 | + | |
| 1828 | + | |
| 1829 | + | |
| 1830 | + | |
| 1831 | + | |
| 1832 | + | |
| 1833 | + | |
| 1834 | + | |
| 1835 | + | |
| 1836 | + | |
| 1837 | + | |
| 1838 | + | |
| 1839 | + | |
| 1840 | + | |
1792 | 1841 | | |
1793 | 1842 | | |
1794 | 1843 | | |
| |||
Lines changed: 55 additions & 14 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
54 | | - | |
55 | | - | |
56 | | - | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
57 | 62 | | |
58 | 63 | | |
59 | 64 | | |
| |||
205 | 210 | | |
206 | 211 | | |
207 | 212 | | |
208 | | - | |
209 | | - | |
210 | | - | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
| 220 | + | |
| 221 | + | |
| 222 | + | |
| 223 | + | |
| 224 | + | |
| 225 | + | |
| 226 | + | |
| 227 | + | |
| 228 | + | |
| 229 | + | |
| 230 | + | |
| 231 | + | |
211 | 232 | | |
212 | 233 | | |
213 | 234 | | |
| |||
217 | 238 | | |
218 | 239 | | |
219 | 240 | | |
| 241 | + | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
| 245 | + | |
| 246 | + | |
| 247 | + | |
| 248 | + | |
| 249 | + | |
| 250 | + | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
| 256 | + | |
| 257 | + | |
| 258 | + | |
| 259 | + | |
220 | 260 | | |
221 | 261 | | |
222 | 262 | | |
223 | | - | |
| 263 | + | |
| 264 | + | |
224 | 265 | | |
225 | 266 | | |
226 | 267 | | |
227 | 268 | | |
228 | 269 | | |
229 | 270 | | |
230 | | - | |
| 271 | + | |
231 | 272 | | |
232 | 273 | | |
233 | 274 | | |
234 | 275 | | |
235 | 276 | | |
236 | | - | |
237 | | - | |
238 | | - | |
| 277 | + | |
239 | 278 | | |
240 | 279 | | |
241 | | - | |
| 280 | + | |
242 | 281 | | |
243 | 282 | | |
244 | 283 | | |
| |||
247 | 286 | | |
248 | 287 | | |
249 | 288 | | |
250 | | - | |
251 | | - | |
| 289 | + | |
| 290 | + | |
| 291 | + | |
| 292 | + | |
252 | 293 | | |
253 | 294 | | |
254 | 295 | | |
| |||
0 commit comments