Commit 7f903f7
Tomer Natan
Add BF16 trtllm-gen MoE: activation_type + routing_replay_out + new BMM cubins
Cherry-pick of flashinfer-ai#2864 (squashed) plus:
- activation_type param for trtllm_bf16_moe/trtllm_bf16_routed_moe (Swiglu=3, Relu2=6)
- routing_replay_out param for BF16 kernel (same pattern as FP8)
- Updated batched GEMM artifacts and checksums
- validateAndCastActivationType for safety
- Bf16MoeLauncher::init accepts ActivationType + routing_replay_out1 parent f8eb66b commit 7f903f7
4 files changed
Lines changed: 49 additions & 12 deletions
File tree
- csrc
- flashinfer
- fused_moe
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
68 | 68 | | |
69 | 69 | | |
70 | 70 | | |
71 | | - | |
72 | | - | |
| 71 | + | |
| 72 | + | |
73 | 73 | | |
74 | 74 | | |
75 | 75 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
476 | 476 | | |
477 | 477 | | |
478 | 478 | | |
479 | | - | |
480 | | - | |
481 | | - | |
482 | | - | |
| 479 | + | |
483 | 480 | | |
484 | 481 | | |
485 | 482 | | |
| |||
1670 | 1667 | | |
1671 | 1668 | | |
1672 | 1669 | | |
1673 | | - | |
| 1670 | + | |
| 1671 | + | |
1674 | 1672 | | |
1675 | 1673 | | |
1676 | 1674 | | |
| |||
1686 | 1684 | | |
1687 | 1685 | | |
1688 | 1686 | | |
| 1687 | + | |
| 1688 | + | |
| 1689 | + | |
| 1690 | + | |
| 1691 | + | |
| 1692 | + | |
| 1693 | + | |
| 1694 | + | |
| 1695 | + | |
| 1696 | + | |
| 1697 | + | |
| 1698 | + | |
| 1699 | + | |
| 1700 | + | |
1689 | 1701 | | |
1690 | 1702 | | |
1691 | 1703 | | |
| |||
1719 | 1731 | | |
1720 | 1732 | | |
1721 | 1733 | | |
1722 | | - | |
| 1734 | + | |
| 1735 | + | |
1723 | 1736 | | |
1724 | 1737 | | |
1725 | 1738 | | |
| |||
1751 | 1764 | | |
1752 | 1765 | | |
1753 | 1766 | | |
1754 | | - | |
| 1767 | + | |
1755 | 1768 | | |
1756 | 1769 | | |
1757 | 1770 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
137 | 137 | | |
138 | 138 | | |
139 | 139 | | |
140 | | - | |
| 140 | + | |
141 | 141 | | |
142 | 142 | | |
143 | 143 | | |
| |||
158 | 158 | | |
159 | 159 | | |
160 | 160 | | |
161 | | - | |
| 161 | + | |
162 | 162 | | |
163 | 163 | | |
164 | 164 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1132 | 1132 | | |
1133 | 1133 | | |
1134 | 1134 | | |
| 1135 | + | |
| 1136 | + | |
1135 | 1137 | | |
1136 | 1138 | | |
1137 | 1139 | | |
| |||
1339 | 1341 | | |
1340 | 1342 | | |
1341 | 1343 | | |
| 1344 | + | |
| 1345 | + | |
1342 | 1346 | | |
1343 | 1347 | | |
1344 | 1348 | | |
| |||
1387 | 1391 | | |
1388 | 1392 | | |
1389 | 1393 | | |
1390 | | - | |
| 1394 | + | |
1391 | 1395 | | |
1392 | 1396 | | |
1393 | 1397 | | |
| |||
1411 | 1415 | | |
1412 | 1416 | | |
1413 | 1417 | | |
| 1418 | + | |
| 1419 | + | |
1414 | 1420 | | |
1415 | 1421 | | |
1416 | 1422 | | |
| |||
1437 | 1443 | | |
1438 | 1444 | | |
1439 | 1445 | | |
| 1446 | + | |
| 1447 | + | |
1440 | 1448 | | |
1441 | 1449 | | |
1442 | 1450 | | |
| |||
1469 | 1477 | | |
1470 | 1478 | | |
1471 | 1479 | | |
| 1480 | + | |
| 1481 | + | |
1472 | 1482 | | |
1473 | 1483 | | |
1474 | 1484 | | |
| |||
2265 | 2275 | | |
2266 | 2276 | | |
2267 | 2277 | | |
| 2278 | + | |
| 2279 | + | |
2268 | 2280 | | |
2269 | 2281 | | |
2270 | 2282 | | |
| |||
2302 | 2314 | | |
2303 | 2315 | | |
2304 | 2316 | | |
| 2317 | + | |
| 2318 | + | |
| 2319 | + | |
2305 | 2320 | | |
2306 | 2321 | | |
2307 | 2322 | | |
| |||
2329 | 2344 | | |
2330 | 2345 | | |
2331 | 2346 | | |
| 2347 | + | |
| 2348 | + | |
2332 | 2349 | | |
2333 | 2350 | | |
2334 | 2351 | | |
| |||
2360 | 2377 | | |
2361 | 2378 | | |
2362 | 2379 | | |
| 2380 | + | |
| 2381 | + | |
2363 | 2382 | | |
2364 | 2383 | | |
2365 | 2384 | | |
| |||
2396 | 2415 | | |
2397 | 2416 | | |
2398 | 2417 | | |
| 2418 | + | |
| 2419 | + | |
| 2420 | + | |
2399 | 2421 | | |
2400 | 2422 | | |
2401 | 2423 | | |
| |||
2423 | 2445 | | |
2424 | 2446 | | |
2425 | 2447 | | |
| 2448 | + | |
| 2449 | + | |
2426 | 2450 | | |
2427 | 2451 | | |
2428 | 2452 | | |
| |||
0 commit comments