Commit 39472d8
Shanmugamr1992/megatron inference ultra (#3784)
Co-authored-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Co-authored-by: Robert Kirby <rkirby@nvidia.com>
Co-authored-by: Shanmugam Ramasamy <shanmugamr@cw-dfw-cs-001-login-01.cm.cluster>
Co-authored-by: rislam <rislam@nvidia.com>1 parent 8f539df commit 39472d8
File tree
4 files changed
+288
-53
lines changed- megatron/core
- inference
- engines
- text_generation_server/dynamic_text_gen_server/endpoints
- ssm
4 files changed
+288
-53
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
884 | 884 | | |
885 | 885 | | |
886 | 886 | | |
887 | | - | |
| 887 | + | |
| 888 | + | |
| 889 | + | |
| 890 | + | |
888 | 891 | | |
889 | 892 | | |
890 | 893 | | |
| |||
0 commit comments