Commit e31d8d7
[TRTLLM-11540][feat] Revert EAGLE3 dynamic tree speculative decoding support (NVIDIA#12062) (NVIDIA#13006)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
Co-authored-by: Yiqing Yan <yiqingy@nvidia.com>1 parent 1480140 commit e31d8d7
File tree
37 files changed
+414
-3571
lines changed- cpp/tensorrt_llm
- kernels
- speculativeDecoding
- trtllmGenKernels/fmha
- thop
- docs/source
- features
- models
- examples/llm-api
- tensorrt_llm
- _torch
- attention_backend
- sparse
- modules
- pyexecutor
- speculative
- llmapi
- tests
- integration/test_lists
- unittest
- _torch
- modeling
- speculative
- thop/parallel
- others
37 files changed
+414
-3571
lines changedLines changed: 0 additions & 356 deletions
This file was deleted.
0 commit comments