Commit 805f29a
[Feature] refactor metax_gpu attention and moe and remove some useless code (PaddlePaddle#3688)
Co-authored-by: yongqiangma <xing.wo@163.com>1 parent cab7a63 commit 805f29a
File tree
5 files changed
+399
-299
lines changed- fastdeploy
- model_executor/layers
- backends/metax
- attention
- moe
- quantization
5 files changed
+399
-299
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
894 | 894 | | |
895 | 895 | | |
896 | 896 | | |
897 | | - | |
| 897 | + | |
898 | 898 | | |
899 | 899 | | |
900 | 900 | | |
| |||
0 commit comments