Commit 02adf7a
authored
[CHUNK_PREFILL] kernel refactor using new api (vllm-project#76)
* clang-format
Signed-off-by: Yizhou Wang <yizhou.wang@intel.com>
* add varlen
Signed-off-by: Yizhou Wang <yizhou.wang@intel.com>
* add page
Signed-off-by: Yizhou Wang <yizhou.wang@intel.com>
* fix acc issue
Signed-off-by: Yizhou Wang <yizhou.wang@intel.com>
* use half/bf16
Signed-off-by: Yizhou Wang <yizhou.wang@intel.com>
* remove debug code
Signed-off-by: Yizhou Wang <yizhou.wang@intel.com>
* fix ut
Signed-off-by: Yizhou Wang <yizhou.wang@intel.com>
* ignore fused_moe ut
Signed-off-by: Yizhou Wang <yizhou.wang@intel.com>
* fix pre-commit
Signed-off-by: Yizhou Wang <yizhou.wang@intel.com>
---------
Signed-off-by: Yizhou Wang <yizhou.wang@intel.com>1 parent 4b50ec9 commit 02adf7a
13 files changed
Lines changed: 1286 additions & 1974 deletions
File tree
- .github/workflows
- csrc
- flash_attn
- xpu
- cutlass_kernels
- collective
- tests/flash_attn
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
31 | 31 | | |
32 | 32 | | |
33 | 33 | | |
34 | | - | |
| 34 | + | |
35 | 35 | | |
36 | 36 | | |
37 | 37 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
171 | 171 | | |
172 | 172 | | |
173 | 173 | | |
174 | | - | |
| 174 | + | |
175 | 175 | | |
176 | 176 | | |
177 | 177 | | |
178 | 178 | | |
179 | | - | |
| 179 | + | |
180 | 180 | | |
181 | 181 | | |
182 | 182 | | |
| |||
195 | 195 | | |
196 | 196 | | |
197 | 197 | | |
198 | | - | |
199 | | - | |
200 | 198 | | |
201 | 199 | | |
202 | 200 | | |
| |||
205 | 203 | | |
206 | 204 | | |
207 | 205 | | |
| 206 | + | |
208 | 207 | | |
209 | 208 | | |
210 | 209 | | |
| |||
277 | 276 | | |
278 | 277 | | |
279 | 278 | | |
280 | | - | |
| 279 | + | |
281 | 280 | | |
282 | 281 | | |
283 | 282 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
81 | 81 | | |
82 | 82 | | |
83 | 83 | | |
| 84 | + | |
| 85 | + | |
84 | 86 | | |
85 | 87 | | |
86 | 88 | | |
| |||
99 | 101 | | |
100 | 102 | | |
101 | 103 | | |
| 104 | + | |
| 105 | + | |
102 | 106 | | |
103 | 107 | | |
104 | 108 | | |
| |||
0 commit comments