Commit 4854fd4
authored
add xpu tuning to FLJSD (#647)
## Summary
Tuning on XPU: In fused linear JSD, if device is xpu, set MAX_FUSED_SIZE
to 4096 instead of default 65536 // 2. This gives slightly better
performance on xpu.
Very similar to #645
## Testing Done
- Hardware Type: Intel(R) Data Center GPU Max 1550
- [x] run `make test` to ensure correctness
- [x] run `make checkstyle` to ensure code style
- [x] run `make test-convergence` to ensure convergence1 parent bebe030 commit 4854fd4
1 file changed
Lines changed: 2 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
| 11 | + | |
11 | 12 | | |
12 | 13 | | |
13 | 14 | | |
14 | 15 | | |
15 | | - | |
| 16 | + | |
16 | 17 | | |
17 | 18 | | |
18 | 19 | | |
| |||
0 commit comments