Fix a bug in flash attention where kv_seq_len should divide block_k_major. #10989
| Job | Run time |
|---|---|
| | 2s |
| | 3s |
| | 1h 11m 47s |
| | 1h 11m 48s |
| | 1m 43s |
| | 1m 45s |
| | 9m 17s |
| | 10m 0s |
| | 14m 48s |
| | 1h 13m 12s |
| | 18m 29s |
| | 18m 26s |
| | 6m 54s |
| | 6m 39s |
| | 11m 5s |
| | 11m 8s |
| | 13m 36s |
| | 12m 48s |
| | 5m 48s |
| | 5m 41s |
| | 6m 3s |
| | 6m 2s |
| Total | 6h 17m 4s |