Commit 9b67422
Extract PQ4 kernels to includable headers in impl/pq_4bit/ (#4868)
Summary:
Move kernel templates from .cpp anonymous namespaces into includable headers,
parameterized on SIMDLevel SL. No behavior change — existing .cpp files include
the headers and instantiate with defaults.
New headers:
- kernels_simd256.h: multi-BB kernel (from search_1.cpp) + single-BB QBS
  256-bit kernel (from search_qbs.cpp non-AVX512 path)
- kernels_simd512.h: AVX512 nq1/nqx kernels + dispatcher (from search_qbs.cpp)
- decompose_qbs.h: unified kernel_accumulate_block<NQ, SL> that replaces
  #ifndef __AVX512F__ with if constexpr on SL, plus QBS decomposition logic
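As an illustration of the `#ifndef __AVX512F__` to `if constexpr` change described for decompose_qbs.h, here is a minimal sketch and not the actual kernel. The `SIMDLevel` enumerators, `toy_register_bytes`, and `toy_kernel_accumulate_block` are hypothetical stand-ins introduced only for this example. It assumes C++17.

```cpp
// Hypothetical stand-in for the SIMDLevel type named in the commit message;
// the real enumerators may differ.
enum class SIMDLevel { NONE, AVX2, AVX512 };

// Preprocessor style (before): the path is fixed per translation unit.
//   #ifndef __AVX512F__
//       ... 256-bit path ...
//   #else
//       ... 512-bit path ...
//   #endif

// Template style (after): the path is selected by the SL template argument.
// The branch not taken is discarded at compile time, so both variants can
// be instantiated from the same header.
template <SIMDLevel SL>
constexpr int toy_register_bytes() {
    if constexpr (SL == SIMDLevel::AVX512) {
        return 64; // 512-bit registers
    } else {
        return 32; // 256-bit registers
    }
}

// Toy "kernel": returns how many bytes NQ queries would touch per step.
// It only demonstrates the dispatch shape, not any real accumulation.
template <int NQ, SIMDLevel SL>
constexpr int toy_kernel_accumulate_block() {
    return NQ * toy_register_bytes<SL>();
}
```

A caller would then pick the variant explicitly, for example `toy_kernel_accumulate_block<2, SIMDLevel::AVX512>()`, instead of relying on which compiler flags the file was built with.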
Template param order: <int NQ, SIMDLevel SL, class ResultHandler, class Scaler>
to enable ergonomic SL propagation via kernel_accumulate_block<Q1, SL>(...).
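The parameter order matters because explicitly specified template arguments must come first, while trailing type parameters can be deduced from the call arguments. The sketch below shows that pattern under stated assumptions: `ToyHandler`, `ToyScaler`, `toy_accumulate_block`, `toy_dispatch`, and `toy_run` are hypothetical names, and the `SIMDLevel` enumerators are placeholders.

```cpp
// Hypothetical stand-in for the SIMDLevel type named in the commit message.
enum class SIMDLevel { NONE, AVX2, AVX512 };

// Toy result handler and scaler; the real classes are not shown in the commit.
struct ToyHandler {
    int total = 0;
    void handle(int v) { total += v; }
};

struct ToyScaler {
    int factor;
    int scale(int v) const { return v * factor; }
};

// Same parameter order as the commit: NQ and SL first, deduced types last.
template <int NQ, SIMDLevel SL, class ResultHandler, class Scaler>
void toy_accumulate_block(ResultHandler& res, const Scaler& scaler) {
    for (int q = 0; q < NQ; q++) {
        res.handle(scaler.scale(q + 1));
    }
}

// The dispatcher forwards SL by naming only the first two arguments;
// ResultHandler and Scaler are deduced from (res, scaler).
template <SIMDLevel SL, class ResultHandler, class Scaler>
void toy_dispatch(int nq, ResultHandler& res, const Scaler& scaler) {
    switch (nq) {
        case 1: toy_accumulate_block<1, SL>(res, scaler); break;
        case 2: toy_accumulate_block<2, SL>(res, scaler); break;
        default: break; // unsupported nq: do nothing in this toy
    }
}

// Small driver so the behaviour can be checked.
int toy_run(int nq, int factor) {
    ToyHandler handler;
    ToyScaler scaler{factor};
    toy_dispatch<SIMDLevel::AVX2>(nq, handler, scaler);
    return handler.total;
}
```

Had the type parameters come first, every call site would need to spell out the handler and scaler types before it could name `SL`.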
~900 lines moved (code motion), ~100 lines changed. Pure refactor.
Reviewed By: mdouze, mnorris11
Differential Revision: D953921551
parent d28354e, commit 9b67422
6 files changed
Lines changed: 864 additions & 801 deletions