Commit a6dccf2

Add SDPA LLM inference for Qwen3.5 model support.
Signed-off-by: Zhang, Xiaolin <xiaolin.zhang@intel.com>
1 parent 390e6fa commit a6dccf2

3 files changed, +586 -0 lines changed

src/cpp/src/module_genai/module_type.hpp

Lines changed: 1 addition & 0 deletions
@@ -31,6 +31,7 @@ namespace module {
     X(LLMInferenceModule, 40) \
     X(DenoiserLoopModule, 41) \
     X(VAEDecoderModule, 42) \
+    X(LLMInferenceSDPAModule, 43) \
     X(ResultModule, 50) \
     X(SaveImageModule, 51) \
     X(SaveVideoModule, 52) \
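
For readers unfamiliar with the pattern, the `X(name, value)` entries in this hunk look like an X-macro list, where one list is expanded several times with different definitions of `X`. The following is a minimal sketch of how such a list is commonly consumed, not the project's actual code: `EXAMPLE_MODULE_TYPES`, `ExampleModuleType`, and `to_string` are hypothetical names, and only the seven entries visible in the hunk are reproduced. It assumes C++17.

```cpp
#include <string_view>

// Hypothetical list macro mirroring only the entries visible in the diff.
// The real macro name and the entries outside the hunk are not shown in the source.
#define EXAMPLE_MODULE_TYPES(X) \
    X(LLMInferenceModule, 40) \
    X(DenoiserLoopModule, 41) \
    X(VAEDecoderModule, 42) \
    X(LLMInferenceSDPAModule, 43) \
    X(ResultModule, 50) \
    X(SaveImageModule, 51) \
    X(SaveVideoModule, 52)

// First expansion: each entry becomes an enumerator with an explicit value.
enum class ExampleModuleType : int {
#define X(name, value) name = value,
    EXAMPLE_MODULE_TYPES(X)
#undef X
};

// Second expansion: the same list becomes a switch mapping enumerators to names.
constexpr std::string_view to_string(ExampleModuleType type) {
    switch (type) {
#define X(name, value) case ExampleModuleType::name: return #name;
        EXAMPLE_MODULE_TYPES(X)
#undef X
    }
    return "Unknown";
}

// The newly added entry keeps its explicit value.
static_assert(static_cast<int>(ExampleModuleType::LLMInferenceSDPAModule) == 43);
```

With this layout, adding a module type is a one-line change to the list, which matches the single added line in this file. The explicit values (43 slotted between 42 and 50) keep existing enumerators stable when new ones are added.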
