Skip to content

Commit e372e7b

Browse files
committed
bump llama.cpp
1 parent f3b0bbb commit e372e7b

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

deps/llama.cpp

Submodule llama.cpp updated 370 files

shards/modules/llm/embedd.cpp

+1-1
Original file line numberDiff line numberDiff line change
@@ -334,7 +334,7 @@ struct Embed {
334334
}
335335
batch.n_tokens = tokens.size();
336336

337-
llama_kv_cache_clear(llmContext.ctx.get());
337+
llama_kv_self_clear(llmContext.ctx.get());
338338

339339
if (llama_model_has_encoder(model)) {
340340
if (llama_encode(llmContext.ctx.get(), batch) < 0) {

0 commit comments

Comments
 (0)