Commit 96fbe00

model : fix llama_model::n_gpu_layers() (#24188)
1 parent 2016bf2 commit 96fbe00

1 file changed

Lines changed: 2 additions & 1 deletion

src/llama-model.cpp

@@ -1636,7 +1636,8 @@ const float * llama_model::tensor_split() const {
 }
 
 uint32_t llama_model::n_gpu_layers() const {
-    return params.n_gpu_layers >= 0 ? params.n_gpu_layers : hparams.n_layer() + 1;
+    // note: plus 1 for the "output" layer
+    return params.n_gpu_layers >= 0 ? params.n_gpu_layers : hparams.n_layer_all + 1;
 }
 
 llama_split_mode llama_model::split_mode() const {
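
The fallback in the patched line can be read in isolation. The following is a minimal sketch, not the project's code: `params_sketch`, `hparams_sketch`, and `resolve_n_gpu_layers` are hypothetical names, the field types are assumptions, and the layer counts are made-up example values. It only mirrors the expression shown in the diff: a non-negative `n_gpu_layers` is returned as requested, and a negative one resolves to `n_layer_all + 1`, where the extra 1 accounts for the "output" layer.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for the real structures; only the two fields
// that appear in the diff are modeled, and their types are assumptions.
struct hparams_sketch {
    uint32_t n_layer_all; // layer count read by the patched line
};

struct params_sketch {
    int32_t n_gpu_layers; // a negative value takes the fallback branch
};

// Mirrors the patched expression: return an explicit non-negative request
// unchanged, otherwise all layers plus 1 for the "output" layer.
constexpr uint32_t resolve_n_gpu_layers(const params_sketch & params, const hparams_sketch & hparams) {
    return params.n_gpu_layers >= 0
        ? static_cast<uint32_t>(params.n_gpu_layers)
        : hparams.n_layer_all + 1;
}

// Example values are illustrative only.
inline void check_examples() {
    assert(resolve_n_gpu_layers({-1}, {32}) == 33u); // fallback: 32 + 1
    assert(resolve_n_gpu_layers({10}, {32}) == 10u); // explicit request kept
    assert(resolve_n_gpu_layers({0},  {32}) == 0u);  // zero is a valid request
}
```

Note that zero counts as an explicit request here, because the condition in the diff is `>= 0`; only negative values trigger the fallback.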