Skip to content
Discussion options

You must be logged in to vote

Hi @d-shehu ,

llama-bench doesn't share the argument parser used by llama-cli and llama-server. It's a standalone benchmarking tool with its own (intentionally smaller) option set, defined separately in tools/llama-bench/llama-bench.cpp rather than through the shared common args. So nothing is broken on your end — several of your flags either don't exist in bench or use different conventions, and the command just needs to be translated.

There are three separate issues in your command.

1. -ngl all must be a number

The flag is supported, but on b9222 bench's -ngl / --n-gpu-layers takes an integer <n> (default 99), not the auto / all keyword that cli/server accept. The error invalid paramete…

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
1 reply
@d-shehu
Comment options

Answer selected by d-shehu
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants