Releases: MoonRide303/llama.cpp
b5523
convert: small addition to support LlamaModel (#13838) Co-authored-by: dinhhuy <[email protected]>
b5289
CUDA: fix --split-mode row for MMQ (#13323)