Releases: ggml-org/llama.cpp

b4722

15 Feb 15:14
68ff663
repo : update links to new url (#11886)

* repo : update links to new url

ggml-ci

* cont : more urls

ggml-ci

b4721

15 Feb 10:45
f355229
server: fix type promotion typo causing crashes w/ --jinja w/o tools …

b4720

15 Feb 08:34
fc1b0d0
vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

* vulkan: initial support for IQ1_S and IQ1_M quantizations

* vulkan: define MMV kernels for IQ1 quantizations

* devops: increase timeout of Vulkan tests again

* vulkan: simplify ifdef for init_iq_shmem

b4719

14 Feb 21:22
89daa25
llguidance build fixes for Windows (#11664)

* setup windows linking for llguidance; thanks @phil-scott-78

* add build instructions for windows and update script link

* change VS Community link from DE to EN

* whitespace fix

b4718

14 Feb 19:50
300907b
opencl: Fix rope and softmax (#11833)

* opencl: fix `ROPE`

* opencl: fix `SOFT_MAX`

* Add fp16 variant

* opencl: enforce subgroup size for `soft_max`

b4717

14 Feb 15:29
94b87f8
cuda : add ampere to the list of default architectures (#11870)

b4716

14 Feb 13:28
dbc2ec5
docker : drop to CUDA 12.4 (#11869)

* docker : drop to CUDA 12.4

* docker : update readme [no ci]

b4714

14 Feb 09:24
38e32eb
ggml: optimize some vec dot functions for LoongArch ASX (#11842)

* Optimize ggml_vec_dot_q3_K_q8_K for LoongArch ASX

* Optimize ggml_vec_dot_q4_K_q8_K for LoongArch ASX

* Optimize ggml_vec_dot_q6_K_q8_K for LoongArch ASX

* Optimize ggml_vec_dot_q5_K_q8_K for LoongArch ASX

* Optimize ggml_vec_dot_q2_K_q8_K for LoongArch ASX

* Optimize mul_sum_i8_pairs_float for LoongArch ASX

* Optimize ggml_vec_dot_iq4_xs_q8_K for LoongArch ASX

b4713

14 Feb 03:36
a4f011e
vulkan: linux builds + small subgroup size fixes (#11767)

* mm subgroup size

* upload vulkan x86 builds

b4712

14 Feb 01:45
a7b8ce2
llama-bench : fix unexpected global variable initialize sequence issu…