Skip to content

Actions: ggml-org/llama.cpp

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
103,493 workflow runs
103,493 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

CUDA: optimize FA for GQA + large batches
Python Type-Check #1882: Pull request #12014 opened by JohannesGaessler
February 21, 2025 22:19 1m 9s JohannesGaessler:cuda-fa-mma-23
February 21, 2025 22:19 1m 9s
CUDA: optimize FA for GQA + large batches
EditorConfig Checker #22057: Pull request #12014 opened by JohannesGaessler
February 21, 2025 22:19 19s JohannesGaessler:cuda-fa-mma-23
February 21, 2025 22:19 19s
CUDA: optimize FA for GQA + large batches
Pull Request Labeler #8397: Pull request #12014 opened by JohannesGaessler
February 21, 2025 22:19 17s
February 21, 2025 22:19 17s
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Server #11057: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
EditorConfig Checker #22056: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
CI #19654: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Pull Request Labeler #8396: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 2m 9s
February 21, 2025 21:51 2m 9s
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
CI #19653: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Server #11056: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
EditorConfig Checker #22055: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Pull Request Labeler #8395: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 2m 7s
February 21, 2025 21:51 2m 7s
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
EditorConfig Checker #22054: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
CI #19652: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Server #11055: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Pull Request Labeler #8394: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 2m 3s
February 21, 2025 21:51 2m 3s
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
CI #19651: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Server #11054: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
EditorConfig Checker #22053: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
February 21, 2025 21:51 Action required
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Pull Request Labeler #8393: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 1m 56s
February 21, 2025 21:51 1m 56s
Add Granite Vision Support
Server #11053: Pull request #11794 synchronize by alex-jw-brooks
February 21, 2025 21:48 9m 20s alex-jw-brooks:granite_vision
February 21, 2025 21:48 9m 20s
Add Granite Vision Support
flake8 Lint #17498: Pull request #11794 synchronize by alex-jw-brooks
February 21, 2025 21:48 18s alex-jw-brooks:granite_vision
February 21, 2025 21:48 18s
Add Granite Vision Support
CI #19650: Pull request #11794 synchronize by alex-jw-brooks
February 21, 2025 21:48 45m 25s alex-jw-brooks:granite_vision
February 21, 2025 21:48 45m 25s