Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
175 commits
Select commit Hold shift + click to select a range
ac77a9d
Github actions from https://github.com/Thireus/llama.cpp
Thireus Jun 2, 2025
3c7947b
Fix FATAL: Avoid pinning exact package versions. Use '~=' instead.
Thireus Jun 2, 2025
0e729ae
Fix FATAL: Avoid pinning exact package versions. Use '~=' instead.
Thireus Jun 2, 2025
709e60b
Fix FATAL: Avoid pinning exact package versions. Use '~=' instead.
Thireus Jun 2, 2025
5073c69
Remove incompatible CMAKE options
Thireus Jun 2, 2025
aa93732
Remove incompatible CMAKE options
Thireus Jun 2, 2025
d92d40e
Remove incompatible CMAKE options
Thireus Jun 2, 2025
209e106
Remove incompatible CMAKE options
Thireus Jun 2, 2025
d616bb0
Add missing CMakeLists
Thireus Jun 2, 2025
dca481e
Debug target list
Thireus Jun 2, 2025
ea9d13e
Remove target
Thireus Jun 2, 2025
4acf4cd
No comment
Thireus Jun 2, 2025
35601f8
Delete ggml/src/ggml-cuda/CMakeLists.txt
Thireus Jun 3, 2025
a77e304
Build Windows Cuda 12.8 only
Thireus Jun 3, 2025
96763ca
Fix Windows sysl
Thireus Jun 3, 2025
9d14372
Fix Windows hip
Thireus Jun 3, 2025
ea57437
Debug Ubuntu build - Check workspace contents
Thireus Jun 3, 2025
fa021b6
Fixing Ambiguous Overload C2668
Thireus Jun 3, 2025
6135832
Update iqk_quantize.cpp
Thireus Jun 3, 2025
37d796f
Update iqk_quantize.cpp
Thireus Jun 3, 2025
27fc1d2
Update iqk_quantize.cpp
Thireus Jun 3, 2025
ff7c80f
Update iqk_quantize.cpp
Thireus Jun 3, 2025
d498a98
Only Windows Cuda + -DGGML_SCHED_MAX_COPIES=1 -DGGML_CUDA_IQK_FORCE_…
Thireus Jun 3, 2025
f26fe36
Fix release and add -DGGML_AVX512=ON
Thireus Jun 3, 2025
6e44051
Merge pull request #1 from ikawrakow/main
Thireus Jun 3, 2025
4e1d64d
Merge pull request #2 from ikawrakow/main
Thireus Jun 5, 2025
7a123b8
Merge branch 'ikawrakow:main' into main
Thireus Jun 5, 2025
fa54b58
Check if ffn_up and ffn_gate are of the same type before using fmoe
Jun 6, 2025
23c3e73
Merge pull request #3 from ikawrakow/ik/check_up_gate_fmoe
Thireus Jun 6, 2025
b26d935
Revert "Check if ffn_up and ffn_gate are of the same type before usin…
Thireus Jun 6, 2025
e5fec55
Merge pull request #4 from Thireus/revert-3-ik/check_up_gate_fmoe
Thireus Jun 6, 2025
929c412
Merge pull request #5 from ikawrakow/main
Thireus Jun 6, 2025
32baab4
Revert "Revert "Check if ffn_up and ffn_gate are of the same type bef…
Thireus Jun 6, 2025
916c9c3
Merge pull request #6 from Thireus/revert-4-revert-3-ik/check_up_gate…
Thireus Jun 6, 2025
fdfad30
Just leave the check.
Jun 6, 2025
2a35bed
Revert "Revert "Revert "Check if ffn_up and ffn_gate are of the same …
Thireus Jun 6, 2025
611e41a
Merge pull request #7 from Thireus/revert-6-revert-4-revert-3-ik/chec…
Thireus Jun 6, 2025
c2acf90
Update llama.cpp - https://github.com/ikawrakow/ik_llama.cpp/pull/495
Thireus Jun 6, 2025
9451d1f
Update llama.cpp - https://github.com/ikawrakow/ik_llama.cpp/pull/495
Thireus Jun 6, 2025
5a8bb97
Merge pull request #8 from ikawrakow/ik/check_up_gate_fmoe
Thireus Jun 6, 2025
7139beb
Fix #499
Jun 6, 2025
3c8d9aa
Merge pull request #9 from ikawrakow/ik/fix_499
Thireus Jun 6, 2025
4cb6bab
Merge branch 'ikawrakow:main' into main
Thireus Jun 7, 2025
02272cd
Merge branch 'ikawrakow:main' into main
Thireus Jun 8, 2025
9578d32
Merge branch 'ikawrakow:main' into main
Thireus Jun 9, 2025
2075882
Revert Windows fixes for iqk_quantize.cpp
Thireus Jun 11, 2025
6beef4a
Merge branch 'ikawrakow:main' into main
Thireus Jun 11, 2025
44b2d01
Merge branch 'ikawrakow:main' into main
Thireus Jun 12, 2025
363cef3
Merge branch 'ikawrakow:main' into main
Thireus Jun 12, 2025
7869a94
Merge branch 'ikawrakow:main' into main
Thireus Jun 13, 2025
79fc7dd
Merge branch 'ikawrakow:main' into main
Thireus Jun 14, 2025
a042910
Merge branch 'ikawrakow:main' into main
Thireus Jun 17, 2025
805c936
Merge branch 'ikawrakow:main' into main
Thireus Jun 17, 2025
d116e0a
Merge branch 'ikawrakow:main' into main
Thireus Jun 18, 2025
832b72a
Merge branch 'ikawrakow:main' into main
Thireus Jun 18, 2025
e3bd1fd
Merge branch 'ikawrakow:main' into main
Thireus Jun 18, 2025
9c0499f
Merge branch 'ikawrakow:main' into main
Thireus Jun 19, 2025
00c907d
Merge branch 'ikawrakow:main' into main
Thireus Jun 19, 2025
999e349
Merge branch 'ikawrakow:main' into main
Thireus Jun 19, 2025
5cfbadf
Merge branch 'ikawrakow:main' into main
Thireus Jun 20, 2025
2b3efc8
Merge branch 'ikawrakow:main' into main
Thireus Jun 21, 2025
2d18dde
Merge branch 'ikawrakow:main' into main
Thireus Jun 22, 2025
661026c
Merge branch 'ikawrakow:main' into main
Thireus Jun 23, 2025
a664904
Update ggml.h
Thireus Jun 24, 2025
65dd65c
Update llama.cpp
Thireus Jun 24, 2025
d707367
Merge branch 'ikawrakow:main' into main
Thireus Jun 24, 2025
be6cd6b
Merge branch 'ikawrakow:main' into main
Thireus Jun 27, 2025
6399cb2
Merge branch 'ikawrakow:main' into main
Thireus Jun 27, 2025
39885f5
Merge branch 'ikawrakow:main' into main
Thireus Jul 2, 2025
98bc2bb
Update build.yml
Thireus Jul 2, 2025
930aacc
Update release.yml
Thireus Jul 2, 2025
b9891cb
Update build.yml
Thireus Jul 2, 2025
8f78e28
Update build.yml
Thireus Jul 2, 2025
a495a9d
Update release.yml
Thireus Jul 2, 2025
64373f8
Update build.yml
Thireus Jul 2, 2025
9181248
Update release.yml
Thireus Jul 2, 2025
d185f1f
Update release.yml
Thireus Jul 2, 2025
d12f9e2
Merge branch 'ikawrakow:main' into main
Thireus Jul 3, 2025
37dc9f8
Merge branch 'ikawrakow:main' into main
Thireus Jul 4, 2025
095ec69
Merge branch 'ikawrakow:main' into main
Thireus Jul 6, 2025
a66e943
Update release.yml
Thireus Jul 6, 2025
6493d72
Update arm64-windows-llvm.cmake
Thireus Jul 6, 2025
e5120dc
Add files via upload
Thireus Jul 6, 2025
66cc3da
Update arm64-windows-llvm.cmake
Thireus Jul 6, 2025
9b8ec51
Update release.yml
Thireus Jul 6, 2025
68b5494
Update x64-windows-llvm.cmake
Thireus Jul 6, 2025
88cdc86
Update x64-windows-llvm.cmake
Thireus Jul 6, 2025
8d43012
Update release.yml
Thireus Jul 6, 2025
0f33f92
Update x64-windows-llvm.cmake
Thireus Jul 6, 2025
014ceb6
Update release.yml
Thireus Jul 6, 2025
fc1b8db
Update release.yml
Thireus Jul 6, 2025
fd0fbcd
Merge branch 'ikawrakow:main' into main
Thireus Jul 6, 2025
def51f4
Update release.yml
Thireus Jul 6, 2025
30aea8b
Update release.yml
Thireus Jul 6, 2025
12e3111
Update release.yml
Thireus Jul 6, 2025
64b0e63
Update x64-windows-llvm.cmake
Thireus Jul 6, 2025
c46f8c1
Update release.yml
Thireus Jul 6, 2025
3858ede
Update iqk_quantize.cpp
Thireus Jul 6, 2025
0559159
Update release.yml
Thireus Jul 6, 2025
68d79df
Update iqk_quantize.cpp
Thireus Jul 6, 2025
dc5a9aa
Update iqk_quantize.cpp
Thireus Jul 6, 2025
d79498e
Update iqk_quantize.cpp
Thireus Jul 6, 2025
a274521
Update iqk_quantize.cpp
Thireus Jul 6, 2025
24388be
Update iqk_quantize.cpp
Thireus Jul 6, 2025
d404814
Update iqk_quantize.cpp
Thireus Jul 6, 2025
ad25cc0
Update release.yml
Thireus Jul 6, 2025
5e639fb
Update release.yml
Thireus Jul 6, 2025
8d281b7
Update release.yml
Thireus Jul 6, 2025
f10dfc1
Update release.yml
Thireus Jul 6, 2025
549e08a
Update release.yml
Thireus Jul 6, 2025
611fc93
Update x64-windows-llvm.cmake
Thireus Jul 6, 2025
590959f
Update x64-windows-llvm.cmake
Thireus Jul 6, 2025
c2e3ab7
Update release.yml
Thireus Jul 6, 2025
41a9c8a
Update release.yml
Thireus Jul 6, 2025
c269f41
Merge branch 'ikawrakow:main' into main
Thireus Jul 7, 2025
fa621a8
Merge branch 'ikawrakow:main' into main
Thireus Jul 10, 2025
18b0375
Merge branch 'ikawrakow:main' into main
Thireus Jul 11, 2025
a6db9c4
https://github.com/ikawrakow/ik_llama.cpp/pull/603
Thireus Jul 12, 2025
4a76a2c
Merge branch 'ikawrakow:main' into main
Thireus Jul 13, 2025
96ff200
Merge branch 'ikawrakow:main' into main
Thireus Jul 13, 2025
42bab92
Merge branch 'ikawrakow:main' into main
Thireus Jul 13, 2025
d9a21ee
Merge branch 'ikawrakow:main' into main
Thireus Jul 14, 2025
9128fc4
Merge branch 'ikawrakow:main' into main
Thireus Jul 15, 2025
8c2a6ee
Fix C2131: expression did not evaluate
Thireus Jul 15, 2025
1b097c7
Merge branch 'ikawrakow:main' into main
Thireus Jul 15, 2025
aa0aa02
Update build.yml
Thireus Jul 16, 2025
a60805a
Update ggml.h
Thireus Jul 16, 2025
edaba8e
Merge branch 'ikawrakow:main' into main
Thireus Jul 16, 2025
09d7fac
Update release.yml
Thireus Jul 17, 2025
a47f31f
Revert changes
Thireus Jul 17, 2025
4fbac8e
Update CMakeLists.txt
Thireus Jul 17, 2025
5b1b204
Update CMakeLists.txt
Thireus Jul 17, 2025
699dc2e
Update ggml.h
Thireus Jul 17, 2025
4c81b88
Update llama.cpp
Thireus Jul 17, 2025
118dd3e
Update CMakeLists.txt
Thireus Jul 17, 2025
5237653
Update CMakeLists.txt
Thireus Jul 17, 2025
2b7eaa0
Update CMakeLists.txt
Thireus Jul 17, 2025
27125b1
Update llama.cpp
Thireus Jul 17, 2025
378986d
Merge branch 'ikawrakow:main' into main
Thireus Jul 17, 2025
b407232
Merge branch 'ikawrakow:main' into main
Thireus Jul 18, 2025
ca20df1
Merge branch 'ikawrakow:main' into main
Thireus Jul 20, 2025
87fd730
Merge branch 'ikawrakow:main' into main
Thireus Jul 20, 2025
c90e8a1
Merge branch 'ikawrakow:main' into main
Thireus Jul 22, 2025
758a987
Merge branch 'ikawrakow:main' into main
Thireus Jul 22, 2025
607e01e
Merge branch 'ikawrakow:main' into main
Thireus Jul 23, 2025
642c2b0
Merge branch 'ikawrakow:main' into main
Thireus Jul 23, 2025
efab478
Merge branch 'ikawrakow:main' into main
Thireus Jul 24, 2025
bb4c917
Merge branch 'ikawrakow:main' into main
Thireus Jul 27, 2025
2e53b96
Update convert_hf_to_gguf.py
Thireus Jul 29, 2025
89ebd61
Update constants.py
Thireus Jul 29, 2025
f619be7
Update llama.cpp
Thireus Jul 29, 2025
662283f
Update llama.cpp
Thireus Jul 29, 2025
0c41931
Update llama.cpp
Thireus Jul 29, 2025
56f5b23
Update llama.cpp
Thireus Jul 29, 2025
8dc1c03
Update llama.cpp
Thireus Jul 29, 2025
92b3a36
Update llama.cpp
Thireus Jul 29, 2025
3012914
Update llama.cpp
Thireus Jul 29, 2025
3e252ff
Update llama.cpp
Thireus Jul 29, 2025
7a53691
Update llama.cpp
Thireus Jul 29, 2025
a855c75
Update llama.cpp
Thireus Jul 29, 2025
6882e83
Update llama.cpp
Thireus Jul 29, 2025
32026e8
Update llama.cpp
Thireus Jul 29, 2025
57d272d
Update llama.cpp
Thireus Jul 29, 2025
aacba11
Update llama.cpp
Thireus Jul 29, 2025
59b8b88
Update llama.cpp
Thireus Jul 29, 2025
299c23a
Update llama.cpp
Thireus Jul 29, 2025
019ff22
Update llama.cpp
Thireus Jul 29, 2025
fba5973
Update llama.cpp
Thireus Jul 29, 2025
d28c8a4
Update llama.cpp
Thireus Jul 29, 2025
ff0c368
Update llama.cpp
Thireus Jul 29, 2025
6258a12
Update llama.cpp
Thireus Jul 29, 2025
9b1fdff
Revert "Update llama.cpp"
Thireus Jul 29, 2025
746b87f
Update llama.cpp
Thireus Jul 29, 2025
9ce87a2
Update llama.cpp
Thireus Jul 29, 2025
92f5b07
Update llama.cpp
Thireus Jul 29, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
22 changes: 22 additions & 0 deletions .github/actions/get-tag-name/action.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
name: "Determine tag name"
description: "Determine the tag name to use for a release"
outputs:
name:
description: "The name of the tag"
value: ${{ steps.tag.outputs.name }}

runs:
using: "composite"
steps:
- name: Determine tag name
id: tag
shell: bash
run: |
BUILD_NUMBER="$(git rev-list --count HEAD)"
SHORT_HASH="$(git rev-parse --short=7 HEAD)"
if [[ "${{ env.BRANCH_NAME }}" == "master" ]]; then
echo "name=b${BUILD_NUMBER}" >> $GITHUB_OUTPUT
else
SAFE_NAME=$(echo "${{ env.BRANCH_NAME }}" | tr '/' '-')
echo "name=${SAFE_NAME}-b${BUILD_NUMBER}-${SHORT_HASH}" >> $GITHUB_OUTPUT
fi
67 changes: 67 additions & 0 deletions .github/actions/windows-setup-cuda/action.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,67 @@
name: "Windows - Setup CUDA Toolkit"
description: "Setup CUDA Toolkit for Windows"
inputs:
cuda_version:
description: "CUDA toolkit version"
required: true

runs:
using: "composite"
steps:
- name: Install Cuda Toolkit 11.7
if: ${{ inputs.cuda_version == '11.7' }}
shell: pwsh
run: |
mkdir -p "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7"
choco install unzip -y
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_cudart/windows-x86_64/cuda_cudart-windows-x86_64-11.7.99-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_nvcc/windows-x86_64/cuda_nvcc-windows-x86_64-11.7.99-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_nvrtc/windows-x86_64/cuda_nvrtc-windows-x86_64-11.7.99-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/libcublas/windows-x86_64/libcublas-windows-x86_64-11.7.4.6-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_nvtx/windows-x86_64/cuda_nvtx-windows-x86_64-11.7.91-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/visual_studio_integration/windows-x86_64/visual_studio_integration-windows-x86_64-11.7.91-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_nvprof/windows-x86_64/cuda_nvprof-windows-x86_64-11.7.101-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_cccl/windows-x86_64/cuda_cccl-windows-x86_64-11.7.91-archive.zip"
unzip '*.zip' -d "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7"
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\cuda_cudart-windows-x86_64-11.7.99-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\cuda_nvcc-windows-x86_64-11.7.99-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\cuda_nvrtc-windows-x86_64-11.7.99-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\libcublas-windows-x86_64-11.7.4.6-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\cuda_nvtx-windows-x86_64-11.7.91-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\visual_studio_integration-windows-x86_64-11.7.91-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\cuda_nvprof-windows-x86_64-11.7.101-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\cuda_cccl-windows-x86_64-11.7.91-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7" /E /I /H /Y
echo "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\bin" | Out-File -FilePath $env:GITHUB_PATH -Encoding utf8 -Append
echo "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7\libnvvp" | Out-File -FilePath $env:GITHUB_PATH -Encoding utf8 -Append
echo "CUDA_PATH=C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7" | Out-File -FilePath $env:GITHUB_ENV -Append -Encoding utf8
echo "CUDA_PATH_V11_7=C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.7" | Out-File -FilePath $env:GITHUB_ENV -Append -Encoding utf8

- name: Install Cuda Toolkit 12.8
if: ${{ inputs.cuda_version == '12.8' }}
shell: pwsh
run: |
mkdir -p "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8"
choco install unzip -y
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_cudart/windows-x86_64/cuda_cudart-windows-x86_64-12.8.90-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_nvcc/windows-x86_64/cuda_nvcc-windows-x86_64-12.8.93-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_nvrtc/windows-x86_64/cuda_nvrtc-windows-x86_64-12.8.93-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/libcublas/windows-x86_64/libcublas-windows-x86_64-12.8.4.1-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_nvtx/windows-x86_64/cuda_nvtx-windows-x86_64-12.8.90-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_profiler_api/windows-x86_64/cuda_profiler_api-windows-x86_64-12.8.90-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/visual_studio_integration/windows-x86_64/visual_studio_integration-windows-x86_64-12.8.90-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_nvprof/windows-x86_64/cuda_nvprof-windows-x86_64-12.8.90-archive.zip"
curl -O "https://developer.download.nvidia.com/compute/cuda/redist/cuda_cccl/windows-x86_64/cuda_cccl-windows-x86_64-12.8.90-archive.zip"
unzip '*.zip' -d "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8"
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\cuda_cudart-windows-x86_64-12.8.90-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\cuda_nvcc-windows-x86_64-12.8.93-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\cuda_nvrtc-windows-x86_64-12.8.93-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\libcublas-windows-x86_64-12.8.4.1-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\cuda_nvtx-windows-x86_64-12.8.90-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\cuda_profiler_api-windows-x86_64-12.8.90-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\visual_studio_integration-windows-x86_64-12.8.90-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\cuda_nvprof-windows-x86_64-12.8.90-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" /E /I /H /Y
xcopy "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\cuda_cccl-windows-x86_64-12.8.90-archive\*" "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" /E /I /H /Y
echo "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\bin" | Out-File -FilePath $env:GITHUB_PATH -Encoding utf8 -Append
echo "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8\libnvvp" | Out-File -FilePath $env:GITHUB_PATH -Encoding utf8 -Append
echo "CUDA_PATH=C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" | Out-File -FilePath $env:GITHUB_ENV -Append -Encoding utf8
echo "CUDA_PATH_V12_8=C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.8" | Out-File -FilePath $env:GITHUB_ENV -Append -Encoding utf8
30 changes: 30 additions & 0 deletions .github/actions/windows-setup-curl/action.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
name: 'Windows - Setup CURL'
description: 'Composite action, to be reused in other workflow'
inputs:
curl_version:
description: 'CURL version'
required: false
default: '8.6.0_6'
architecture:
description: 'Architecture of the libcurl to download'
required: false
default: 'win64'
outputs:
curl_path:
description: "Path to the downloaded libcurl"
value: ${{ steps.get_libcurl.outputs.curl_path }}

runs:
using: "composite"
steps:
- name: libCURL
id: get_libcurl
shell: powershell
env:
CURL_VERSION: ${{ inputs.curl_version }}
ARCHITECTURE: ${{ inputs.architecture }}
run: |
curl.exe -o $env:RUNNER_TEMP/curl.zip -L "https://curl.se/windows/dl-${env:CURL_VERSION}/curl-${env:CURL_VERSION}-${env:ARCHITECTURE}-mingw.zip"
mkdir $env:RUNNER_TEMP/libcurl
tar.exe -xvf $env:RUNNER_TEMP/curl.zip --strip-components=1 -C $env:RUNNER_TEMP/libcurl
echo "curl_path=$env:RUNNER_TEMP/libcurl" >> $env:GITHUB_OUTPUT
88 changes: 88 additions & 0 deletions .github/labeler.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,88 @@
# https://github.com/actions/labeler
Kompute:
- changed-files:
- any-glob-to-any-file:
- ggml/include/ggml-kompute.h
- ggml/src/ggml-kompute/**
- README-kompute.md
Apple Metal:
- changed-files:
- any-glob-to-any-file:
- ggml/include/ggml-metal.h
- ggml/src/ggml-metal/**
- README-metal.md
SYCL:
- changed-files:
- any-glob-to-any-file:
- ggml/include/ggml-sycl.h
- ggml/src/ggml-sycl/**
- docs/backend/SYCL.md
- examples/sycl/**
Nvidia GPU:
- changed-files:
- any-glob-to-any-file:
- ggml/include/ggml-cuda.h
- ggml/src/ggml-cuda/**
Vulkan:
- changed-files:
- any-glob-to-any-file:
- ggml/include/ggml-vulkan.h
- ggml/src/ggml-vulkan/**
documentation:
- changed-files:
- any-glob-to-any-file:
- docs/**
- media/**
testing:
- changed-files:
- any-glob-to-any-file:
- tests/**
build:
- changed-files:
- any-glob-to-any-file:
- cmake/**
- CMakeLists.txt
- CMakePresets.json
examples:
- changed-files:
- any-glob-to-any-file:
- examples/**
- tools/**
devops:
- changed-files:
- any-glob-to-any-file:
- .devops/**
- .github/**
- ci/**
python:
- changed-files:
- any-glob-to-any-file:
- "**/*.py"
- requirements/**
- gguf-py/**
- .flake8
script:
- changed-files:
- any-glob-to-any-file:
- scripts/**
android:
- changed-files:
- any-glob-to-any-file:
- examples/llama.android/**
server:
- changed-files:
- any-glob-to-any-file:
- tools/server/**
ggml:
- changed-files:
- any-glob-to-any-file:
- ggml/**
nix:
- changed-files:
- any-glob-to-any-file:
- "**/*.nix"
- .github/workflows/nix-*.yml
- .devops/nix/nixpkgs-instances.nix
embedding:
- changed-files:
- any-glob-to-any-file: examples/embedding/
Loading
Loading