You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Update llama.cpp to b908baf1825b1a89afef87b09e22c32af2ca6548 (#888)
* Update llama.cpp submodule to b908baf1825b1a89afef87b09e22c32af2ca6548
Updates patches and integration code for new llama.cpp version:
- Regenerated all patches for updated upstream code
- Added common_ngram-mod.cpp.patch (adds #include <algorithm>)
- Added vendor_cpp-httplib_httplib.cpp.patch (XNU futex workaround moved from .h)
- Added common/license.cpp stub for LICENSES symbol
- Removed obsolete vendor_minja_minja.hpp.patch (jinja now built-in)
- Removed obsolete vendor_cpp-httplib_httplib.h.patch (code moved to .cpp)
- Updated chatbot.h/cpp for common_chat_syntax -> common_chat_parser_params rename
- Removed minja test from tests/BUILD.mk
* Updated license.cpp with the one generated by cmake in upstream llama.cpp
* Updated info about license.cpp in patches' README
* Remove minja from tests
* Updated refs to minja in docs
Copy file name to clipboardExpand all lines: llama.cpp.patches/README.md
+5-3Lines changed: 5 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -11,7 +11,9 @@ llama.cpp.patches/
11
11
├── renames.sh # Script for file renames/moves (if any)
12
12
├── llamafile-files/ # Additional files to copy into llama.cpp
13
13
│ ├── BUILD.mk # Makefile for building llama.cpp with cosmocc
14
-
│ └── README.llamafile # License and modification notes
14
+
│ ├── README.llamafile # License and modification notes
15
+
│ └── common/
16
+
│ └── license.cpp # Llama.cpp's license file (cmake creates this at build time)
15
17
└── patches/ # Patch files for upstream sources
16
18
```
17
19
@@ -40,6 +42,7 @@ These patches address compatibility issues when building with Cosmopolitan libc
40
42
|`common_arg.cpp.patch`| Adds `COSMOCC` platform detection for `PATH_MAX` (includes `linux/limits.h`) |
41
43
|`common_common.cpp.patch`| Adds platform-aware cache directory detection for Cosmopolitan (checks `LOCALAPPDATA`, `XDG_CACHE_HOME`, falls back to `~/.cache/`) |
42
44
|`common_download.cpp.patch`| Adds `COSMOCC` platform detection for `PATH_MAX`|
45
+
|`common_ngram-mod.cpp.patch`| Adds missing `#include <algorithm>` for `std::fill`|
43
46
44
47
### Threading and Signal Handling
45
48
@@ -49,7 +52,7 @@ Cosmopolitan libc has specific behaviors with condition variables and signals th
49
52
|-------|-------------|
50
53
|`common_log.cpp.patch`| Blocks `SIGINT`/`SIGTERM` on logger thread to prevent `EINTR` exceptions; uses `wait_for()` instead of `wait()` to work around XNU futex timeout bug (~72 minute expiry) |
51
54
|`tools_server_server-queue.cpp.patch`| Same threading fixes for server queue: signal masking and `wait_for()` timeouts |
52
-
|`vendor_cpp-httplib_httplib.h.patch`| Fixes httplib thread pool with `wait_for()` instead of `wait()` for XNU futex compatibility |
55
+
|`vendor_cpp-httplib_httplib.cpp.patch`| Fixes httplib thread pool with `wait_for()` instead of `wait()` for XNU futex compatibility |
53
56
54
57
### Cross-Module Memory Management
55
58
@@ -93,7 +96,6 @@ These patches integrate llamafile's file handling APIs for loading models from b
93
96
| Patch | Description |
94
97
|-------|-------------|
95
98
|`vendor_miniaudio_miniaudio.h.patch`| Removes `__COSMOPOLITAN__` from Windows platform detection (Cosmopolitan handles this at runtime) |
96
-
|`vendor_minja_minja.hpp.patch`| Replaces regex-based Jinja comment parsing with manual parsing to prevent stack overflow on large templates |
0 commit comments