llamafile update 20260202 #871
aittalam
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello everyone, here’s our latest update for llamafile!
Since our Christmas update that brought in the TUI and metal support in our WIP branch, we updated llama.cpp to f47edb8 (5 January 2026) and implemented several key improvements:
Overall this has been a great learning experience. We promised we would have shared our learnings too, so here’s more information for you. First of all, our process:
I personally also collected a lot of notes about llamafile: some of them started from an analysis of the old code and of the diffs wrt the old llama.cpp, others from bugs investigations, most have been at least partially generated by Claude (more info about this below). Here are a few I think could be interesting, let us know if you find them useful:
Last but not least, as I talked about using AI, I would like this to be as open as the code itself. I have found Claude useful not just for code analysis/writing or wrapping up my notes and code udpates into documents, but also in automating a set of tasks that I would otherwise repeat manually (e.g. “test this bugfix by rebuilding the code, running llamafile with these parameters and checking if and where it segfaults” 😅). In the spirit of transparency, we are sharing:
All the updates are in our WIP branch, and more recent alpha binaries are available here.
Beta Was this translation helpful? Give feedback.
All reactions