Feature Request: vlut.cpp

### Prerequisites

- [x] I am running the latest code. Mention the version if possible as well.
- [x] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md).
- [x] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
- [x] I reviewed the [Discussions](https://github.com/ggerganov/llama.cpp/discussions), and have a new and useful enhancement to share.

### Feature Description

This paper claims that it can achieve a 3 times speedup compared with TQ2_0 llama.cpp on CPU for bitnet using a LUT-based inference engine. Can you have a look and see if it can be integreted into the ik_llama,cpp engine.

Paper: https://arxiv.org/pdf/2512.06443
Code: https://github.com/Cipherxzc/vlut.cpp

### Motivation

This paper claims that it can achieve a 3 times speedup compared with TQ2_0 llama.cpp on CPU for bitnet using a LUT-based inference engine.

### Possible Implementation

Code: https://github.com/Cipherxzc/vlut.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature Request: vlut.cpp #1095

Prerequisites

Feature Description

Motivation

Possible Implementation

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Feature Request: vlut.cpp #1095

Description

Prerequisites

Feature Description

Motivation

Possible Implementation

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions