Skip to content

Conversation

@gkisalapl
Copy link
Contributor

No description provided.

mnalewaj and others added 2 commits October 24, 2025 16:02
- Quantization with unaligned data. Tests for debug int4 kernels GEMM and GEMV
- Added temporary CPU implementation for unaligned input data in INT4 GEMM
- Fix for quantization and dequantization of INT4 data format
- Add better support for unaligned N size - good results for N divisible by 8

**Self evaluation:**
1. Build test:     [X]Passed [ ]Failed [ ]Skipped
2. Run test:     [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Maciej Nalewaj <[email protected]>
Signed-off-by: Grzegorz Kisala <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants