Pull requests: vllm-project/llm-compressor
Add MoE calibration wrapper for GLM-4.7-Flash (Glm4MoeLiteMoE)
#2547
opened Mar 30, 2026 by
Nottlespike
•
Draft
Remove reindexing step from Mistral Large 3 FP8 example
documentation
Improvements or additions to documentation
#2530
opened Mar 27, 2026 by
omkar-334
[AWQ] AWQ as transform
quality-failed
#2526
opened Mar 26, 2026 by
brian-dellabetta
•
Draft
1 of 4 tasks
feat: add ActivationOrdering support for per-channel GPTQ quantization
needs-rebase
ready
When a PR is ready for review
#2525
opened Mar 26, 2026 by
matdou
[Docs] Add Developer Guides section
documentation
ready
#2517
opened Mar 25, 2026 by
dsikka
[AWQ] Restructure AWQModifier as smoothing-only, decouple from Quanti…
documentation
needs-rebase
#2511
opened Mar 24, 2026 by
colldata79
[Examples] Reorganize examples by model/scheme/algo hierarchy
documentation
needs-rebase
[Refactor] Rename offload_model and dispatch_for_sequential to set_onload_device
documentation
ready
#2501
opened Mar 23, 2026 by
nrmlthms
[Feature] Allow targeting multiples of sequential targets
quality-failed
ready
#2493
opened Mar 20, 2026 by
aayush7511
[AWQ] Add joint scale+shrinkage optimization to grid search
#2492
opened Mar 20, 2026 by
dzhengAP
DeepSeek V3.2 support
documentation
needs-rebase
#2491
opened Mar 19, 2026 by
brian-dellabetta
•
Draft
5 of 6 tasks
Add Attention Quantization Examples
documentation
ready
#2484
opened Mar 18, 2026 by
kylesayrs
[model_free_ptq] split fused moe experts to ensure quantization
documentation
model_free_ptq
For any PR/issue related to the `model_free_ptq` pathway
#2464
opened Mar 11, 2026 by
liwei109
[examples][awq] Update AWQ examples to stacked recipe pattern
documentation
#2461
opened Mar 10, 2026 by
dzhengAP
feat: defer activation qparam calculation to sequential epoch end
ready
#2455
opened Mar 9, 2026 by
dzhengAP
[AWQ] Update MoE mappings to include router in balance layers
ready
#2451
opened Mar 6, 2026 by
brian-dellabetta
2 of 3 tasks
[Docs] Update author from NeuralMagic to vLLM
dequeued
ready
#2444
opened Mar 4, 2026 by
kylesayrs