Pull requests: vllm-project/llm-compressor
Pull requests list
[dev] split fused moe experts to ensure quantization
documentation
Improvements or additions to documentation
quality-failed
#2464
opened Mar 11, 2026 by
liwei109
[Warnings] Replace deprecated is_fx_tracing with is_fx_tracing_symbolic_tracing
quality-failed
#2463
opened Mar 10, 2026 by
kylesayrs
[examples][awq] Update AWQ examples to stacked recipe pattern
documentation
Improvements or additions to documentation
#2461
opened Mar 10, 2026 by
dzhengAP
feat: defer activation qparam calculation to sequential epoch end
documentation
Improvements or additions to documentation
#2455
opened Mar 9, 2026 by
dzhengAP
[AWQ] Update MoE mappings to include router in balance layers
ready
When a PR is ready for review
#2451
opened Mar 6, 2026 by
brian-dellabetta
2 of 3 tasks
AWQ smooth layer quantization (v2) [not for land]
documentation
Improvements or additions to documentation
quality-failed
refactor(awq): restructure AWQModifier to be similar to SmoothQuantCl…
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2402
opened Feb 24, 2026 by
vishnuprasanth-j
[DDP][GPTQ] Fixes and Testing
documentation
Improvements or additions to documentation
needs-rebase
quality-failed
Feature/calibrate weights dfs fused modules
needs-rebase
#2394
opened Feb 23, 2026 by
GOavi101
Feature/intermediates cache prefetch
ready
When a PR is ready for review
#2392
opened Feb 22, 2026 by
GOavi101
[Distributed] Extend QuantizationModifier to support distributed activation calibration
documentation
Improvements or additions to documentation
#2391
opened Feb 22, 2026 by
Etelis
3 tasks done
perf: make MSE observer compatible with torch.compile
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2384
opened Feb 18, 2026 by
Bias92
feat: add Qwen3.5 MoE calibration module
documentation
Improvements or additions to documentation
nvfp4
For any PR / issue related to NVFP4 support
quality-failed
qwen
For any PR / issue related to Qwen support
ready
When a PR is ready for review
#2383
opened Feb 18, 2026 by
Sehyo
Add model_free_ptq example for glm 4.6 block fp8
documentation
Improvements or additions to documentation
#2343
opened Feb 10, 2026 by
mgoin
[MoE] MiniMax-M2/M2.1 calibration follow-up
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2335
opened Feb 6, 2026 by
LudovicoYIN
Add GSM8K evaluation script and AWQ+FP8 results
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2330
opened Feb 4, 2026 by
rtj1