Pull requests: vllm-project/llm-compressor

Pull requests list

Add DeepSeekV3 support for spinquant
#2465 opened Mar 11, 2026 by carrot-o0o
[dev] split fused moe experts to ensure quantization
Labels: documentation, quality-failed
#2464 opened Mar 11, 2026 by liwei109
[Bugfix] QAC with basic pipeline
Labels: quality-failed
#2462 opened Mar 10, 2026 by kylesayrs
[examples][awq] Update AWQ examples to stacked recipe pattern
Labels: documentation
#2461 opened Mar 10, 2026 by dzhengAP
testing ddp + awq
Labels: documentation
#2457 opened Mar 10, 2026 by HDCharles Draft
feat: defer activation qparam calculation to sequential epoch end
Labels: documentation
#2455 opened Mar 9, 2026 by dzhengAP
[AWQ] Update MoE mappings to include router in balance layers
Labels: ready
#2451 opened Mar 6, 2026 by brian-dellabetta
2 of 3 tasks
[Docs] Update author from NeuralMagic to vLLM
#2444 opened Mar 4, 2026 by kylesayrs
AWQ smooth layer quantization (v2) [not for land]
Labels: documentation, quality-failed
#2431 opened Mar 3, 2026 by HDCharles Draft
refactor(awq): restructure AWQModifier to be similar to SmoothQuantCl…
Labels: documentation, ready
#2402 opened Feb 24, 2026 by vishnuprasanth-j
[DDP][GPTQ] Fixes and Testing
Labels: documentation, needs-rebase, quality-failed
#2400 opened Feb 24, 2026 by HDCharles Draft
Feature/intermediates cache prefetch
Labels: ready
#2392 opened Feb 22, 2026 by GOavi101
[Distributed] Extend QuantizationModifier to support distributed activation calibration
Labels: documentation
#2391 opened Feb 22, 2026 by Etelis
3 tasks done
perf: make MSE observer compatible with torch.compile
Labels: documentation, ready
#2384 opened Feb 18, 2026 by Bias92
feat: add Qwen3.5 MoE calibration module
Labels: documentation, nvfp4, quality-failed, qwen, ready
#2383 opened Feb 18, 2026 by Sehyo
Add model_free_ptq example for glm 4.6 block fp8
Labels: documentation
#2343 opened Feb 10, 2026 by mgoin
[MoE] MiniMax-M2/M2.1 calibration follow-up
Labels: documentation, ready
#2335 opened Feb 6, 2026 by LudovicoYIN
Add GSM8K evaluation script and AWQ+FP8 results
Labels: documentation, ready
#2330 opened Feb 4, 2026 by rtj1