
Conversation

@Giuseppe5 (Collaborator) commented on Jan 24, 2026

Reason for this PR

Initial support for vLLM export.

To do:

  • Check that input/output quantization works as intended
  • Test multiple quantizers
  • Improve quantizers interface
  • Support for rotation (and SmoothQuant?)

Changes Made in this PR

We re-use the existing inference quantizers for vLLM as well.
This is still fake-quantization style, but it should be faster than plain torch execution, even in eager mode.

The same template could easily be extended to support real quantization, torch.compile, and so on.
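For context, fake quantization keeps tensors in floating point while snapping their values to the integer grid, so existing float kernels can be reused. A minimal sketch of the idea; the `fake_quantize` helper below is illustrative and not the actual quantizer interface in this PR:

```python
import torch

def fake_quantize(x: torch.Tensor, scale: torch.Tensor, bits: int = 8) -> torch.Tensor:
    # Quantize-dequantize in floating point: values are snapped to the
    # integer grid, but the tensor keeps its original dtype.
    qmin, qmax = -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    q = torch.clamp(torch.round(x / scale), qmin, qmax)
    return q * scale

# Because the output stays in float, it can flow through any existing
# float kernel; this is why it can beat per-op integer emulation in
# eager mode while still matching the quantized model's numerics.
w = torch.randn(4, 4)
scale = w.abs().max() / 127  # simple symmetric int8 scale
w_fq = fake_quantize(w, scale)
```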

Testing Summary

TBD

@Giuseppe5 force-pushed the vllm_export branch 2 times, most recently from 4e3e36a to 7d4a78c on January 27, 2026.

Excerpt from the requirements file under review:
torch>=2.4
tqdm
transformers[sentencepiece]<5.0
vllm
A collaborator commented:
I feel like vLLM should be an optional dependency.
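One common way to do that is a packaging extra, so vLLM is only pulled in on request. A sketch of the pattern; the package name and extra are illustrative, not the repo's actual packaging configuration:

```python
# setup.py sketch (illustrative only).
from setuptools import setup

setup(
    name="example-package",
    extras_require={
        # vLLM is only installed via `pip install example-package[vllm]`
        "vllm": ["vllm"],
    },
)
```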

Giuseppe5 (author) replied:

Maybe we can do it in a similar way to what we did for lighteval/lm_eval.
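For reference, handling an optional dependency this way presumably amounts to a guarded import. A sketch under that assumption; `export_to_vllm` is a hypothetical entry point, not this PR's actual code:

```python
# Guarded-import pattern (assumption: mirrors how lighteval/lm_eval are
# treated; not taken from the repo).
try:
    import vllm  # noqa: F401
    VLLM_AVAILABLE = True
except ImportError:
    VLLM_AVAILABLE = False

def export_to_vllm(model):
    # Fail with a clear message only when the optional dependency is
    # actually needed, rather than at import time.
    if not VLLM_AVAILABLE:
        raise ImportError(
            "vLLM export requires the optional `vllm` package; "
            "install it with `pip install vllm`."
        )
    ...
```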

Giuseppe5 (author) replied on Jan 29, 2026:

I'm leaving it for now so that the tests run and I can see what else I'm breaking in the process, but I'll remove it before this PR is merged.
