Commit 82f0b97
Store raw hf_quant_config and convert at config.json write time
Restore `self._hf_quant_config` to hold the raw modelopt-native schema
(matching upstream main) and call `convert_hf_quant_config_format` inline
when writing `config.json["quantization_config"]` for newer vLLM. Drops
the temporary `raw_hf_quant_config` variable and trims the post-write
region to match main more closely.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

1 parent 4754fb0, commit 82f0b97
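
The pattern the commit message describes (keep the raw config on the instance, convert only at the point where `config.json` is written) can be sketched as follows. This is a minimal illustration, not the code from this commit: the `Exporter` class, the `write_config` method, and the field names inside the raw and converted dictionaries are assumptions, and the `convert_hf_quant_config_format` defined here is a hypothetical stand-in for the real converter, whose output format is not shown in this diff.

```python
import json
import tempfile
from pathlib import Path
from typing import Optional


def convert_hf_quant_config_format(raw: dict) -> dict:
    # Hypothetical stand-in for the real converter; the field names on both
    # sides are illustrative assumptions, not the actual schemas.
    quant = raw.get("quantization", {})
    return {
        "quant_method": "modelopt",
        "quant_algo": quant.get("quant_algo"),
        "ignore": list(quant.get("exclude_modules", [])),
    }


class Exporter:
    # Hypothetical class used only to show where the conversion happens.
    def __init__(self, raw_hf_quant_config: Optional[dict]):
        # Holds the raw schema for the lifetime of the object; it is never
        # overwritten with the converted form.
        self._hf_quant_config = raw_hf_quant_config

    def write_config(self, export_dir: str, model_config: dict) -> Path:
        config = dict(model_config)  # shallow copy, caller's dict is untouched
        if self._hf_quant_config is not None:
            # Conversion happens inline, at write time, with no temporary
            # variable holding a second copy of the raw config.
            config["quantization_config"] = convert_hf_quant_config_format(
                self._hf_quant_config
            )
        path = Path(export_dir) / "config.json"
        path.write_text(json.dumps(config, indent=2))
        return path


raw = {"quantization": {"quant_algo": "FP8", "exclude_modules": ["lm_head"]}}
exporter = Exporter(raw)
with tempfile.TemporaryDirectory() as tmp:
    written = json.loads(
        exporter.write_config(tmp, {"model_type": "llama"}).read_text()
    )
```

Because the attribute always holds the raw schema, any other consumer of `self._hf_quant_config` sees the same data regardless of whether `config.json` has been written yet.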
1 file changed
Lines changed: 5 additions & 7 deletions
Diff hunks (line contents were not captured):
@@ -346,19 +346,15 @@ (2 additions, 6 deletions)
@@ -378,7 +374,9 @@ (3 additions, 1 deletion)