Fix QuantState and dict conversions by cyyever · Pull Request #1729 · bitsandbytes-foundation/bitsandbytes

cyyever · 2025-08-18T06:48:53Z

Fix the failure of QuantState.as_dict then followed by QuantState.from_dict.
A reproducing example is

import torch

from bitsandbytes.functional import (
    QuantState,
    dequantize_4bit,
    quantize_4bit,
)


a = torch.rand(2, 3)
quantized, quant_state = quantize_4bit(A=a, quant_type="nf4")
quant_dict = quant_state.as_dict()
quant_state = QuantState.from_dict(quant_dict, device=torch.device("cuda"))
quantized = dequantize_4bit(A=quantized, quant_state=quant_state)

Before this PR, it failed with (on bitsandbytes 0.47.0):

  File "/home/cyy/a.py", line 13, in <module>
    quant_state = QuantState.from_dict(quant_dict, device=torch.device("cuda"))
  File "/home/cyy/.local/lib/python3.13/site-packages/bitsandbytes/functional.py", line 469, in from_dict
    raise ValueError(
        f"There should be exactly one `quant_state` item with ending from {cls.valid_qs_type_keys}.\nDetected {qs_key}.",
    )
ValueError: There should be exactly one `quant_state` item with ending from ['bitsandbytes__fp4', 'bitsandbytes__nf4'].
Detected [].

matthewdouglas · 2025-09-03T17:36:18Z

Hi,
Can you provide more detail on the issue that this fixes? A simple reproducer with example output would be ideal. Thanks!

cyyever · 2025-09-04T00:45:17Z

@matthewdouglas Added

Signed-off-by: cyy <cyyever@outlook.com>

Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>

github-actions · 2025-09-18T23:25:03Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

TimDettmers

PR Review: #1729 — Fix QuantState and dict conversions

Bug fix: corrects two bugs in QuantState.as_dict() and one bug in QuantState.from_dict() that prevent the as_dict/from_dict round-trip from working when shape or quant_type are None (as produced by quantize_blockwise).

No blocking issues. The fix is correct, well-scoped, and includes regression tests.

Analysis

There are three distinct bugs being fixed:

as_dict() crashes when self.shape is None — quantize_blockwise() creates QuantState without setting shape (defaults to None). The old code unconditionally calls tuple(self.shape) on line 572, which raises TypeError. The fix adds a None guard: tuple(self.shape) if self.shape is not None else None.
as_dict(packed=True) crashes when self.quant_type is None — The old code unconditionally concatenates "bitsandbytes__" + self.quant_type on line 590, which raises TypeError when quant_type is None (again from quantize_blockwise). The fix conditionally appends the quant_type suffix only when it is not None.
from_dict() rejects valid unpacked dicts — The old validation logic at lines 523–528 used if/elif with not len(qs_key) and len(qs_key) != 1 checks. When given an unpacked dict (from as_dict(packed=False)) that has quant_type as a plain key but no quant_state.* packed key, the elif branch fires because len(qs_key) == 0 != 1, raising a misleading ValueError. The fix restructures the conditionals so that packed-format validation only runs when quant_type is absent from the dict, correctly recognizing unpacked dicts.

All three fixes are narrowly scoped and correct.

Serialization Compatibility

This is the critical concern for QuantState changes. After careful analysis:

4-bit checkpoints (the common case): quantize_4bit() always sets both shape and quant_type, so the as_dict(packed=True) path used by Linear4bit._save_to_state_dict() is unaffected. The packed key format quant_state.bitsandbytes__nf4 / quant_state.bitsandbytes__fp4 is unchanged.
Old checkpoints loading with new code: Old checkpoints use the packed format with quant_type always set. from_dict() still handles these correctly — the new validation logic is equivalent to the old logic when a packed quant_state.* key is present.
New checkpoints loading with old code: Not a concern for the packed format since the output is identical for the 4-bit case. For blockwise QuantState (which is never serialized to disk via _save_to_state_dict), the unpacked format now includes None values for shape and quant_type, but this dict is only used in-memory.
Latent note: The as_dict(packed=True) path with quant_type=None produces a key "quant_state.bitsandbytes__" which from_dict would still reject (since "bitsandbytes__" is not in valid_qs_type_keys). This is not a practical issue because blockwise QuantState is never serialized via the packed path, but it is worth noting as a theoretical incomplete fix. Not blocking.

Test Coverage

The PR adds as_dict/from_dict round-trip coverage to four existing test functions:

test_dynamic_blockwise_quantization (two loops: default code and custom code)
test_fp8_quant
test_4bit_quant

This covers both blockwise quantization (where shape/quant_type are None) and 4-bit quantization (where they are set). The tests use packed=False (the default), which is the path that was broken. The tests verify the round-trip by running dequantization after reconstruction, confirming the reconstructed QuantState is functionally correct.

The test does not cover the packed=True round-trip for blockwise QuantState, but as noted above, that code path is not used in practice.

Minor: Typo fix

The PR also fixes a documentation typo in the dequantize_4bit docstring: "The the absolute" -> "The absolute". This is fine.

Security: Clear (no new imports, no network access, no filesystem writes, no obfuscation, no invisible Unicode characters)
Downstream impact: None (4-bit checkpoint serialization format is unchanged; QuantState constructor signature unchanged; vLLM's QuantState.from_dict() usage is unaffected)
Tests: Adequate (round-trip coverage for both blockwise and 4-bit paths)
CI: Not triggered (fork PR — a maintainer needs to approve the workflow run before merge)
Serialization: Compatible (no change to packed checkpoint format for 4-bit; blockwise QuantState is not persisted)
Cross-PR conflicts: PR #1866 also modifies bitsandbytes/functional.py (adds __getattr__ to QuantState) but touches different lines. No direct conflict, though #1866's __getattr__ calls self.as_dict(packed=True) which would be affected by the as_dict change here. Since #1866 only exercises the 4-bit path (where quant_type is always set), there is no semantic conflict.
Commit hygiene: 7 commits including 2 merge commits. Recommend squash merge.

bitsandbytes/functional.py

Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>

cyyever force-pushed the fix_quant_state branch from f8ff22b to 5557581 Compare August 18, 2025 07:55

cyyever force-pushed the fix_quant_state branch from 60a2df6 to 5eb1e03 Compare August 26, 2025 01:24

cyyever force-pushed the fix_quant_state branch from 5eb1e03 to 398e915 Compare September 2, 2025 16:29

cyyever force-pushed the fix_quant_state branch from 398e915 to 4c95352 Compare September 11, 2025 00:41

cyyever added 5 commits September 17, 2025 13:13

Fix QuantState.as_dict

a47a40e

Signed-off-by: cyy <cyyever@outlook.com>

Fix QuantState.from_dict

9b7b373

Signed-off-by: cyy <cyyever@outlook.com>

Fix QuantState.as_dict

d53c8f6

Signed-off-by: cyy <cyyever@outlook.com>

Add test

b169b03

Signed-off-by: cyy <cyyever@outlook.com>

Fix comment

5256287

Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>

cyyever force-pushed the fix_quant_state branch from 4c95352 to 5256287 Compare September 17, 2025 05:13

This was referenced Feb 16, 2026

fix: Replace hard-coded precision thresholds with std-based bounds #1864

Open

fix: Stop quantize_blockwise and quantize_4bit from mutating user-provided absmax #1863

Open

TimDettmers reviewed Feb 16, 2026

View reviewed changes

bitsandbytes/functional.py Show resolved Hide resolved

Fix self.quant_type is None

e59236a

Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>

cyyever force-pushed the fix_quant_state branch from 429db24 to e59236a Compare February 17, 2026 00:40

Merge branch 'main' into fix_quant_state

597e81a

cyyever requested a review from TimDettmers February 17, 2026 00:41

Merge branch 'main' into fix_quant_state

bf16801

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix QuantState and dict conversions#1729

Fix QuantState and dict conversions#1729
cyyever wants to merge 8 commits intobitsandbytes-foundation:mainfrom
cyyever:fix_quant_state

cyyever commented Aug 18, 2025 •

edited

Loading

Uh oh!

matthewdouglas commented Sep 3, 2025

Uh oh!

cyyever commented Sep 4, 2025

Uh oh!

github-actions bot commented Sep 18, 2025

Uh oh!

TimDettmers left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Uh oh!

Conversation

cyyever commented Aug 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

matthewdouglas commented Sep 3, 2025

Uh oh!

cyyever commented Sep 4, 2025

Uh oh!

github-actions bot commented Sep 18, 2025

Uh oh!

TimDettmers left a comment

Choose a reason for hiding this comment

PR Review: #1729 — Fix QuantState and dict conversions

Analysis

Serialization Compatibility

Test Coverage

Minor: Typo fix

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

cyyever commented Aug 18, 2025 •

edited

Loading