Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix CPU dequantization to use nested dequantized scaling constant #1549

Conversation

zyklotomic
Copy link
Contributor

I believe that in the CPU case, we have forgotten to use the updated absmax if that absmax too was quantized, like in QLoRA's doubleDequant().

@zyklotomic
Copy link
Contributor Author

Similarly, I also noticed slightly above, in:

code = quant_state.code.to(A.device)
the updated code variable is not used. Not sure if this is an error or extraneous.

@matthewdouglas matthewdouglas added bug Something isn't working x64 CPU labels Mar 5, 2025
@matthewdouglas matthewdouglas self-assigned this Mar 5, 2025
@matthewdouglas matthewdouglas self-requested a review March 5, 2025 15:07
@matthewdouglas matthewdouglas added this to the v0.46.0 milestone Mar 5, 2025
Copy link

github-actions bot commented Mar 5, 2025

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@matthewdouglas matthewdouglas merged commit d8d157f into bitsandbytes-foundation:main Mar 7, 2025
34 checks passed
@matthewdouglas
Copy link
Member

Thanks! The issue above seems like an oversight too, but it's refactored and addressed in #1544.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working x64 CPU
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants