Checklist
- 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/mit-han-lab/nunchaku/discussions/new/choose. Otherwise, it will be closed.
- 2. I will do my best to describe the issue in English.
Motivation
Has your team tried FP8 quantization? Would its accuracy be better than NF4/INT4? Also, are there any plans to support it in the future?
Related resources
No response