[NNCF] FP8/FP4 support#1524

Open
daniil-lyakhov wants to merge 1 commit into huggingface:main from daniil-lyakhov:dl/nncf/fp4_fp8

@daniil-lyakhov
Contributor

What does this PR do?

[NNCF] FP8/FP4 support

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Contributor

@nikita-savelyevv nikita-savelyevv left a comment

Thanks!

Could you please also add a couple of cases to TRANSFORMERS_4BIT_CONFIGURATIONS? https://github.com/huggingface/optimum-intel/blob/main/tests/openvino/test_quantization.py#L558

),
(
"text-generation-with-past",
"opt125m",
Contributor

Could you please use some other model, e.g. llama? opt125m is too large, and in the future we'd like to replace it with a different one. The group size may need to be reduced.

Contributor Author

Done

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@daniil-lyakhov daniil-lyakhov marked this pull request as ready for review February 16, 2026 17:48
@daniil-lyakhov
Contributor Author

@ljaljushkin, @andrey-churkin please take a look

@andrey-churkin andrey-churkin self-requested a review February 17, 2026 16:08

5 participants