Speedup quantization in refit for FP8 GRPO

**Is your feature request related to a problem? Please describe.**
Track the effort of accelerating the in-flight quantization in refit when vllm uses FP8 precision weights.

**Describe the solution you'd like**
A clear and concise description of what you want to happen.

**Describe alternatives you've considered**
A clear and concise description of any alternative solutions or features you've considered.

**Additional context**
Add any other context or screenshots about the feature request here.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Speedup quantization in refit for FP8 GRPO #1467

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Speedup quantization in refit for FP8 GRPO #1467

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions