Skip to content

Extend DeepSpeed inference initialization API with a 'quantize_groups' argument #12848

Extend DeepSpeed inference initialization API with a 'quantize_groups' argument

Extend DeepSpeed inference initialization API with a 'quantize_groups' argument #12848

Re-run triggered January 24, 2025 22:27
Status Success
Total duration 7m 26s
Artifacts

nv-accelerate-v100.yml

on: pull_request
Fit to window
Zoom out
Zoom in