Extend DeepSpeed inference initialization API with a 'quantize_groups' argument #12999

Re-run triggered January 24, 2025 22:27
Status Success
Total duration 1h 32m 22s

nv-torch-latest-v100.yml

on: pull_request