Fix MPS #3444

goanpeca · 2025-09-04T16:06:24Z

Fix MPS by detaching before moving to cpu and moving to numpy array

Copilot

Pull Request Overview

This PR fixes an issue with MPS (Metal Performance Shaders) compatibility by adding detach() calls before moving tensors to CPU in the FID score calculation.

Key Changes

Added .detach() calls before .cpu() for mu1, mu2, sigma1, and sigma2 tensors to prevent gradient tracking issues when moving from GPU to CPU

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

vfdev-5 · 2025-09-04T20:25:41Z

@goanpeca please post here the error you see without this PR to understand better why we need now to call detach

goanpeca · 2025-09-04T22:37:00Z

@vfdev-5 it was a suggestion from copilot on the first PR to add the detach method before the CPU.

Added .detach() calls before .cpu() for mu1, mu2, sigma1, and sigma2 tensors to prevent gradient tracking issues when
moving from GPU to CPU

Detach the tensor from the computational graph (if it requires gradients):
If the tensor was created with requires_grad=True or is part of a computation where gradients are being tracked, you need to detach it from the computational graph using the .detach() method. This prevents errors related to attempting to convert a tensor that requires gradients to a NumPy array, as NumPy arrays do not have a concept of gradients.

Is this something that could apply to mu1, mu2, sigma1, sigma2?

goanpeca · 2025-09-05T14:26:21Z

@vfdev-5 the error was something on

tests/ignite/metrics/gan/test_fid.py:51: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
ignite/metrics/gan/fid.py:40: in fid_score
    covmean, _ = scipy.linalg.sqrtm(sigma1.mm(sigma2).cpu(), disp=False)
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

...
          if disp is False:
              try:
  >               arg2 = norm(res @ res - A, 'fro')**2 / norm(A, 'fro')
                              ^^^^^^^^^^^^^
  E               TypeError: unsupported operand type(s) for -: 'numpy.ndarray' and 'Tensor'
  
  .venv/lib/python3.11/site-packages/scipy/linalg/_matfuncs.py:547: TypeError

so that is why we had to add a .numpy() call. For the detach see previous message, it was a suggestion from copilot, but if it does not apply, I can revert it!

vfdev-5

Thanks Gonzalo!

Copilot AI review requested due to automatic review settings September 4, 2025 16:06

goanpeca mentioned this pull request Sep 4, 2025

Fix max iters issue and add tests #3439

Draft

Copilot AI reviewed Sep 4, 2025

View reviewed changes

github-actions bot added the module: metrics Metrics module label Sep 4, 2025

goanpeca marked this pull request as draft September 4, 2025 22:39

goanpeca force-pushed the fix/mps-test branch 4 times, most recently from 5190667 to 55f3789 Compare September 5, 2025 14:22

Fix MPS

1d7f83d

goanpeca force-pushed the fix/mps-test branch from 55f3789 to 1d7f83d Compare September 5, 2025 14:31

goanpeca marked this pull request as ready for review September 5, 2025 14:31

vfdev-5 approved these changes Sep 5, 2025

View reviewed changes

vfdev-5 enabled auto-merge September 5, 2025 14:46

vfdev-5 added this pull request to the merge queue Sep 5, 2025

Merged via the queue into pytorch:master with commit 1fc3214 Sep 5, 2025
26 checks passed

goanpeca deleted the fix/mps-test branch September 5, 2025 15:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix MPS #3444

Fix MPS #3444

Uh oh!

goanpeca commented Sep 4, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

vfdev-5 commented Sep 4, 2025

Uh oh!

goanpeca commented Sep 4, 2025 •

edited

Loading

Uh oh!

goanpeca commented Sep 5, 2025 •

edited by vfdev-5

Loading

Uh oh!

vfdev-5 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Fix MPS #3444

Fix MPS #3444

Uh oh!

Conversation

goanpeca commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Key Changes

Uh oh!

vfdev-5 commented Sep 4, 2025

Uh oh!

goanpeca commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

goanpeca commented Sep 5, 2025 • edited by vfdev-5 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vfdev-5 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

goanpeca commented Sep 4, 2025 •

edited

Loading

goanpeca commented Sep 4, 2025 •

edited

Loading

goanpeca commented Sep 5, 2025 •

edited by vfdev-5

Loading