fix: handle device in the same way as dtype in `aten.full_like` decomposition #3538

junstar92 · 2025-05-29T04:11:24Z

Description

This PR extends the changes introduced in PR #3535 by applying similar handling for the device, which was previously missed.

In the original PR, the focus was on ensuring the correct propagation of dtype when using torch.full_like. However, torch.full_like also accepts a device argument, and if a device is explicitly passed, it may differ from the input tensor's device. This can result in the output tensor being created on a different device than the input, leading to device mismatch issues.

import torch
from torch.export._trace import _export
from torch_tensorrt.dynamo.lowering import get_decompositions

device0 = torch.device("cuda", index=0)
device1 = torch.device("cuda", index=1)


class MyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()

    def forward(self, x):
        return torch.ones_like(x, dtype=torch.bool, device=torch.device("cuda", index=1))


model = MyModel().to(device0)
x = torch.randn(1, 10, dtype=torch.float16).to(device0)
ep = _export(model, (x,))
ep = ep.run_decompositions(get_decompositions(False))
gm = ep.module()
y = gm(x)

assert y.device == device1, f"{device1} expected, but got {y.device}"

Results:

AssertionError: cuda:1 expected, but got cuda:0

To prevent this, this PR ensures that the device is handled in the same way as dtype in the previous PR.

Type of change

Bug fix (non-breaking change which fixes an issue)

Checklist:

My code follows the style guidelines of this project (You can use the linters)
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas and hacks
I have made corresponding changes to the documentation
I have added tests to verify my fix or my feature
New and existing unit tests pass locally with my changes
I have added the relevant labels to my PR in so that relevant reviewers are notified

peri044

LGTM

fix: address device argument like dtype

06e2b4f

facebook-github-bot added the cla signed label May 29, 2025

github-actions bot added component: lowering Issues re: The lowering / preprocessing passes component: api [Python] Issues re: Python API component: dynamo Issues relating to the `torch.compile` or `torch._dynamo.export` paths labels May 29, 2025

github-actions bot requested a review from gs-olive May 29, 2025 04:11

narendasan requested review from peri044 and apbose May 29, 2025 20:31

peri044 approved these changes May 29, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: handle device in the same way as dtype in `aten.full_like` decomposition #3538

fix: handle device in the same way as dtype in `aten.full_like` decomposition #3538

Uh oh!

junstar92 commented May 29, 2025

Uh oh!

peri044 left a comment

Uh oh!

Uh oh!

fix: handle device in the same way as dtype in aten.full_like decomposition #3538

Are you sure you want to change the base?

fix: handle device in the same way as dtype in aten.full_like decomposition #3538

Uh oh!

Conversation

junstar92 commented May 29, 2025

Description

Type of change

Checklist:

Uh oh!

peri044 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

fix: handle device in the same way as dtype in `aten.full_like` decomposition #3538

fix: handle device in the same way as dtype in `aten.full_like` decomposition #3538