
Add NPU support for torchmonarch #2294

@xiaopyyy

Description


Summary

torchmonarch currently targets GPU-based execution (e.g., CUDA) and does not support NPU accelerators (Issue #1653, Issue #2296). As with the ongoing discussions in Issue #1649 and Issue #1683, extending Monarch to additional accelerator backends would broaden its applicability across diverse hardware environments.

Current status

We’ve implemented experimental Ascend NPU support and have successfully run Monarch workloads on Ascend machines:

  • Single-node tensor execution
  • Multi-node tensor execution

These results indicate that Monarch can be extended to support NPUs without requiring changes to user-facing APIs or application code.

Evidence

Attached screenshots:

  1. Monarch tensor execution on a single Ascend machine
  2. Monarch multi-machine tensor execution on two Ascend machines

Proposed next steps

Before submitting code, we’d like maintainer guidance on whether NPU support should:

  • Be upstreamed incrementally via Issues → PRs in torchmonarch, or
  • Live as a separate sub-project / experimental backend

If helpful, we can start with a small, gated PR (e.g., minimal NPU device discovery / backend abstraction) to align on design and review expectations.
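As a concrete starting point for such a gated PR, here is a sketch of minimal NPU device discovery. It assumes (as is true for Ascend today) that NPU support ships as the optional `torch_npu` package; the helper names `npu_available` and `pick_accelerator` are illustrative, not existing torchmonarch functions.

```python
# Minimal gated device-discovery sketch (assumption: availability of the
# optional Ascend `torch_npu` package signals NPU support, analogous to
# how CUDA availability is probed; function names are illustrative).
import importlib.util

def npu_available() -> bool:
    """True if the Ascend `torch_npu` extension is importable."""
    return importlib.util.find_spec("torch_npu") is not None

def pick_accelerator(prefer: str = "npu") -> str:
    """Choose an accelerator string, gated behind availability checks,
    falling back to 'cpu' when the preferred backend is absent."""
    if prefer == "npu" and npu_available():
        return "npu"
    return "cpu"
```

Because the probe only checks importability and never imports `torch_npu` eagerly, the gate is a no-op on machines without Ascend hardware, which keeps the initial PR small and low-risk for existing CUDA users.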

Goal

Contribute NPU support in a way that is aligned with Monarch’s architecture, avoids long-lived forks, and expands Monarch’s hardware ecosystem.
