[RFC] PyTorch Custom Operators & Multi-Backend Support #1545

# Purpose

We intend to integrate [PyTorch Custom Operators](https://pytorch.org/tutorials/advanced/custom_ops_landing_page.html) as the primary mechanism for dispatching to device-specific operator implementations. An initial scaffolding is presented in PR #1544. This RFC serves to collect community feedback and refine our development plans.

# Why?
* Registering operators with `torch.library` allows us to take advantage of the existing device dispatch mechanisms in PyTorch.
* We can treat calls into our CUDA kernels, or other low-level backend implementations, as opaque operators, which improves `torch.compile` support.
* We can provide naive implementations of operators with only PyTorch code as a fallback option. This may additionally serve as a secondary CPU baseline, as per #1021.
* This simplifies the development of additional backends while taking an idiomatic, modern PyTorch approach.

# What about the `multi-backend-refactor` branch?

We plan to stop further development on that branch once #1544 is merged. From that point on, we expect backends to be implemented using the new custom operator registration mechanisms, and we expect to reuse much of the existing implementation work during the refactoring.

Our goal is to aggressively mainline our in-tree backends, while additionally enabling out-of-tree backends. We will expand on this topic in the near future.

## Supersedure 
This RFC is intended to supersede the following related RFCs, which remain open as of this writing:
* #894 
* #1020 
* #1021 

## Related Issues

Related issues and discussions include:
* #47 
* #107 
* #485 
* #1031 
* #1271 
* #1338 
* #1339 
* #1340 
* #1402 
* #1437 
* #1460 
* #1526 

Additionally, this relates to the following issues and discussions which have been closed:
* #990 
* #997 
* #1032 
* #1378 

## Relevant contributors
The following contributors may have particular interest and feedback on this topic:

@Titus-von-Koeller 
@christoph-koehncke 
@jiqing-feng 
@pnunna93 
@akx 
@rickardp 
@ji-huazhong 
@SlightwindSec
 
