Cpu memory graph break #3886

cehongwang · 2025-11-04T20:05:10Z

Description

Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.

Fixes # (issue)

Type of change

Please delete options that are not relevant and/or add your own.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Checklist:

My code follows the style guidelines of this project (You can use the linters)
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas and hacks
I have made corresponding changes to the documentation
I have added tests to verify my fix or my feature
New and existing unit tests pass locally with my changes
I have added the relevant labels to my PR in so that relevant reviewers are notified

github-actions

There are some changes that do not conform to Python style guidelines:

--- /home/runner/work/TensorRT/TensorRT/py/torch_tensorrt/dynamo/_compiler.py	2025-11-04 20:05:23.825034+00:00
+++ /home/runner/work/TensorRT/TensorRT/py/torch_tensorrt/dynamo/_compiler.py	2025-11-04 20:05:55.253944+00:00
@@ -876,15 +876,14 @@
    # This is done to release CPU memory.
    for attr in dir(gm):
        if attr.startswith("_frozen_param"):
            delattr(gm, attr)

-
-
    from torch_tensorrt.dynamo.conversion._ConverterRegistry import DYNAMO_CONVERTERS
+
    DYNAMO_CONVERTERS.disallowed_targets = set()
-    
+
    for name, _ in partitioned_module.named_children():
        submodule = getattr(partitioned_module, name)
        # filter on the GraphModule
        if not isinstance(submodule, torch.fx.graph_module.GraphModule):
            continue

narendasan

Do you have a test case or something to demonstrate this feature?

narendasan · 2025-11-04T20:10:40Z

py/torch_tensorrt/dynamo/partitioning/_adjacency_partitioner.py


 logger = logging.getLogger(__name__)
+NON_BREAKABLE_OP_LISTS = [
+    ["addmm", "addmm"],


Just a note for implementation later.

this should use an actual subgraph definition

it should use pytorch op targets not strings

addmm should be decomposed right so the graph we want is mm -> add

There should be a user facing API to modify this list similar to what we have for passes

narendasan · 2025-11-04T20:11:15Z

py/torch_tensorrt/dynamo/partitioning/_adjacency_partitioner.py

-
-    def calculate_num_of_break(self, subgraphs: List[Subgraph]) -> int:
+
+    def calculate_size_budget(


Should there be an API to define this manually?

Yeah I think so. For now you can just hardcode and play with it

narendasan · 2025-11-04T20:11:48Z

py/torch_tensorrt/dynamo/partitioning/_adjacency_partitioner.py

        This function breaks the subgraphs into smaller subgraphs at the specified frequency to save CPU memory.
        """
-        op_to_break = "add."
+        op_to_break = "addmm."


Why is this hardcoded?

This function is not called. I just left there for experiment and would delete later

cehongwang added 3 commits October 16, 2025 22:22

rebased to main

4ffb8b3

Added the experiment files

013c772

Implemeted the prototype

57b04d7

meta-cla bot added the cla signed label Nov 4, 2025

github-actions bot requested a review from peri044 November 4, 2025 20:05

github-actions bot requested changes Nov 4, 2025

View reviewed changes

narendasan reviewed Nov 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Cpu memory graph break #3886

Cpu memory graph break #3886

Uh oh!

cehongwang commented Nov 4, 2025

Uh oh!

github-actions bot left a comment

Uh oh!

narendasan left a comment

Uh oh!

narendasan Nov 4, 2025

Uh oh!

narendasan Nov 4, 2025

Uh oh!

cehongwang Nov 4, 2025

Uh oh!

narendasan Nov 4, 2025

Uh oh!

cehongwang Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		def calculate_num_of_break(self, subgraphs: List[Subgraph]) -> int:

		def calculate_size_budget(

Cpu memory graph break #3886

Are you sure you want to change the base?

Cpu memory graph break #3886

Uh oh!

Conversation

cehongwang commented Nov 4, 2025

Description

Type of change

Checklist:

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

narendasan left a comment

Choose a reason for hiding this comment

Uh oh!

narendasan Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

narendasan Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

cehongwang Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

narendasan Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

cehongwang Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants