you can reduce cuda mem usage by discarding intermediate tensors

According to readme, the big-part of cuda memory use is in Aggregator. And the output of aggregator is not fully used. Only 4 layers of output_list are processed in following stages. You can safely discard the other layers.

```
intermediate_layer_idx: List[int] = [4, 11, 17, 23]
```

https://github.com/facebookresearch/vggt/blob/main/vggt/models/aggregator.py#L253

<img width="2308" height="726" alt="Image" src="https://github.com/user-attachments/assets/bec61f27-4568-43f3-ad4a-f492c0a8838d" />


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

you can reduce cuda mem usage by discarding intermediate tensors #455

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

you can reduce cuda mem usage by discarding intermediate tensors #455

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions