Importing NVRx to call the corresponding routines migrated to NVRx #3837

@sbak5

Description

Is your feature request related to a problem? Please describe.
MCore needs a dependency on NVRx so that it can call the checkpointing routines that have been migrated there.

We're in the middle of migrating the checkpointing code in dist_checkpointing to NVRx.

Tag the @mcore-oncall to get oncall's attention to this issue.

Describe the solution you'd like
CI pipelines and any checkpointing routines that depend on the migrated routines will use them only when NVRx is installed. Otherwise, they will fall back to the corresponding torch.distributed.checkpoint routines.

Async checkpointing will be enabled only when NVRx is installed.
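
A rough sketch of the selection logic described above, not the actual MCore change. It assumes NVRx is importable as the top-level module `nvidia_resiliency_ext`; the helper names (`is_installed`, `select_checkpoint_backend`, `resolve_async_save`) and the returned dictionary are hypothetical and only illustrate the fallback behaviour. No NVRx or torch APIs are called here.

```python
import importlib.util

# Assumption: NVRx is importable under this top-level module name.
NVRX_MODULE = "nvidia_resiliency_ext"


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be located, without importing it."""
    return importlib.util.find_spec(module_name) is not None


def select_checkpoint_backend(nvrx_available: bool) -> dict:
    """Pick the checkpointing backend (hypothetical helper).

    With NVRx installed, use the migrated routines and allow async checkpointing.
    Without it, fall back to torch.distributed.checkpoint and keep async off.
    """
    if nvrx_available:
        return {"backend": "nvrx", "async_allowed": True}
    return {"backend": "torch.distributed.checkpoint", "async_allowed": False}


def resolve_async_save(requested: bool, nvrx_available: bool) -> bool:
    """Async checkpointing is honoured only when NVRx is available."""
    return requested and select_checkpoint_backend(nvrx_available)["async_allowed"]


if __name__ == "__main__":
    available = is_installed(NVRX_MODULE)
    print(select_checkpoint_backend(available))
    print("async save:", resolve_async_save(True, available))
```

Here an async request silently degrades to a synchronous save when NVRx is missing; raising an error instead would be an equally reasonable choice. CI tests that exercise the migrated routines could use the same `is_installed` check to skip themselves when NVRx is absent.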

Labels: enhancement (New feature or request)
