
enable loading universal checkpointing checkpoint in DeepSpeedStrategy #20065

Open
@zhoubay

Description & Motivation

After training a model on some number of GPUs (say, 8) for a while, it is difficult to load that checkpoint onto 16 GPUs while keeping the optimizer and model states intact. DeepSpeed has developed Universal Checkpointing to solve this problem, but PyTorch Lightning does not appear to support it yet.

Pitch

I would like PyTorch Lightning to support this feature.

Alternatives

Add universal_checkpoint as a parameter of DeepSpeedStrategy and modify the class following https://www.deepspeed.ai/tutorials/universal-checkpointing/, as sketched below.
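
For reference, a minimal sketch of what this could look like from the user side today, assuming DeepSpeed honors the `checkpoint.load_universal` config key described in the tutorial and that the checkpoint has already been converted with DeepSpeed's `ds_to_universal.py` script. The config key name and pass-through behavior are assumptions based on the DeepSpeed docs, not an existing Lightning API:

```python
# Sketch only: relies on DeepSpeed's universal-checkpointing support and on
# DeepSpeedStrategy forwarding the raw config dict to deepspeed.initialize().
from lightning.pytorch import Trainer
from lightning.pytorch.strategies import DeepSpeedStrategy

ds_config = {
    "train_micro_batch_size_per_gpu": 8,
    "zero_optimization": {"stage": 2},
    # Assumed DeepSpeed config key from the universal-checkpointing tutorial:
    # tells DeepSpeed the checkpoint being loaded is in universal format.
    "checkpoint": {"load_universal": True},
}

trainer = Trainer(
    accelerator="gpu",
    devices=16,  # e.g. resuming on 16 GPUs a run originally trained on 8
    strategy=DeepSpeedStrategy(config=ds_config),
)
# trainer.fit(model, ckpt_path="path/to/converted/universal/checkpoint")
```

The proposal would expose this as a first-class `universal_checkpoint` argument on `DeepSpeedStrategy`, so users would not have to hand-write the DeepSpeed config.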

Additional context

No response

cc @Borda @awaelchli
