Does torchtune have any plans to support "GPU middle class" users?
We're evaluating torchtune for post-training, especially since it already implements many useful features (RLHF, LoRA, etc.). However, one big sticking point is that the system seems heavily geared towards single-node training. Are there plans to support multi-node training (e.g. 16-64 nodes) and things like model parallelism, 128k-context training, etc.?
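For concreteness, the kind of entry point we have in mind is the standard torchrun-launched FSDP pattern sketched below. This is only an illustration of the scale we mean, not torchtune's actual API; the tiny `Linear` layer stands in for a real model, and the launch flags in the comment are plain torchrun options.

```python
# Rough sketch of a multi-node entry point (not torchtune's API).
# Launched on each node with something like:
#   torchrun --nnodes=16 --nproc_per_node=8 \
#     --rdzv_backend=c10d --rdzv_endpoint=$MASTER_ADDR:29500 train.py
import os

import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP


def main():
    # torchrun sets RANK, LOCAL_RANK, and WORLD_SIZE on every node.
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    # Stand-in model; in practice this would be a large transformer,
    # ideally built from a torchtune model builder.
    model = torch.nn.Linear(4096, 4096).cuda(local_rank)
    model = FSDP(model, device_id=local_rank)

    # ... optimizer, data loader, and training loop would go here ...

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```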
If not, is torchtitan the recommended system to use?
Thanks!