RAM exhaustion when training on custom paired dataset (H2R works fine) #4

@AryanSethi06

Description

Hey! First off, thanks for releasing the Human2Robot (H2R) dataset and training code — I fine-tuned the model on the provided H2R data from Hugging Face and the results were incredible.

I’m now trying to replicate the setup with my own paired Human→Robot dataset. However, when I switch to my data, training consistently hits RAM exhaustion very early on (the dataset is only 20 episodes each for human and robot), even though the H2R dataset loads and trains without any issues on the same machine.

This makes me suspect there may be implicit assumptions or constraints on the dataset format, video properties, or loading pipeline that my custom data violates.

I wanted to ask:

Are there specific requirements or recommended specs for recording / preprocessing custom paired datasets?

Are there known failure modes that could cause excessive RAM usage with certain video formats or dataset structures?
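For what it's worth, here is my back-of-the-envelope estimate of decoded frame memory, assuming the loader holds full episodes in RAM as RGB uint8 frames (the width/height/fps/duration values below are just placeholders, not my actual recording specs):

```python
def decoded_episode_bytes(width: int, height: int, fps: float, seconds: float) -> int:
    """Rough size of one episode decoded to RGB uint8 frames held in RAM."""
    bytes_per_frame = width * height * 3  # 3 channels, 1 byte each
    n_frames = int(fps * seconds)
    return bytes_per_frame * n_frames

# Example: a 60 s episode recorded at 1920x1080, 30 fps
size = decoded_episode_bytes(1920, 1080, 30, 60)
print(f"{size / 1e9:.1f} GB per episode")  # ~11.2 GB
```

If my recordings are longer or higher-resolution than the H2R episodes, that alone could explain the blow-up even though the on-disk (compressed) sizes look comparable, so downscaling and FPS-capping to match H2R's specs would be my obvious first test.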

Below I've attached two of the video samples. Even after heavy compression, I'm still not able to load them.

episode_000000_compressed_compressed.mp4
episode_000000_compressed_compressed.mp4
