Skip to content

How to create custom dataset. (MGN) #655

@yuki51192

Description

@yuki51192

Hello, I have a question regarding the datasets used in Learning to Simulate, such as water-3d.

My goal is to build datasets for other types of liquids and train a MeshGraphNet (MGN) model that can predict the dynamics of those materials. I have tried different simulation engines (e.g., PhysX and MPM-based solvers), but the ground-truth data I generated is not ideal, and the trained models do not function properly.

One issue I noticed is that the value ranges of my dataset are very different from the original datasets.
For example:

The original datasets (e.g., water-3d) use values within the range 0.125–0.875

My simulated data has values ranging from 0 to 80

As a result, the predicted outputs of my model differ significantly from the ground truth.

If there are specific requirements or considerations when generating new datasets—such as:

whether the dataset needs to be normalized to a fixed value range,

recommended preprocessing steps,

or how the original ground-truth data (e.g., water-3d) was generated,

I would greatly appreciate any guidance or clarification.

Best regards.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions