Possibility for batch/parallel/quicker data collection?

Hi,

Thanks for this great repo. It has been a great help for our data generation pipeline on our own robot!

Currently, the capture process [fetches the data one by one](https://github.com/NVlabs/MobilityGen/blob/1416c2b49070e72ea96c0a7417eaf01a008af9fd/exts/omni.ext.mobility_gen/omni/ext/mobility_gen/sensors.py#L141), and each fetch triggers one GPU->CPU sync, so capturing multiple data formats takes a long time.

<img width="500" height="300" alt="Image" src="https://github.com/user-attachments/assets/3ada7273-b6e1-4590-b62c-f131b34178c0" />

The nsys report:

<img width="1024" height="512" alt="Image" src="https://github.com/user-attachments/assets/8b8d5460-f0ea-43f4-b85b-6869aa8b235e" />

***

After searching the docs (with the help of GPT/Gemini), it seems BasicWriter does have some nice support for this: a configurable write queue and write threads, the JPEG format, etc.:

https://docs.isaacsim.omniverse.nvidia.com/latest/replicator_tutorials/tutorial_replicator_getting_started.html#custom-writer-and-annotators-with-multiple-cameras

There may also be some helper classes in Isaac Lab.


But the state data dict used in this repo is super easy to understand and use: we can just add more key/value pairs and write all of them to one big npy file, which is easier to use in training.
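
To illustrate the kind of thing I have in mind, here is a rough sketch of keeping the state dict format while moving the disk write onto background threads behind a bounded queue. This is not code from MobilityGen or Replicator: `AsyncStateWriter`, `save_fn`, and `save_npy` are hypothetical names I made up, and it only hides the file I/O cost, not the GPU->CPU sync itself.

```python
import os
import queue
import threading


def save_npy(path, state):
    """Write one state dict to a .npy file (stored as a pickled object)."""
    import numpy as np  # lazy import: the writer itself only needs the stdlib
    np.save(path, state, allow_pickle=True)


class AsyncStateWriter:
    """Hypothetical helper: writes state dicts to disk on background threads."""

    def __init__(self, out_dir, save_fn=save_npy, max_queue=64, num_threads=2):
        self.out_dir = out_dir
        self.save_fn = save_fn
        self.errors = []  # exceptions raised by save_fn, for later inspection
        os.makedirs(out_dir, exist_ok=True)
        # Bounded queue: write() blocks when the disk cannot keep up,
        # instead of letting memory grow without limit.
        self._queue = queue.Queue(maxsize=max_queue)
        self._threads = [
            threading.Thread(target=self._worker, daemon=True)
            for _ in range(num_threads)
        ]
        for t in self._threads:
            t.start()

    def _worker(self):
        while True:
            item = self._queue.get()
            if item is None:  # sentinel: stop this worker
                return
            step, state = item
            path = os.path.join(self.out_dir, f"{step:08d}.npy")
            try:
                self.save_fn(path, state)
            except Exception as exc:  # keep the worker alive on a bad frame
                self.errors.append(exc)

    def write(self, step, state):
        """Enqueue one state dict; returns as soon as it is queued."""
        self._queue.put((step, state))

    def close(self):
        """Flush all pending writes and stop the worker threads."""
        for _ in self._threads:
            self._queue.put(None)  # one sentinel per worker
        for t in self._threads:
            t.join()
```

In the capture loop this would be used as `writer.write(step, state_dict)` per frame and `writer.close()` at the end. If the sensor buffers are reused between frames, the arrays would need to be copied before being enqueued.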

So I am wondering: are there any plans to make multimodal data generation quicker?

Sincerely.

Possibility for batch/parallel/quicker data collection? #39
