SPDL (Scalable and Performant Data Loading) is a library and project to explore the design and performance of data loading with thread-based parallelism.
Please checkout the documentation.
SPDL is BSD 2-Clause licensed, as found in the LICENSE file.
Please use the following BibTex for citing our project if you find it useful.
@misc{hira2025scalableperformantdataloading,
title={Scalable and Performant Data Loading},
author={Moto Hira and Christian Puhrsch and Valentin Andrei and Roman Malinovskyy and Gael Le Lan and Abhinandan Krishnan and Joseph Cummings and Miguel Martin and Gokul Gunasekaran and Yuta Inoue and Alex J Turner and Raghuraman Krishnamoorthi},
year={2025},
eprint={2504.20067},
archivePrefix={arXiv},
primaryClass={cs.DC},
url={https://arxiv.org/abs/2504.20067},
}