What's Changed
- Fixed mocking for DFTracer by @hariharan-devarajan in #220
- Fixed iterator to only store data for that rank. by @hariharan-devarajan in #216
- Fix PyPI Publish Issue and Improve Project Metadata by @izzet in #224
- Fix missing import for chunking. by @hariharan-devarajan in #223
- Improve CI Performance. by @hariharan-devarajan in #227
- For sample indexing we fix the uneven sampling by @hariharan-devarajan in #226
- fix misleading generator message by @rayandrew in #231
- fix negative value of computation time when stdev exists by @rayandrew in #233
- Bugfix: fix type of number for offset and size by @hariharan-devarajan in #229
- Fix wrong configuration for hdf5 chunking by @rayandrew in #237
- fix last step is not executed by @rayandrew in #236
- fix wrong tracing location of fetch data by @rayandrew in #238
- enable option to disable pin_memory in pytorch by @rayandrew in #239
- Change
maxtoabsfor preprocess time by @rayandrew in #240 - New improved modelling for LLM Deepspeed. by @hariharan-devarajan in #230
- Add user config to specify type of distribution of time configuration by @rayandrew in #241
- fixed bug on doc action by @zhenghh04 in #251
- Update jekyll-gh-pages.yml by @zhenghh04 in #254
- upgrade pydftracer package by @rayandrew in #242
- Support for setting different DLIO_LOG_LEVEL by @zhenghh04 in #222
- Enhancing metric calculation and output functionality by @zhenghh04 in #253
- Checkpointing support for transformer type models by @zhenghh04 in #247
- Update jekyll-gh-pages.yml by @zhenghh04 in #257
- Fix doc deployment issue by @zhenghh04 in #258
- left over logging fix by @zhenghh04 in #259
- docker: use pip install to match readme by @glimchb in #265
- ci: add docker build and publish by @glimchb in #263
- Darshan preload environment variable removed by @zhenghh04 in #260
- Copyright update by @zhenghh04 in #261
- Update docker.yml by @zhenghh04 in #266
- ci: also publish docker image on releases by @glimchb in #267
- Fix saving checkpoint print by @LouisDDN in #270
- docs: small readme typo by @glimchb in #268
- Fixed loading checkpoint timer by @zhenghh04 in #273
- Refactor: move pydftracer dependency to extras for better management by @hariharan-devarajan in #275
- Enhancement for checkpoint feature by @zhenghh04 in #276
- Separate read and write checkpoints. by @zhenghh04 in #278
- configs by @zhenghh04 in #284
- docker: improve docker cache and remove sources by @glimchb in #287
- Fixes for v2.0 benchmark by @johnugeorge in #289
- Reorganized the code provided by YardenMa for O_DIRECT support with NPY and NPZ formats and pytorch by @timothy-chau in #286
- RAM optimisations for checkpointing by @LouisDDN in #283
- Randomize tensor data by default (checkpoint) by @LouisDDN in #291
- S3 Fix by @zhenghh04 in #294
- Mlperf storage v2.0 by @zhenghh04 in #303
- Dimension-based Dataset Generation by @rayandrew in #301
- docs(profiling): fix dftracer repo location by @glimchb in #304
- Add DFTracer AI logging support with dftracer by @rayandrew in #302
- increase tests timeout to 600s (10 minutes) by @rayandrew in #312
New Contributors
- @rayandrew made their first contribution in #231
- @glimchb made their first contribution in #265
- @timothy-chau made their first contribution in #286
Full Changelog: v2.0.0...v2.0.1