Skip to content

Conversation

@SamuelJanas
Copy link

@SamuelJanas SamuelJanas commented Oct 8, 2023

  • submitted a draft
  • test fairness/multi-run
  • Find bottlenecks in Dataset
  • optimize
  • Write tips in the blogpost

@SamuelJanas
Copy link
Author

It took longer than expected as I had to guarantee that the benchmarking was fair to the best of my knowledge.😅
I'll focus on finding the bottlenecks for now to find some ways to improve the datasets overall.

@roszcz
Copy link
Member

roszcz commented Oct 8, 2023

Thanks, look good!

In terms of presentation, I think we can hide the detailed code used to generate the results and focus here only on the performance metrics. Can you move the code to a separate repository?

For the performance itself, can you also investigate impact of moving data to GPU, and parameters like num_workers in the torch DataLoader?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants