perf: dont use a vstack before indexing #25

ilan-gold · 2025-08-22T13:03:43Z

@felix0097 Here is the no vstack. I don't want to merge it yet because it is generic over sparse and dense and with sparse, it doesn't help (and is more complicated). An overview of the two pipelines for comparison (after data fetching):

main

What's on main now will stack together the fetched chunks and then yield from the vstacked result batch_size subsets. If preload_gpu is enabled, the vstacking occurs just after the data is loaded onto the GPU

this branch

With this branch, the chunks are either left alone or converted to the GPU if preloading is enabled. Then they are yielded from based on batch_size.

I put this on a branch first to make sure it benefits dense. If it does, I'll put the feature behind a flag, and we can turn it on for dense only (or sparse if I have missed something perf-wise)

codecov · 2025-08-22T13:06:39Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 94.91%. Comparing base (2ef3b8a) to head (623bbe7).

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #25      +/-   ##
==========================================
+ Coverage   94.71%   94.91%   +0.19%     
==========================================
  Files           7        7              
  Lines         511      531      +20     
==========================================
+ Hits          484      504      +20     
  Misses         27       27

Files with missing lines	Coverage Δ
arrayloaders/io/zarr_loader.py	`95.01% <100.00%> (+0.35%)`	⬆️

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

github-actions · 2025-08-22T13:07:06Z

Deployment URL: https://7fc446fa.arrayloaders.pages.dev

ilan-gold added 9 commits August 19, 2025 11:55

feat: remove vstack

23b816e

fix: remove shape hotpath

124dfdb

Merge branch 'ig/remove_shape_hotpath' into ig/no_vstack

89c8233

fix: use array

171c472

feat: cupy early

4a7518e

Merge branch 'main' into ig/no_vstack_cupy

946fd65

fix: oops, actually yield properly when batch size matches in-memory

bcb6016

fix: properly handle leftovers

c446e63

Merge branch 'main' into ig/no_vstack_cupy

623bbe7

github-actions bot deployed to preview August 22, 2025 13:07 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: dont use a vstack before indexing #25

perf: dont use a vstack before indexing #25

Uh oh!

ilan-gold commented Aug 22, 2025 •

edited

Loading

Uh oh!

codecov bot commented Aug 22, 2025

Uh oh!

github-actions bot commented Aug 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

perf: dont use a vstack before indexing #25

Are you sure you want to change the base?

perf: dont use a vstack before indexing #25

Uh oh!

Conversation

ilan-gold commented Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

main

this branch

Uh oh!

codecov bot commented Aug 22, 2025

Codecov Report

Uh oh!

github-actions bot commented Aug 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ilan-gold commented Aug 22, 2025 •

edited

Loading