Replies: 2 comments 1 reply
-
|
I swear there was already an issue about this somewhere... I don't think there is a way to do it today, but there could be. It would require iterating through all the chunks in all the relevant manifests, so it could end up being a compute-intensive operation. |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
Thanks Tom, that makes sense. I'll try a different strategy of adding a variable with time dimension to my Icechunk repo dataset containing the NetCDF file name and use that as my set of already-processed-files. Or maybe generate the filenames from the time variable data! |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
What would be the best way to get a list (or set) of the unique URLs referenced in a virtual Icechunk v2 repo?
The use case is that I've got virtual FMRC Icechunk repo where I'm creating references for new NetCDF files produced by the forecast.
I'd like to generate one set of NetCDF file names from the existing Icechunk repo, and another set by scanning the forecast NetCDF files in the bucket, and then generate refs and append whatever files are missing.
Beta Was this translation helpful? Give feedback.
All reactions