@emmanuelmathot
Contributor

Implement sharding support for GeoZarr conversion, enhancing the CLI with new commands for optimized Sentinel-2 datasets. Improve multiscale pyramid creation, downsampling methods, and metadata handling while ensuring compatibility with chunk dimensions. Refactor code for better readability and maintainability, and add comprehensive tests to validate the new features.

emmanuelmathot and others added 30 commits September 26, 2025 09:30
- Introduced a new dataset tree structure for Sentinel-2 data, detailing conditions, quality, and measurements.
- Added a comprehensive test script to verify the sharding fix for GeoZarr conversion.
- Implemented tests for shard dimension calculations and encoding creation with sharding enabled/disabled.
- Enhanced output for better debugging and validation of shard dimensions against chunk dimensions.
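For context on the constraint these tests check: with Zarr v3 sharding, each shard holds a whole number of inner chunks, so the shard shape must be an integer multiple of the chunk shape along every dimension. Below is a minimal sketch of one way to derive such shard dimensions. The helper name `calculate_shard_dims` and the `max_chunks_per_shard` parameter are hypothetical and are not taken from this PR's code.

```python
from typing import Sequence, Tuple


def calculate_shard_dims(
    data_shape: Sequence[int],
    chunk_shape: Sequence[int],
    max_chunks_per_shard: int = 4,  # hypothetical tuning knob, not from the PR
) -> Tuple[int, ...]:
    """Return a shard shape that is an exact multiple of the chunk shape."""
    if len(data_shape) != len(chunk_shape):
        raise ValueError("data_shape and chunk_shape must have the same length")

    shard = []
    for size, chunk in zip(data_shape, chunk_shape):
        if size <= 0 or chunk <= 0:
            raise ValueError("sizes and chunks must be positive")
        # Number of chunks needed to cover this dimension (ceiling division).
        n_chunks = -(-size // chunk)
        # Never put more chunks in a shard than the dimension actually has.
        per_shard = min(max_chunks_per_shard, n_chunks)
        shard.append(per_shard * chunk)
    return tuple(shard)


def is_aligned(shard_shape: Sequence[int], chunk_shape: Sequence[int]) -> bool:
    """Check that every shard dimension is divisible by its chunk dimension."""
    return all(s % c == 0 for s, c in zip(shard_shape, chunk_shape))
```

For a 10980 x 10980 array with 1024 x 1024 chunks, this sketch yields 4096 x 4096 shards, which `is_aligned` accepts.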
…essing

- Created the `s2_optimization` module for optimizing Sentinel-2 Zarr datasets.
- Implemented CLI commands for converting Sentinel-2 datasets to optimized structures.
- Developed band mapping and resolution definitions for Sentinel-2 optimization.
- Added the `S2OptimizedConverter` class for handling the conversion process.
- Implemented data consolidation logic to reorganize Sentinel-2 structure.
- Created multiscale pyramid generation for optimized data.
- Added downsampling operations for various data types (reflectance, classification, quality masks).
- Implemented validation logic for optimized Sentinel-2 datasets.
- Developed unit tests for band mapping, converter functionality, and resampling operations.
- Implemented unit tests for initialization, pyramid level structure, chunk alignment, and shard dimension calculations.
- Created tests for encoding generation, dataset writing, and level dataset creation at various resolutions.
- Included integration tests for realistic measurement data and edge-case handling.
- Ensured coverage of time separation logic and coordinate preservation during processing.
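The downsampling bullet above distinguishes reflectance, classification, and quality-mask data because each needs a different aggregation when building coarser pyramid levels. The sketch below illustrates that idea with NumPy only. The function names and the specific choices (block mean for reflectance, majority vote for classification, "any flagged" for masks) are assumptions for illustration and are not confirmed by this PR.

```python
import numpy as np


def _to_blocks(arr: np.ndarray, factor: int) -> np.ndarray:
    """Group a 2D array into (rows, cols, factor*factor) blocks, trimming edges."""
    rows, cols = arr.shape[0] // factor, arr.shape[1] // factor
    trimmed = arr[: rows * factor, : cols * factor]
    blocks = trimmed.reshape(rows, factor, cols, factor).swapaxes(1, 2)
    return blocks.reshape(rows, cols, factor * factor)


def _majority(values: np.ndarray):
    """Most frequent value in a 1D array (ties resolve to the smallest label)."""
    labels, counts = np.unique(values, return_counts=True)
    return labels[np.argmax(counts)]


def downsample(arr: np.ndarray, factor: int, kind: str) -> np.ndarray:
    """Downsample a 2D array by an integer factor, per data kind (assumed rules)."""
    blocks = _to_blocks(arr, factor)
    if kind == "reflectance":
        # Continuous data: average the block.
        return blocks.mean(axis=-1)
    if kind == "classification":
        # Categorical data: averaging class codes is meaningless, so vote.
        return np.apply_along_axis(_majority, -1, blocks)
    if kind == "quality_mask":
        # Flags: keep a coarse pixel flagged if any fine pixel was flagged.
        return blocks.max(axis=-1)
    raise ValueError(f"unknown kind: {kind!r}")
```

The key design point is that categorical and flag layers must never be averaged, since a mean of class codes can produce a class that was not present in the block.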
emmanuelmathot and others added 23 commits October 26, 2025 09:37
…metadata handling in S2 multiscale pyramid creation
…ary datatree loading and streamline dataset writing return
…ee initialization in S2 converter and multiscale classes
…issing intermediary groups in Zarr structure
…mprove test for geo metadata integration with level creation
…st script

- Deleted the comprehensive optimization plan for the Sentinel-2 Zarr conversion, which included details on the current state, proposed structure, technical specifications, implementation plan, and expected benefits.
- Removed the test script for verifying the sharding fix in GeoZarr conversion, which included tests for shard dimensions and encoding creation.
@emmanuelmathot emmanuelmathot requested review from d-v-b and removed request for d-v-b November 24, 2025 11:28
- Removed --skip-geometry and --skip-meteorology options from CLI as they are no longer needed.
- Updated S2OptimizedConverter to initialize and use CRS for datasets, improving metadata handling.
- Modified S2MultiscalePyramid to accept CRS and ensure geographic metadata is written correctly.
- Improved logging messages for clarity and consistency throughout the conversion process.
@emmanuelmathot emmanuelmathot changed the title from "Add sharding support and optimize Sentinel-2 Zarr conversion" to "Few updates to S2 conversion {NOT TE BE MERGED]" Nov 24, 2025
@emmanuelmathot emmanuelmathot changed the title from "Few updates to S2 conversion {NOT TE BE MERGED]" to "Few updates to S2 conversion [NOT TE BE MERGED]" Nov 24, 2025
2 participants