Bugfixes, tests, and tools around checksums #2259

bbockelm · 2025-04-26T16:15:29Z

This PR improves the checksum functionality:

- Fixes a bug where checksum failures were not reported as checksum mismatch errors (@jhiemstrawisc - I think this is probably important enough to put into a 7.16 RC).
- Records the client-side and server-side checksums in the plugin outputs. This will allow Elasticsearch to track the frequency of errors.
- Adds a flag to pelican object stat that allows CLI users to get a checksum back.

Example:

pelican object stat --json --checksums md5 pelican://`hostname`:8444/tmp/test/hello_world.txt
{"Name":"/tmp/test/hello_world.txt","Size":12,"ModTime":"2025-04-26T15:16:26Z","IsCollection":false,"checksums":{"md5":"6f5902ac237024bdd0c176cb93063dc4"}}

client/handle_http.go

client/handle_http_test.go

jhiemstrawisc · 2025-04-28T17:17:35Z

client/handle_http_test.go

+}
+
+// Test behavior when checksum is missing
+func TestChecksumMissing(t *testing.T) {


Each of these new checksum tests is largely a copy-paste-tweak of the others. Instead of using three separate tests, can you create one test that handles a slice of test cases?

jhiemstrawisc

LGTM. I'll add this as a 7.16 patch as well.

bbockelm added the bug, enhancement, client, and critical labels Apr 26, 2025

bbockelm added this to the v7.16 milestone Apr 26, 2025

bbockelm requested a review from jhiemstrawisc April 26, 2025 16:15

jhiemstrawisc requested changes Apr 28, 2025

bbockelm added 5 commits April 29, 2025 13:27

Improve the ClassAd string printer to handle deeply nested ClassAds.

3728efc

Improve handling of checksums

d45c7b6

- Export checksum type / name information outside the client library.
- Add standardized error types and singletons for failure checking.
- Do not overwrite checksum mismatch error once it occurs.
- Truncate checksum byte array to provided data length.

Add unit test coverage for checksums

1bf19c2

Add checksum output to stat

e74c984

Record checksum results in developer data

5e99561

bbockelm force-pushed the checksum_fixups branch from 139d3d5 to 5e99561 April 29, 2025 18:30

bbockelm linked an issue Apr 29, 2025 that may be closed by this pull request

Add checksum support to the client #2206

Closed

Fixups from code review.

7c1bcd5

jhiemstrawisc self-requested a review April 29, 2025 20:24

jhiemstrawisc approved these changes Apr 29, 2025

jhiemstrawisc merged commit 5103884 into PelicanPlatform:main Apr 29, 2025
26 checks passed
