It could be useful to define a dataset-level checksum and have the publisher add this as a dataset-level attribute.
A simple example would be the checksum of a textfile containing sorted listing of filenames and file checksums, but there might also be other metadata that needs to be rolled into the hash, so the algorithm would need thought and discussion before taking any action.