archives icon indicating copy to clipboard operation
archives copied to clipboard

Independently verify downloaded datasets

Open ghost opened this issue 8 years ago • 1 comments

Some datasets may contain hashes/checksums of the included files, which makes it easy to verify the integrity of the downloaded files. For datasets which don't, it's useful to download them again somewhere else, and compare the ipfs hashes.

This implies that downloaders sign and publish the ipfs hashes of the stuff they've finished downloading.

ghost avatar Jan 18 '17 23:01 ghost

You could use git-annex for that: http://git-annex.branchable.com/ It's already tested for large archives and could be used if the IPFS import needs to be repeated in the future. This could also serve as a basis for the initial download processing.

mguentner avatar Jan 21 '17 23:01 mguentner