RHash
Add Oshash
https://github.com/r-salas/oshash
This is being used more and more. It's very fast and resistant, because it reads only a small, fixed amount of data: the hash combines the file size with checksums of the beginning and the end of the file.
In pseudo-code, the hash is computed in the following way:
file_buffer = open("/path/to/file/")
head_checksum = checksum(file_buffer.head(64 * 1024)) # 64KB
tail_checksum = checksum(file_buffer.tail(64 * 1024)) # 64KB
file_hash = file_buffer.size + head_checksum + tail_checksum
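For anyone who wants something runnable, here is a minimal Python sketch of the pseudo-code above. It assumes, as I understand the OpenSubtitles reference code, that each "checksum" is the sum of the chunk read as unsigned little-endian 64-bit words and that the total wraps at 64 bits. The names `oshash` and `_sum_words` are made up for illustration, and rejecting files smaller than 64 KB is a choice of this sketch, not something stated in this issue.

```python
import os
import struct

CHUNK_SIZE = 64 * 1024          # 64 KB, as in the pseudo-code
MASK_64 = 0xFFFFFFFFFFFFFFFF    # keep the result within 64 bits


def _sum_words(chunk: bytes) -> int:
    """Sum a byte string as unsigned little-endian 64-bit words."""
    count = len(chunk) // 8
    return sum(struct.unpack("<%dQ" % count, chunk[:count * 8]))


def oshash(path: str) -> str:
    """Return a 16-digit hex hash of file size + head and tail checksums."""
    size = os.path.getsize(path)
    if size < CHUNK_SIZE:
        # Assumption of this sketch: small files are rejected outright.
        raise ValueError("file is smaller than 64 KB")
    with open(path, "rb") as f:
        head = f.read(CHUNK_SIZE)
        f.seek(-CHUNK_SIZE, os.SEEK_END)  # last 64 KB of the file
        tail = f.read(CHUNK_SIZE)
    total = (size + _sum_words(head) + _sum_words(tail)) & MASK_64
    return "%016x" % total
```

Only 128 KB is ever read regardless of file size, which is where the speed comes from. For a 128 KB file of zero bytes both checksums are 0, so the hash is just the size, `0000000000020000`.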
More code (including C code) at https://trac.opensubtitles.org/projects/opensubtitles/wiki/HashSourceCodes
@rhash, here is a similar algorithm for reference:
- https://github.com/kalafut/imohash
- https://github.com/kalafut/py-imohash/
Another vote for both oshash and imohash.