
Updated tika to use sha1 hash instead of md5 for checksum

Open griffin-rickle opened this issue 2 years ago • 2 comments

SHA-1 has been deprecated in FIPS and there are suggested steps to move away from the algorithm, but it is still supported for "Non-digital-signature applications" (CTRL-F for SHA-1 to find the supported usages table in the PDF). The Maven repository supplying the Tika JAR only provides MD5 and SHA-1 checksums, or an .asc file for verifying the signature of the file (which is not the same as verifying the checksum), so I have implemented an update to use SHA-1 instead of MD5.
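For readers following along, here is a minimal sketch of the kind of check being described: hash the downloaded JAR and compare it with the published checksum. It is not the actual tika-python code. The names `file_digest` and `checksum_matches` are hypothetical, and the `algorithm` parameter is only an assumption about how a caller might still opt into MD5.

```python
import hashlib


def file_digest(path, algorithm="sha1", chunk_size=65536):
    """Return the hex digest of the file at `path` (hypothetical helper)."""
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # Read in chunks so a large JAR is never loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def checksum_matches(path, expected_text, algorithm="sha1"):
    """Compare a file's digest with the contents of a checksum file.

    `expected_text` is the raw text of the published checksum file. Only the
    first whitespace-separated token is used, so surrounding whitespace or a
    trailing file name is tolerated (an assumption about the file layout).
    """
    tokens = expected_text.split()
    if not tokens:
        return False
    return file_digest(path, algorithm) == tokens[0].lower()
```

Calling `checksum_matches(jar_path, sha1_text)` uses SHA-1 by default, while passing `algorithm="md5"` keeps the old behaviour available.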

griffin-rickle avatar Jul 21 '23 15:07 griffin-rickle

Thank you for this, @griffin-rickle, let me take a look. Is there a way to do this in a backward-compatible way (see my comments on #348)?

chrismattmann avatar Jul 25 '23 16:07 chrismattmann

> Thank you for this, @griffin-rickle, let me take a look. Is there a way to do this in a backward-compatible way (see my comments on #348)?

Not sure if you saw my comment on the original Issue thread, but this has been done. Let me know if there's anything else I could do!

griffin-rickle avatar Aug 09 '23 15:08 griffin-rickle