iscc-specs
ISCC: International Standard Content Code
Bumps [py](https://github.com/pytest-dev/py) from 1.9.0 to 1.11.0. Changelog Sourced from py's changelog. 1.11.0 (2021-11-04) Support Python 3.11 Support NO_COLOR environment variable Update vendored apipkg: 1.5 => 2.0 1.10.0 (2020-12-12) Fix a...
Bumps [babel](https://github.com/python-babel/babel) from 2.8.0 to 2.9.1. Release notes Sourced from babel's releases. Version 2.9.1 Bugfixes The internal locale-data loading functions now validate the name of the locale file to be...
We could extract printable strings (with different encodings) from all kinds of binary data like executables or custom binary formats with https://github.com/getreu/stringsext ... and create a text similarity signature. The...
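A minimal sketch of the string-extraction step, assuming plain Python regular expressions rather than stringsext itself. The function name `extract_strings` and the `min_len` parameter are hypothetical, and only printable ASCII and UTF-16LE runs are covered here:

```python
import re

def extract_strings(data, min_len=4):
    """Return printable runs found in binary data (ASCII first, then UTF-16LE)."""
    # Runs of printable ASCII bytes (space through tilde).
    ascii_runs = re.findall(rb"[\x20-\x7e]{%d,}" % min_len, data)
    # Runs of printable ASCII characters stored as UTF-16LE (char byte + NUL byte).
    utf16_runs = re.findall(rb"(?:[\x20-\x7e]\x00){%d,}" % min_len, data)
    return ([run.decode("ascii") for run in ascii_runs]
            + [run.decode("utf-16-le") for run in utf16_runs])

blob = b"\x00\x01hello world\xff\xfe" + "wide".encode("utf-16-le") + b"\x01"
print(extract_strings(blob))
```

The extracted strings could then be joined and fed into whatever text similarity signature is chosen; that step is not shown here.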
BLAKE3 turns out to be the ideal cryptographic hash for the Instance-ID. As stated by its developers, BLAKE3 is: - Much faster than MD5, SHA-1, SHA-2, SHA-3, and BLAKE2 (~10x...
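A rough sketch of how a data stream might be hashed incrementally for an Instance-ID. BLAKE3 is not in the Python standard library, so this example uses `hashlib.blake2b` as a stand-in; the hasher is passed in as a factory so that a BLAKE3 implementation with a hashlib-style `update()`/`digest()` interface could be swapped in. The name `instance_digest` and the 32-byte digest size are assumptions, not part of the specification:

```python
import hashlib
import io

def instance_digest(stream, new_hasher=lambda: hashlib.blake2b(digest_size=32),
                    chunk_size=64 * 1024):
    """Hash a binary stream in fixed-size reads and return the raw digest."""
    hasher = new_hasher()  # stand-in for BLAKE3 (assumption, see lead-in)
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        hasher.update(chunk)
    return hasher.digest()

digest = instance_digest(io.BytesIO(b"example file content"))
print(digest.hex())
```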
[alert: typical newbie issue: :-) ] [short: define the content by data and not by code] I think there are several issues with the content ID defined by code. -...
https://iscc.codes/specification/#base58-iscc https://tools.ietf.org/id/draft-msporny-base58-01.html https://w3c.github.io/did-core/#bib-base58 Most people think of "base58" as "base58btc", the variant used by Bitcoin. It would be excellent to clearly communicate which version of base58 is being used.
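To illustrate why the variant matters, here is a small sketch that encodes the same bytes with two widely used base58 alphabets: the Bitcoin one ("base58btc") and the Flickr one. The function `b58encode` is a hypothetical helper written for this example, not the ISCC reference implementation, and the leading-zero handling follows the Bitcoin convention for both alphabets as a simplifying assumption:

```python
BASE58_BTC = "123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz"
BASE58_FLICKR = "123456789abcdefghijkmnopqrstuvwxyzABCDEFGHJKLMNPQRSTUVWXYZ"

def b58encode(data, alphabet=BASE58_BTC):
    """Encode bytes as base58 using the given 58-symbol alphabet."""
    num = int.from_bytes(data, "big")
    digits = ""
    while num > 0:
        num, rem = divmod(num, 58)
        digits = alphabet[rem] + digits
    # Each leading zero byte is represented by the alphabet's first symbol.
    pad = len(data) - len(data.lstrip(b"\x00"))
    return alphabet[0] * pad + digits

payload = bytes([57])
print(b58encode(payload, BASE58_BTC))     # z
print(b58encode(payload, BASE58_FLICKR))  # Z
```

The two alphabets contain the same symbols in a different order, so a decoder that assumes the wrong variant will accept the string and silently return different bytes.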
Rationale: less ambiguity and trouble when using ISCC codes as file names on case-insensitive file systems. #90
Use-Case: A user has a small chunk of text and wants to find longer texts that contain this chunk or a similar chunk. Proposed solution draft: Apply shift-invariant text-chunking (for...
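One possible reading of shift-invariant chunking is content-defined chunking: a boundary is placed wherever a hash of the last few characters matches a fixed pattern, so boundaries depend only on local content and not on the absolute position in the text. The sketch below is an illustration under that assumption; `cdc_chunks`, the window size, and the mask are hypothetical choices, not the proposal's actual parameters:

```python
import hashlib

def cdc_chunks(text, window=4, mask=0x3):
    """Split text at positions where the hash of the trailing window hits the mask."""
    chunks = []
    start = 0
    for end in range(window, len(text) + 1):
        gram = text[end - window:end].encode("utf-8")
        h = int.from_bytes(hashlib.blake2b(gram, digest_size=4).digest(), "big")
        if h & mask == 0:  # boundary decided by local content only
            chunks.append(text[start:end])
            start = end
    if start < len(text):
        chunks.append(text[start:])  # trailing remainder
    return chunks

small = "the quick brown fox jumps over the lazy dog"
large = "Once upon a time, " + small
shared = set(cdc_chunks(small)) & set(cdc_chunks(large))
print(sorted(shared))
```

Because inserting text in front of the small chunk only disturbs the chunks up to the first boundary inside it, every later chunk reappears unchanged in the longer text and could be indexed for lookup.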
The algorithms for the Content-IDs make use of more detailed media-type specific fingerprints. Currently these fingerprints are discarded as soon as the Content-ID is calculated. For use cases where a...