Michał Martyniak
Michał Martyniak
Part two of: https://github.com/Unstructured-IO/unstructured/pull/2842 Main changes compared to part one: * hash computation includes element's position in a sequence of all elements * there are more test for deterministic behavior...
This PR attempts to fix a memory issue, which resulted in errors like this: https://github.com/Unstructured-IO/unstructured/issues/2931 The root cause seems to be in how ListItems are being combined, not in how...
There was a bug in evaluation.py that caused extensions of certain files to be detected improperly. Evaluation files are expected to have two extensions, e.g. `foobar.pdf.json` because they were partitioned...