aleph
aleph copied to clipboard
Search and browse documents and data; find the people and companies you look for.
There are two tests currently marked as `@skip` in the `tests` dir: - [test_olm.py](https://github.com/alephdata/ingest-file/blob/main/tests/test_olm.py) - [test_djvu.py](https://github.com/alephdata/ingest-file/blob/main/tests/test_djvu.py) Both fail. The root cause for the failure should be investigated. Ideally, all tests...
We're ingesting some files and we're getting an alert in our monitorization system regarding a high number of context switching from the ingestors processes. I know it's a hard issue...
See https://github.com/alephdata/ingest-file/pull/511
While importing an e-mail-archive in the (IMHO cursed) .PST-format, I came across a mailbox having all `application/rtf` for body type. ``` Content-Type: application/rtf Content-Transfer-Encoding: base64 Content-Disposition: attachment; filename*=utf-8''rtf-body.rtf; filename="rtf-body.rtf" ```...
Bumping this version leads to import errors. To investigate whether it's solvable by simply updating imports / changing method calls. _Originally posted by @catileptic in https://github.com/alephdata/ingest-file/issues/539#issuecomment-1884813285_
As reported in https://github.com/alephdata/aleph/issues/3897: mail files sometimes end up being recognized as either `text/html` or `text/plain`. This happens for example when ingesting .pst files: their outgoing mail messages don't have...
This is a bit difficult to reproduce and I have tried debugging and gotten nowhere. Periodically, some documents that are deep within a directory hierarchy will appear, as copies of...
You have to give Microsoft credit for its consistency: instead of storing E-Mail messages in Outlook as RFC822 plain text, they came up with their own super funky file format...
It seems like we fail to parse files which are created in Excel with write-protection, even though they are readable without a password in the app. There has to be...
Bumps [rollup](https://github.com/rollup/rollup) from 2.78.1 to 2.79.2. Changelog Sourced from rollup's changelog. rollup changelog 4.24.0 2024-10-02 Features Support preserving and transpiling JSX syntax (#5668) Pull Requests #5668: Introduce JSX support (@lukastaegert,...