Christian Stefanescu

Results 39 issues of Christian Stefanescu

**Describe the bug** It is not immediately obvious that aleph and ingest-file workers are using "number of cpus" threads, which can cause scaling issues as seen in numerous reports, the...

bug
docs
Major

**Describe the bug** Previewing a password protected .xlsx file is not possible and the message `Format not supported` could be more specific **To Reproduce** Steps to reproduce the behavior: 1....

bug
good first issue
Low

**Describe the bug** PDF files without an extension get correctly ingested and their text can be seen, but no preview is rendered. **To Reproduce** Steps to reproduce the behavior: 1....

bug
Low

**Is your feature request related to a problem? Please describe.** The latest version of authlib (1.21.1) tested in `3.15.1-rc1` doesn't quite work with OAuth configured because of breaking changes in...

feature-request
Moderate

**Describe the bug** On some datasets it's impossible to view crossreferencing results. **To Reproduce** Steps to reproduce the behavior: 1. Go to https://aleph.occrp.org/datasets/7098#mode=xref 2. See error: ```We found 6864 results,...

bug
Major
data-desk

This adds HEIC/HEIF support for alephdata/aleph#3918 using a pillow plugin. Todos: - [ ] find a better test image and check OCR as well - [ ] figure out previewing...

Our current retry logic for converting documents (shelling out to LibreOffice) is based on two constants: the number of retry attempts and the timeout https://github.com/alephdata/ingest-file/blob/fca65fbb08ff37d65df3c14804ad5b1b6809b97d/ingestors/support/convert.py#L16-L17 What would be more desirable...

improvement

See https://github.com/alephdata/ingest-file/pull/511