airflow-pdf2embeddings

NLP tool for scraping text from a corpus of PDF files, embedding the sentences in that text, and finding the sentences most semantically similar to a given search query.
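The final step in that description, ranking sentences by similarity to a query, can be illustrated with a minimal sketch. It makes no claim about how this project is implemented: the `embed` function is a toy bag-of-words stand-in for a real sentence-embedding model (so it only captures word overlap, not meaning), and the function names and the sample `corpus` are hypothetical. Only the cosine-similarity ranking is the general technique.

```python
import math
import re
from collections import Counter


def embed(sentence):
    """Toy stand-in for a sentence embedding: a bag-of-words count vector.

    Assumption: a real pipeline would call a trained embedding model here.
    """
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, sentences, top_k=3):
    """Return the top_k (score, sentence) pairs ranked by similarity to the query."""
    query_vec = embed(query)
    scored = [(cosine_similarity(query_vec, embed(s)), s) for s in sentences]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Hypothetical sentences standing in for text scraped from PDFs.
corpus = [
    "The court granted the appeal.",
    "Prison population figures rose last year.",
    "The appeal was dismissed by the court.",
]

results = most_similar("court appeal", corpus, top_k=2)
for score, sentence in results:
    print(f"{score:.3f}  {sentence}")
```

Swapping `embed` for a model that returns dense vectors would leave the ranking logic unchanged, apart from computing the dot product and norms over arrays instead of counters.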

Results: 3 airflow-pdf2embeddings issues

Bumps [numpy](https://github.com/numpy/numpy) from 1.18.2 to 1.22.0. Release notes Sourced from numpy's releases. v1.22.0 NumPy 1.22.0 Release Notes NumPy 1.22.0 is a big release featuring the work of 153 contributors spread...

dependencies

The dependencies have some conflicts, starting with Scipy 1.4.1 and Pyarrow 0.16.0. The NLTK version also seems to be old enough to have security vulnerabilities. Both of these should be...

For anyone at the Ministry of Justice trying to install this on the Analytical Platform: by default you can't. Allen NLP uses `jsonnet`, which itself uses some C binaries that...