Results 54 issues of Daniel Ecer

Not sure if you want to accept it. But at least for the two notebooks it is using the newer TF API, it is using Python 3 and relative paths....

### Which version of floccus are you using? 5.0.12 ### How many bookmarks do you have, roughly? 0 ### Are you using other means to sync bookmarks in parallel to...

bug
native-app

It's great to be able to specify a docker image which makes it quite simple and portable. With the docker approach to separate each part in a separate container, I...

Thank you for providing the package. The following import is failing: ``` from apache_beam.io import fileio ``` You are only using `fileio` for the CompressionType. The correct import would be:...

## Subtasks - TODO ## Assumptions and notes - There is now a ScienceBeam "Autocut" (apologies for the name) model that cleans up the title (e.g. removes the `Title:` prefix)...

This is somewhat related to #48 There are some documents where each character ends up as a separate token. For example: [DOI: 10.1101/019794](https://doi.org/10.1101/019794) Extracted text might then look like: ```text...

Hi, Grobid appears to be quite powerful. eLife Sciences may have some kryptonite in the form of a test PDF file with various formatting options: https://github.com/elifesciences/XML-mapping/blob/master/elife-00666.pdf When I let Grobid...

bug
error cases
models:fulltext
models:segmentation

After having re-trained the `fulltext` and `figure` GROBID models, I noticed the extracted figure description / caption may sometimes only contain the first line despite the (DL) model correctly tagging...

enhancement
figures-tables

The header model (and possibly other models?), is calculating relative font sizes. For that it is first determining the smallest, largest and average font size for the tokens within the...

bug
models:header