Daniel Ecer issues

Results: 54 issues of Daniel Ecer

use new tf api; use python 3; use relative paths

Not sure if you want to accept it, but at least the two notebooks now use the newer TF API, Python 3 and relative paths...

Blank screen for Android app v5.0.12 (F-Droid)

### Which version of floccus are you using?
5.0.12
### How many bookmarks do you have, roughly?
0
### Are you using other means to sync bookmarks in parallel to...

bug

native-app

Docker in docker support

It's great to be able to specify a docker image which makes it quite simple and portable. With the docker approach to separate each part in a separate container, I...

ImportError with current Apache Beam version (2.x)

Thank you for providing the package. The following import is failing:
```
from apache_beam.io import fileio
```
You are only using `fileio` for the CompressionType. The correct import would be:...

Improve Title extraction by using ScienceBeam Autocut Model

## Subtasks
- TODO
## Assumptions and notes
- There is now a ScienceBeam "Autocut" (apologies for the name) model that cleans up the title (e.g. removes the `Title:` prefix)...

Space between most of the character for some documents

This is somewhat related to #48. There are some documents where each character ends up as a separate token. For example: [DOI: 10.1101/019794](https://doi.org/10.1101/019794) Extracted text might then look like: ```text...

Paragraph detection issue and missing text

Hi, Grobid appears to be quite powerful. eLife Sciences may have some kryptonite in the form of a test PDF file with various formatting options: https://github.com/elifesciences/XML-mapping/blob/master/elife-00666.pdf When I let Grobid...

bug

error cases

models:fulltext

models:segmentation

Upgraded Python to 3.9

Only first line of figure description extracted if distance between lines deemed too large

After having re-trained the `fulltext` and `figure` GROBID models, I noticed the extracted figure description / caption may sometimes only contain the first line despite the (DL) model correctly tagging...

enhancement

figures-tables

Header model, relative font size includes spaces with a zero font size

The header model (and possibly other models?) calculates relative font sizes. For that, it first determines the smallest, largest and average font size for the tokens within the...

bug

models:header