Aazhar

Results 23 issues of Aazhar

Hello, we end up with two authors associated with the same ORCID when processing this document: https://mdpi-res.com/d_attachment/foods/foods-11-00274/article_deploy/foods-11-00274-v2.pdf In this case, the DOI was extracted, so I think we should...

bug
implemented

Hello, when using consolidation and the DOI is not present in the document, we proceed with fuzzy matching. The problem is that while this should be very helpful in most...

consolidation

With some publishers' formats, extracting ORCIDs from PDF annotations is not reliable. In this case, if the document is consolidated, prefer the Crossref metadata for the authors' ORCIDs.
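
The preference rule above could be sketched as follows. This is an illustration in Python, not GROBID's actual code: the `Author` class, the `merge_orcids` function, and matching authors by case-insensitive surname are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Author:
    # Hypothetical minimal author record, for illustration only.
    surname: str
    orcid: Optional[str] = None


def merge_orcids(pdf_authors: List[Author],
                 crossref_authors: Optional[List[Author]]) -> List[Author]:
    """Return the PDF authors, taking ORCIDs from Crossref when consolidated.

    Assumed rule: if consolidation produced a Crossref record
    (crossref_authors is not None), ORCIDs read from PDF annotations are
    discarded. Each author gets the Crossref ORCID of the author with the
    same surname, or None if Crossref has no ORCID for that surname.
    """
    if crossref_authors is None:
        # No consolidation: keep whatever the PDF annotations gave us.
        return list(pdf_authors)

    # Assumes surnames are unique within one document.
    by_surname = {a.surname.casefold(): a.orcid for a in crossref_authors}
    return [
        Author(a.surname, by_surname.get(a.surname.casefold()))
        for a in pdf_authors
    ]
```

Dropping the annotation ORCID when Crossref has none is a deliberate choice in this sketch: it avoids the case where one annotation is attached to two authors, at the cost of losing some correct values.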

kermitt2/pdf2xml/issues/5

enhancement

https://github.com/kermitt2/grobid-quantities/tree/master/resources/dataset/original/pdf/problematic_files

This is the text from the PDF: ![Image Pasted at 2019-3-26 11-31](https://user-images.githubusercontent.com/9571357/54984256-c2a4e100-4fae-11e9-9c2e-0873a059ee63.png) and this is the result from the text: > the 2010Hawaii Ironman Triathlon consisting of 3.8 km...

bug

![Image Pasted at 2019-3-26 11-34](https://user-images.githubusercontent.com/9571357/54984395-131c3e80-4faf-11e9-889a-8cd6a6f34cb7.png) And the text `appears to be maintained until approximately 35 40 years -of age, followed by modest decreases until 50 years of age`, you can...

bug

Some requests have come in asking for the possibility to output characters along with their respective attributes (width, height, fonts, ...).

enhancement

ContentMine maintains a list of known problematic fonts and maps their characters to the correct Unicode (e.g. l -> λ).

enhancement
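
A per-font character remapping like the one described in this issue could look roughly like the sketch below. The table name `FONT_FIXES`, the font name `ExampleSymbolFont`, and the function `fix_text` are hypothetical. Only the `l -> λ` pair comes from the issue text, and nothing here is copied from the ContentMine list.

```python
from typing import Dict

# Hypothetical table: font name -> {extracted character -> intended character}.
# The font name is made up; the single entry mirrors the "l -> λ" example.
FONT_FIXES: Dict[str, Dict[str, str]] = {
    "ExampleSymbolFont": {"l": "\u03bb"},  # l -> λ
}


def fix_text(text: str, font: str) -> str:
    """Remap the characters of `text` if `font` is a known problematic font.

    Text set in any other font is returned unchanged.
    """
    mapping = FONT_FIXES.get(font)
    if not mapping:
        return text
    # str.maketrans accepts a dict whose keys are single characters.
    return text.translate(str.maketrans(mapping))
```

Keying the table by font name keeps the fix local: an `l` in a regular text font is left alone, and only text known to come from a listed font is remapped.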

This is a suggestion from the user @dlaurie: omit linebreaks except when they would be significant (pdftohtml -xml did that), and elide unnecessary attributes, i.e. rotation=0, angle=0.

suggestion
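
The attribute elision part of this suggestion can be illustrated as a post-processing step over the XML output. This is a sketch under stated assumptions, not pdf2xml code: the `DEFAULTS` table and the function `strip_default_attributes` are hypothetical, and it treats `rotation="0"` and `angle="0"` as the only values safe to drop.

```python
import xml.etree.ElementTree as ET

# Assumed defaults: an attribute holding exactly this value carries no
# information and can be dropped from the output.
DEFAULTS = {"rotation": "0", "angle": "0"}


def strip_default_attributes(xml_text: str) -> str:
    """Remove attributes equal to their assumed default from every element."""
    root = ET.fromstring(xml_text)
    for elem in root.iter():
        for name, default in DEFAULTS.items():
            if elem.attrib.get(name) == default:
                del elem.attrib[name]
    return ET.tostring(root, encoding="unicode")
```

A consumer of the reduced output would then need to read a missing `rotation` or `angle` as 0, so the default has to be documented alongside the format.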