Patrice Lopez

Results 389 comments of Patrice Lopez

Implemented with bb4b3591e51a084063791b6454f26faddfaf575b

I observe a runtime divided by 2 as compared to the complete full text processing. However, processing header alone is 50% faster. So this is the following typical scale for...

Hello, indeed, everywhere the funding-acknowledgement parser is applies, the sentence segmentation is ignored. The reason is that it would require to take into account the (numerous) annotations produced by this...

Hello @robertsilen Sorry for the late answer. The local endpoint is http://localhost:8090/service/ So if you indicate: ```python nlp.add_pipe("entityfishing", config={"api_ef_base": "http://localhost:8090/service/"}) ``` it should work.

Hello! I integrated ELMo in my Keras framework, you can have a look here: https://github.com/kermitt2/delft In Keras, I think the simplest way to integrate "dynamic" embeddings is to use `fit_generator`...

@jacklxc We normally don't want to update the word embeddings when using pre-trained embeddings to avoid overfitting and to reduce the size of the model. What would be the particular...

Hello @AaronNGray ! GROBID does not perform OCR, it's considered a bit out of scope (though it's debatable) and we prefer to let the user exploit its OCR of choice...

Just for reference, your second and third articles already contain a text layer in the PDF. But the quality of the OCR of the second 195609-.pdf is so low that...

Ah yes you're right the reference list of Chomsky_1959.pdf are very bad (I just look at the beginning), the reference section is badly segmented from the body. Well, actually the...