Patrice Lopez comments

Results 389 comments of


Patrice Lopez

endpoint to process fundings

Implemented with bb4b3591e51a084063791b6454f26faddfaf575b

endpoint to process fundings

I observe a runtime divided by 2 as compared to the complete full text processing. However, processing header alone is 50% faster. So this is the following typical scale for...

Funding, acknowledgement statements are not split into sentences

Hello, indeed, everywhere the funding-acknowledgement parser is applies, the sentence segmentation is ignored. The reason is that it would require to take into account the (numerous) annotations produced by this...

No Swedish results? What API endpoint address with local docker image?

Hello @robertsilen Sorry for the late answer. The local endpoint is http://localhost:8090/service/ So if you indicate: ```python nlp.add_pipe("entityfishing", config={"api_ef_base": "http://localhost:8090/service/"}) ``` it should work.

Is there a way to put Elmo as a Keras layer and integrate it into a Keras model?

Hello! I integrated ELMo in my Keras framework, you can have a look here: https://github.com/kermitt2/delft In Keras, I think the simplest way to integrate "dynamic" embeddings is to use `fit_generator`...

Is there a way to put Elmo as a Keras layer and integrate it into a Keras model?

@jacklxc We normally don't want to update the word embeddings when using pre-trained embeddings to avoid overfitting and to reduce the size of the model. What would be the particular...

Is Grobid able to OCR papers ?

Hello @AaronNGray ! GROBID does not perform OCR, it's considered a bit out of scope (though it's debatable) and we prefer to let the user exploit its OCR of choice...

Is Grobid able to OCR papers ?

Just for reference, your second and third articles already contain a text layer in the PDF. But the quality of the OCR of the second 195609-.pdf is so low that...

Is Grobid able to OCR papers ?

Ah yes you're right the reference list of Chomsky_1959.pdf are very bad (I just look at the beginning), the reference section is badly segmented from the body. Well, actually the...