PyPaperBot icon indicating copy to clipboard operation
PyPaperBot copied to clipboard

Multithreading/parallelize?

Open Niemand112233 opened this issue 3 years ago • 3 comments

Hi, Your software works great, but it is a little bit slow when searching for queries on google scholar. Is it possible to parallelize for example the search on the single pages?

Niemand112233 avatar Aug 27 '20 18:08 Niemand112233

Hi @Niemand112233, Yes for the next version (as soon as I have time or new collaborators) I'll try to introduce multithreading(even though it's not the best thing of python) at least during the downloads so that the tool will continue searching while downloading papers.

There is also a random timeout 1-10sec after each request to Crossref (When you see on the prompt "Searching paper x of y on Crossref.."). I've put it as I think that it helps to not get blocked by crossref for API abuse. On the next version, I think I'll put this timeout optional

ferru97 avatar Aug 27 '20 20:08 ferru97

I guess mutlithreading can be introduced by adding crossref query and scihub downloads alternatively

suhan-paradkar avatar May 08 '21 06:05 suhan-paradkar

Yes of course, even if the gain is not maximum

ferru97 avatar May 09 '21 19:05 ferru97