pygetpapers icon indicating copy to clipboard operation
pygetpapers copied to clipboard

EPMC: Building corpus using the existing metadata JSON

Open ShweataNHegde opened this issue 3 years ago • 0 comments

Given I have a eupmc_results.json, can't I build the corpus from scratch? I tried doing it using:

pygetpapers -o tomato --restart -x

I had the JSON file within the tomato folder. But all I get is empty PMC folders. Here's a portion of the tree:

C:.
│   eupmc_results.json
│
├───PMC3193516
├───PMC3466413
├───PMC3790869
├───PMC4032488
├───PMC4364678
├───PMC4375501
├───PMC4445982
├───PMC4464248
...

I think this is an useful functionality to add, if it doesn't exist.

ShweataNHegde avatar Sep 09 '21 10:09 ShweataNHegde