crossref icon indicating copy to clipboard operation
crossref copied to clipboard

2019-09 IA Bulk Dump Update

Open bnewbold opened this issue 4 years ago • 2 comments

I ran another bulk dump using the scripts in this repository. The scrape started on 2019-09-09 and ended around 2019-10-05, yielding 107,151,607 DOIs. The xz compressed file is 46 GBytes.

Available at: https://archive.org/details/crossref_doi_dump_201909

SHA-256:

338e01b613f34624a3408f781fecf746e7fc1ce6e4636186e57775d18d2a6ebc  crossref-works.2019-09-09.json.xz

Updating the README with a link might make sense, as this repository is the top hit for "Crossref Metadata Bulk Dump" (at least with my search filters).

I would encourage any future folks using these dumps to switch over to Daniel Ecer's dumps posted to figshare more frequently: https://figshare.com/articles/Crossref_Works_Dump_-_August_2019/9751865

bnewbold avatar Oct 06 '19 18:10 bnewbold

@bnewbold thanks for the update. Happy to hear about your 2019-09 dump as well as Ecer's dumps that hopefully will regularly get uploaded to figshare.

Updating the README with a link might make sense, as this repository is the top hit for "Crossref Metadata Bulk Dump" (at least with my search filters).

Good idea. PR would be appreciated to do this, but if not, will do when I next get around to it. We could make the reference to the latest archived dumps more prominent as well in the README.

dhimmel avatar Oct 06 '19 18:10 dhimmel

I ran another bulk dump using the scripts in this repository. The scrape started on 2019-09-09 and ended around 2019-10-05, yielding 107,151,607 DOIs. The xz compressed file is 46 GBytes.

Available at: https://archive.org/details/crossref_doi_dump_201909

SHA-256:

338e01b613f34624a3408f781fecf746e7fc1ce6e4636186e57775d18d2a6ebc  crossref-works.2019-09-09.json.xz

Updating the README with a link might make sense, as this repository is the top hit for "Crossref Metadata Bulk Dump" (at least with my search filters).

I would encourage any future folks using these dumps to switch over to Daniel Ecer's dumps posted to figshare more frequently: https://figshare.com/articles/Crossref_Works_Dump_-_August_2019/9751865

Appreciate for great work if possible month to month updates easier once full database to download.

omeletteinc avatar Dec 15 '20 23:12 omeletteinc