
Fetching .xml-dump fails

Open AlexTichy opened this issue 6 years ago • 0 comments

I'm running the Wikidata Toolkit examples unchanged in online mode, except that I set DumpProcessingMode to CURRENT_REVS (every DumpProcessingMode other than JSON produces the same result). Although several online dumps are found, the program fails to download any of them. This is the console output I get:


*** Wikidata Toolkit: GreatestNumberProcessor


*** This program will download and process dumps from Wikidata.
*** It will scan the dump to find the item with the greatest value
*** for property P1113.
*** See source code for further details.


2019-07-05 17:03:13 INFO - Using download directory C:\my...\dumpfiles\wikidatawiki
2019-07-05 17:03:13 INFO - Found 0 local dumps of type FULL: []
2019-07-05 17:03:14 INFO - Found 7 online dumps of type FULL: [wikidatawiki-full-20190701, wikidatawiki-full-20190620, wikidatawiki-full-20190601, wikidatawiki-full-20190520, wikidatawiki-full-20190501, wikidatawiki-full-20190420, wikidatawiki-full-20190401]
2019-07-05 17:03:16 WARN - Could not find any dump of type FULL.
2019-07-05 17:03:16 INFO - Finished processing.
2019-07-05 17:03:16 INFO - Processed 0 entities in 0 sec
Found 0 matching items after scanning 0 items.
No number with a specified value found yet.

The error seems to be in the fetchIsDone() method of WmfOnlineStandardDumpFile. It reads through the md5 checksum file of each dump and looks for a line ending in "-pages-meta-history.xml.bz2". However, no line with this ending exists in those files, so none of the dumps found online is treated as available.
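To make the suspected failure mode concrete, here is a minimal sketch of that kind of check. It is not the toolkit's actual code: the class `ChecksumLineCheck`, the method `hasLineEndingWith`, and the sample listing are all hypothetical, and the file names in the listing are made up for illustration rather than copied from a real checksum file.

```java
public class ChecksumLineCheck {

    // Hypothetical helper, not part of Wikidata Toolkit:
    // returns true if any line of the checksum listing ends with the postfix.
    static boolean hasLineEndingWith(String checksumListing, String postfix) {
        for (String line : checksumListing.split("\\R")) {
            if (line.trim().endsWith(postfix)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Made-up listing in the usual "<md5>  <file name>" layout.
        String listing =
                "00000000000000000000000000000000  example-pages-meta-history1.xml.bz2\n"
              + "11111111111111111111111111111111  example-pages-meta-current.xml.bz2\n";

        // The exact postfix quoted in this issue is absent from the made-up listing.
        System.out.println(hasLineEndingWith(listing, "-pages-meta-history.xml.bz2")); // false

        // A postfix that does occur in the made-up listing is found.
        System.out.println(hasLineEndingWith(listing, "-pages-meta-current.xml.bz2")); // true
    }
}
```

If the real check works along these lines, a postfix that never occurs in the checksum file would make every dump look unfinished, which would match the "Could not find any dump" warning above.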

AlexTichy, Jul 05 '19 15:07