Hicham EL BOUKKOURI

Results: 1 issue by Hicham EL BOUKKOURI

Hi, I am getting UnicodeDecodeErrors when I try to extract a decompressed .xml dump. For the record, this is how I am using the WikiExtractor: `WikiExtractor.py wikicorpus_en.xml -b 100M --processes...
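
One way to narrow down an error like this is to check whether the dump itself contains bytes that are not valid UTF-8 before running the extractor on it. The sketch below is only a diagnostic aid, not part of WikiExtractor: the function name `find_invalid_utf8` and the `limit` parameter are made up for illustration, and it assumes the dump is supposed to be UTF-8 encoded. It reads the file in binary mode line by line, which is safe because a newline byte never occurs inside a multi-byte UTF-8 sequence.

```python
def find_invalid_utf8(path, limit=5):
    """Return up to `limit` tuples (line_number, byte_offset, bad_bytes).

    Hypothetical helper: reports the first undecodable span on each
    offending line, assuming the file is meant to be UTF-8.
    """
    bad = []
    offset = 0  # byte offset of the start of the current line
    with open(path, "rb") as f:
        for lineno, raw in enumerate(f, start=1):
            try:
                raw.decode("utf-8")
            except UnicodeDecodeError as err:
                # err.start / err.end are positions within this line's bytes
                bad.append((lineno, offset + err.start, raw[err.start:err.end]))
                if len(bad) >= limit:
                    break
            offset += len(raw)
    return bad


if __name__ == "__main__":
    import sys

    for lineno, byte_offset, bad_bytes in find_invalid_utf8(sys.argv[1]):
        print(f"line {lineno}, byte {byte_offset}: {bad_bytes!r}")
```

If this reports nothing, the file is valid UTF-8 and the error more likely comes from how the file is being opened (for example, a platform default encoding other than UTF-8) than from the dump's contents.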