Matteo Pagliardini
Same here. Would love to find a way to fix that.
Thx for sharing! I tried the same thing, it works well to get the text with numbers, but I also get all the references mixed with the text. I'm still...
This would only remove a tiny fraction of references. Also, I'm not sure what you want to do with your wiki corpus, but in the eventuality that you'd like to...
So in the end, did you manage to recover the missing numbers? I personally went with @DavidNemeskey's solution: I get the HTML files from the ZIM archive using...
Thanks for the work! I'll just need to test that quickly and I'll get back to you.
Hello, you might want to check your gcc version.
Could this fasttext issue be relevant? https://github.com/facebookresearch/fastText/issues/386#issuecomment-354189736
I don't have a Mac to test this, but maybe try adding this code in `setup.py`: ```python def has_flag(compiler, flags): """Return a boolean indicating whether a flag name is...
Thanks for the PR! I'll do some tests and come back to you hopefully by next Monday.
I pulled your code and had the same issue on Ubuntu. Any luck on your side?