Adam Nohejl
Adam Nohejl
When I try to set up webDAV, I get this: > Error connecting to webDAV server > Error setting up webdav, message: > com.thegrizzlylabs.sardineandroid.impl.SardineException: Error contacting http://[domain]/zotero (401 Authorization Required)...
Followed the readme, but I end up with the following error when I run check.py. (I have no experience with huggingface, but I tried to create an account and log...
Many elements/tags appear in wikiextractor's output, such as `poem`, `q`, `ins`, `del`, `br`, `section`, `onlyinclude`, `includeonly`, `math` or mathematical equations (with commands such as `\mathbf`) not enclosed in any tags....
Several elements with non-textual content such as maps and musical scores (elements `mapframe` and `score`) are not filtered out. Steps to reproduce: 1. Download this dump: `https://dumps.wikimedia.org/enwiki/20221020/enwiki-20221020-pages-articles1.xml-p1p41242.bz2` 2. Invoke the...
Due to a bug, the only way to turn off the --html-safe command line option is passing an empty argument (that evaluates as false in Python) like this: `wikiextractor --html-safe...
Many tables (or parts of them) are still in the output. Steps to reproduce: 1. Download this dump: `https://dumps.wikimedia.org/jawiki/20221020/jawiki-20221020-pages-articles1.xml-p1p114794.bz2` 2. Invoke the following command to list lines that contain the...
I tried to install the current version of CyHunspell (1.3.4). `pip install cyhunspell` first fails with "package hunspell was not found in the pkg-config search path". When I add it...
**Describe the bug** When POS tagging a specific string in Spanish a RuntimeError is **reproducibly** thrown without any apparent reason. **To Reproduce** Steps to reproduce the behavior: 1. Run the...