Allen Lin

Results 13 issues of Allen Lin

The MediaWiki parser that is currently used (`de.tudarmstadt.ukp.wikipedia.parser`) is no longer maintained. [See here](http://mvnrepository.com/artifact/de.tudarmstadt.ukp.wikipedia/de.tudarmstadt.ukp.wikipedia.parser/1.0.0). It's suggested to use [Sweble](http://sweble.org/projects/swc/swc-example-basic/) instead. I raised this since I noticed some bugs in the...

For some SR training dataset, the file names are wrong. For example, **atlasify195zh6.csv** should be **tsv** since it's tab separated. This should be changed both in the `wikibrain-sr/src/main/resources/datasets` and `wikibrain-sr/src/main/resources/datasets/info.tsv`...

It seems that command line argument of the pipeline loader does not support customized timestamp of the wikipedia dump. Currently, it will just import the latest dump file.

Solve #260. Simple PR to get the ball rolling. Tested on Allen's local machine. Let me know if there are other automated tests.

When configure that in GUI or the customized.conf, the username and password for PostGIS does not kick in. This is because the path of the configuration of PostGIS does not...

I am not an expert in the multi-languages encoding but this files seems to be corrupted by the encoding. https://github.com/shilad/wikibrain/blob/master/wikibrain-core/src/main/resources/language_info.tsv

In [here](https://github.com/shilad/wikibrain/blob/master/wikibrain-parser/src/main/java/org/wikibrain/parser/wiki/WikiTextParser.java#L135), it seems that using `getCanonicalTitle()` is better than `toString()` since the `getCanonicalTitle()` returns the title without language code affix.

All other stages in org.wikibrain.download.loader are not triggered when selecting Postgres as the data source in the GUILoader. (Not sure if this is solely related to GUILoader).

I am parsing the Spanish Wikipedia XML dumps using WikiTextParser and getting the following error. At the ends, there are 4000+ IndexOutofBounds errors. Another weird thing about Spanish Wikipedia parsing,...

**Environment:** GUIloader simple English h2 DB **check Wikidata** The following error occurred when downloading Wikidata. `0.34 Part#189(0.86) Part#190(0.62) Part#191(0.49) 0.35 Part#194(0.60) Part#195(0.48) Part#196(0.18) 0.35 Part#197(0.56) Part#198(0.80) Part#199(0.80) 0.36 Part#202(0.88) Part#203(0.32)...

bug