wikt2dict
Wiktionary parser tool for many language editions.
Thanks for sharing this! I installed it successfully and ran: w2d.py download en ta. It downloaded the English bz2 file and also created the enwiktionary.txt file. However, the Tamil Wiktionary was...
In translation_pairs, I was getting many other languages in addition to the ta and en that I wanted. So, I deleted all the languages other than ta from the English...
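As an alternative to editing the configuration, the extracted pairs could be filtered after the fact. The following is a minimal sketch, not part of wikt2dict: it assumes each line of the output is tab-separated with the first language code in column 1 and the second in column 3. That layout and the name filter_pairs are assumptions for illustration, so check them against your own output file first.

```python
def filter_pairs(lines, wanted=frozenset({"en", "ta"})):
    """Yield only the lines whose two language codes are both in `wanted`.

    Assumed (hypothetical) column layout, tab-separated:
    code1, word1, code2, word2, ...
    """
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 4:
            continue  # skip malformed or empty rows
        code1, code2 = fields[0], fields[2]
        if code1 in wanted and code2 in wanted and code1 != code2:
            yield line


# Small in-memory sample standing in for the real output file.
sample = [
    "en\tdog\tta\tநாய்\n",
    "en\tdog\tde\tHund\n",
    "ta\tநாய்\ten\tdog\n",
]
kept = list(filter_pairs(sample))
```

With a real file, the same generator can be fed an open file handle instead of the sample list, so the whole file never has to be held in memory.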
The Chinese config should be changed to use the section-level parser. Example: https://zh.wiktionary.org/wiki/%D1%81%D0%BC%D0%BE%D1%82%D1%80%D0%B5%D1%82%D1%8C
The langnames type parser (the one that needs the languages' names in a given language) ignores the list of wikicodes specified and instead extracts all languages it has a name...
At most three source files are read for each triangle: the ones extracted from the three Wiktionaries belonging to the triangle's three languages. There could be other translations...
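To make the idea of a triangle concrete, here is a rough sketch of pivot-based triangulation over a pooled set of translation pairs. It is not the tool's actual implementation. The function name triangulate and the ((language, word), (language, word)) tuple representation are assumptions chosen for illustration. Because it works on one pooled list, pairs from any edition could be included, not only those from three files.

```python
from collections import defaultdict


def triangulate(pairs, src, pivot, tgt):
    """Infer candidate src-tgt pairs that share a word in the pivot language.

    `pairs` is an iterable of ((lang, word), (lang, word)) tuples.
    """
    # Build an undirected graph: each (lang, word) node maps to its translations.
    neighbours = defaultdict(set)
    for a, b in pairs:
        neighbours[a].add(b)
        neighbours[b].add(a)

    candidates = set()
    for node, linked in neighbours.items():
        if node[0] != pivot:
            continue  # only pivot-language words can close a triangle
        sources = {w for w in linked if w[0] == src}
        targets = {w for w in linked if w[0] == tgt}
        for s in sources:
            for t in targets:
                candidates.add((s, t))
    return candidates


pairs = [
    (("en", "dog"), ("de", "Hund")),
    (("de", "Hund"), ("hu", "kutya")),
    (("en", "cat"), ("de", "Katze")),
]
result = triangulate(pairs, "en", "de", "hu")
```

Here "dog" and "kutya" are both linked to "Hund", so they form the only candidate; "cat" has no Hungarian counterpart through "Katze" and yields nothing.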