wikiparsec
wikiparsec copied to clipboard
Feature request: Add gender to the information extracted from the German wiktionary dump
Each article title for nouns has information on the gender of the corresponding noun. It would be very helpful to have them extracted as well.
In addition to the title, gender information is also stored in the inflection table. The corresponding section is explicitly skipped in the parsing code (https://github.com/LuminosoInsight/wikiparsec/blob/a7bfe4c668da1ff610ca5852f5efa08f845438ce/Text/MediaWiki/Wiktionary/German.lhs#L281).