
Feature request: Add gender to the information extracted from the German wiktionary dump

Open · wrznr opened this issue 5 years ago • 1 comment

The article title for each noun includes the grammatical gender of that noun. It would be very helpful to have this information extracted as well.
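To illustrate what is being requested, here is a minimal Python sketch (not wikiparsec code, which is written in Haskell). It assumes that a noun heading in the German Wiktionary looks like `=== {{Wortart|Substantiv|Deutsch}}, {{f}} ===`, with the gender given as a short template made of the letters `m`, `f`, and `n`. The function name `genders_from_heading` and that heading layout are assumptions for illustration only.

```python
import re
from typing import List

# Assumed heading shape on de.wiktionary (an illustration, not taken from wikiparsec):
#   === {{Wortart|Substantiv|Deutsch}}, {{f}} ===
# A gender template is assumed to contain only the letters m, f, n (e.g. {{m}}, {{mn}}).
GENDER_TEMPLATE = re.compile(r"\{\{\s*([mfn]{1,3})\s*\}\}")
NOUN_MARKER = "{{Wortart|Substantiv|"


def genders_from_heading(heading: str) -> List[str]:
    """Return the gender codes ('m', 'f', 'n') found in a noun heading.

    Returns an empty list when the heading is not a noun heading.
    """
    if NOUN_MARKER not in heading:
        return []
    genders: List[str] = []
    for match in GENDER_TEMPLATE.finditer(heading):
        # A combined template such as {{mn}} contributes one code per letter.
        for code in match.group(1):
            if code not in genders:
                genders.append(code)
    return genders
```

For example, `genders_from_heading("=== {{Wortart|Substantiv|Deutsch}}, {{f}} ===")` yields a list containing only `"f"`, while a verb heading yields an empty list.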

wrznr, Apr 09 '19 09:04

Besides the title, gender information is also stored in the inflection table. The parsing code explicitly skips the corresponding section (https://github.com/LuminosoInsight/wikiparsec/blob/a7bfe4c668da1ff610ca5852f5efa08f845438ce/Text/MediaWiki/Wiktionary/German.lhs#L281).
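As a rough idea of what reading the inflection table could look like, here is a small Python sketch (again not wikiparsec code). It assumes the table is a `{{Deutsch Substantiv Übersicht ...}}` template whose gender is given in a `|Genus=` parameter, optionally numbered as `|Genus 1=`, `|Genus 2=` for nouns with more than one gender, and that each parameter sits on its own line. The function name `genders_from_inflection_table` is hypothetical.

```python
import re
from typing import List

# Assumed parameter shape inside the inflection table template:
#   |Genus=m          or          |Genus 1=m  /  |Genus 2=n
# Only the values m, f, n are handled; other values are ignored in this sketch.
GENUS_PARAM = re.compile(
    r"^[ \t]*\|[ \t]*Genus(?:[ \t]+\d+)?[ \t]*=[ \t]*([mfn])[ \t]*$",
    re.MULTILINE,
)


def genders_from_inflection_table(wikitext: str) -> List[str]:
    """Return the gender codes listed in Genus parameters, in order of appearance."""
    return GENUS_PARAM.findall(wikitext)
```

Given a table containing the line `|Genus=m`, the function returns a list containing only `"m"`; text without any `Genus` parameter gives an empty list.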

wrznr, Apr 09 '19 09:04