jmviz
jmviz
Great, thanks for your work. I did notice that the "cite" variations were redirects. You can see some [here](https://en.wiktionary.org/w/index.php?title=Special%3AWhatLinksHere%2FTemplate%3Acite-book&hidelinks=1&hidetrans=1) and [here](https://en.wiktionary.org/w/index.php?title=Special%3AWhatLinksHere%2FTemplate%3Acite-journal&hidelinks=1&hidetrans=1). Probably the simplest way to cover the redirects would...
@alexchandel In case you're interested in running wiktextract locally to get this information, I have a [fork](https://github.com/jmviz/wiktextract/tree/descendants) that outputs basic data for `Descendants` and PIE `Derived terms`/`Extensions` sections. It outputs...
I looked into this. The page [version](https://en.wiktionary.org/w/index.php?title=Reconstruction:Proto-Indo-European/(s)ker-&oldid=68495619) you linked has an idiosyncratic structure, where the etymology is placed below the level of the part of speech. Usually, by [wiktionary's entry...
The easiest way would be to download the compressed raw data ([here](https://kaikki.org/dictionary/raw-wiktextract-data.json.gz)) and process that directly. You should only need about ~1.5GB free space on your machine. Then you can...
Sure, I sent it. Like I said, there will be various stuff in there (affixes, phrases, abbreviations, inflections, etc.) that you may or may not want.
> For some words there are more than 1 etymology and some of the etymology sections are empty in this case one cannot group correctly under the etymology, one won't...
Found some more information on this. I noticed a lot of DEBUG messages of the form: ```DEBUG: heuristically added missing } to template arg ! at ['term', 'template']``` Where `template`...
Looked into this more. Previously I was looking at the debug log for a local partial extraction. For completion's sake, I went through the latest `wiktextract-error-data.json` from the kaikki.org raw...
I also checked to see when else that `}` imputation gets called: ``` 17567 "msg": "heuristically added missing } to template arg !", 11536 "msg": "heuristically added missing } to...
The romanization appears to be intact in `forms`: ```json "forms": [ { "form": "kǐao", "tags": [ "romanization" ] } ], ```