wiktextract
Please use .jsonl suffix
The format you are using (which is a perfectly good choice) is commonly known as JSON Lines (https://jsonlines.org/).
Since a file in this format is technically not a single valid JSON document, it would be most appropriate to use the .jsonl suffix instead, so that people and programs can handle it properly.
Using .jsonl on the kaikki.org downloads would be helpful. I didn't see anywhere in the codebase to change this, so it must be set somewhere in Tatu's CI setup.
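
To illustrate why the suffix matters to programs, here is a minimal Python sketch. The two sample records and the `read_jsonl` helper are made up for illustration and are not actual wiktextract output or API. It shows that line-by-line parsing works, while treating the same text as one JSON document fails:

```python
import json


def read_jsonl(lines):
    """Yield one parsed object per non-empty line (hypothetical helper)."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


# Hypothetical two-record sample, one JSON object per line.
sample = '{"word": "cat", "pos": "noun"}\n{"word": "run", "pos": "verb"}\n'

# JSON Lines handling: parse each line on its own.
records = list(read_jsonl(sample.splitlines()))

# Treating the whole text as a single JSON document raises an error,
# because a second top-level value follows the first.
try:
    json.loads(sample)
    whole_file_ok = True
except json.JSONDecodeError:
    whole_file_ok = False
```

A tool that picks its parser from the file extension would likely take the failing path for a .json file, which is the practical argument for .jsonl.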
See issue #603 for the upcoming change. Rejoice, .jsonl enthusiasts!