penelope
penelope copied to clipboard
support `sametypesequence` for stardict
The sametypesequence
stanza in the ifo file defines, which syntax is used in the definition block. Currently, penelope
seems to allow only for m
, which treats whatever as plain text. The following types are known to stardict
- 'm' Word's pure text meaning. The data should be a utf-8 string ending with '\0'.
- 'g' A utf-8 string which is marked up with the Pango text markup language.
- 't' English phonetic string. The data should be a utf-8 string ending with '\0'.
- 'x' A utf-8 string which is marked up with the xdxf language.
- 'y' Chinese YinBiao or Japanese KANA. The data should be a utf-8 string ending with '\0'.
- 'k' KingSoft PowerWord's data. The data is a utf-8 string ending with '\0'.
- 'w' MediaWiki markup language.
- 'h' Html codes.
- 'n' WordNet data.
- 'r' Resource file list.
- 'X' this type identifier is reserved for experimental extensions.
cf. StarDict on Google Code for details.
Unfortunately, I no longer have time to develop or maintain Penelope: https://github.com/pettarin/penelope#important-updates
Added the text-like, UTF-8 encoded formats in v3.0.0: https://github.com/pettarin/penelope/blob/master/penelope/format_stardict.py#L32
The contents are not parsed, though.
In view of https://github.com/pettarin/penelope#important-update I leave the issue open, but I do not plan to address it.