penelope icon indicating copy to clipboard operation
penelope copied to clipboard

support `sametypesequence` for stardict

Open aw-bib opened this issue 9 years ago • 3 comments

The sametypesequence stanza in the ifo file defines, which syntax is used in the definition block. Currently, penelope seems to allow only for m, which treats whatever as plain text. The following types are known to stardict

  • 'm' Word's pure text meaning. The data should be a utf-8 string ending with '\0'.
  • 'g' A utf-8 string which is marked up with the Pango text markup language.
  • 't' English phonetic string. The data should be a utf-8 string ending with '\0'.
  • 'x' A utf-8 string which is marked up with the xdxf language.
  • 'y' Chinese YinBiao or Japanese KANA. The data should be a utf-8 string ending with '\0'.
  • 'k' KingSoft PowerWord's data. The data is a utf-8 string ending with '\0'.
  • 'w' MediaWiki markup language.
  • 'h' Html codes.
  • 'n' WordNet data.
  • 'r' Resource file list.
  • 'X' this type identifier is reserved for experimental extensions.

cf. StarDict on Google Code for details.

aw-bib avatar Nov 09 '15 19:11 aw-bib

Unfortunately, I no longer have time to develop or maintain Penelope: https://github.com/pettarin/penelope#important-updates

pettarin avatar Nov 13 '15 12:11 pettarin

Added the text-like, UTF-8 encoded formats in v3.0.0: https://github.com/pettarin/penelope/blob/master/penelope/format_stardict.py#L32

The contents are not parsed, though.

pettarin avatar Nov 22 '15 21:11 pettarin

In view of https://github.com/pettarin/penelope#important-update I leave the issue open, but I do not plan to address it.

pettarin avatar Mar 04 '18 17:03 pettarin