cainteoir-engine issues

Support Hebrew vowel annotations

Hebrew text does not include vowels. There is a system of diacritics (niqqud [1]) that is used to annotate the missing vowels. The vowel annotation logic should occur between the...

rhdunn

roadmap

tts
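
As a rough illustration of the diacritics described in this issue, the sketch below groups each Hebrew base letter with the combining marks that follow it. It is written in Python rather than the engine's own code, and `split_points` is a hypothetical name; the code-point range used (U+0591 to U+05C7) is an assumption that covers Hebrew points and cantillation marks.

```python
import unicodedata


def split_points(text):
    """Group each character with the Hebrew combining marks that follow it.

    Hypothetical helper: returns a list of (base, marks) tuples.
    """
    result = []
    for ch in text:
        # Hebrew points and cantillation marks are nonspacing marks (Mn)
        # in the range U+0591..U+05C7 (assumed range for this sketch).
        is_mark = unicodedata.category(ch) == "Mn" and "\u0591" <= ch <= "\u05c7"
        if is_mark and result:
            base, marks = result[-1]
            result[-1] = (base, marks + ch)
        else:
            result.append((ch, ""))
    return result
```

Text with no marks at all (every `marks` string empty) would be a candidate for the vowel annotation step.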

Implement the eSpeak voice synthesizer

1

The eSpeak text-to-speech program uses a combination of Klatt (see issue #35) parameters, recorded wave audio and spectral parameters. This is all coordinated by generating a sequence of wave commands...

rhdunn

roadmap

tts

implement a proper style model for xml-based documents

At the moment, the rendering model for `cainteoir-engine` with XML-based documents is to: 1. call a document-specific parser; 2. map element names to `xml::context::entry` objects; 3. specify the type (span,...

rhdunn

roadmap

Support reading currencies

Currencies in Unicode have the `Sc` character class. This should be split from the generic `punctuation` event type and put into a `currency` event type. Each currency symbol has a...

rhdunn

roadmap

tts
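
A minimal sketch of the split described in this issue, assuming a classifier that maps one character to an event type name. It is in Python for brevity, not the engine's code; `classify_symbol` and the `"other"` fallback are hypothetical, while `currency` and `punctuation` are the event type names from the issue text.

```python
import unicodedata


def classify_symbol(ch):
    """Return an event type name for a single character (hypothetical helper)."""
    category = unicodedata.category(ch)
    if category == "Sc":
        # Unicode general category Sc: currency symbols.
        return "currency"
    if category.startswith("P"):
        # All P* categories: connector, dash, open, close, other punctuation.
        return "punctuation"
    return "other"
```

With this split, `$`, `€` and `£` are reported as `currency`, while `,` and `!` stay in `punctuation`.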

Implement the NRL Report 7948 letter-to-phoneme algorithm.

1

The NRL Report 7948 describes an algorithm for converting English letter sequences to phonemes. This is implemented in various places (such as rsynth's english.c file). Rsynth also has a ruleset...

rhdunn

roadmap

tts

create a text matcher class based on regular expressions

2

The tests/dictionary.py script has very simple regular expression expansion logic, placing limits on where you can place `[ab]` and `(a|b)` expressions. This is limiting what can be expressed. For example...

rhdunn

roadmap

tts
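
One possible way to lift the placement limits on `[ab]` and `(a|b)` is a small recursive-descent expander that accepts both forms anywhere in the pattern, including nested inside each other. The sketch below is not the logic in `tests/dictionary.py`; the `expand` function and its grammar (literals, `[...]` character sets, `(...|...)` groups) are assumptions made for illustration.

```python
from itertools import product


def expand(pattern):
    """Return every string matched by a pattern built from literals,
    [ab] character sets and (a|b) groups (hypothetical helper)."""
    pos = 0

    def parse_alternation():
        # alternation := sequence ('|' sequence)*
        nonlocal pos
        results = parse_sequence()
        while pos < len(pattern) and pattern[pos] == "|":
            pos += 1
            results += parse_sequence()
        return results

    def parse_sequence():
        # sequence := ('[' chars ']' | '(' alternation ')' | literal)*
        nonlocal pos
        parts = [[""]]
        while pos < len(pattern) and pattern[pos] not in "|)":
            ch = pattern[pos]
            pos += 1
            if ch == "[":
                end = pattern.index("]", pos)  # ValueError if ']' is missing
                parts.append(list(pattern[pos:end]))
                pos = end + 1
            elif ch == "(":
                parts.append(parse_alternation())
                if pos >= len(pattern) or pattern[pos] != ")":
                    raise ValueError("unbalanced '('")
                pos += 1
            else:
                parts.append([ch])
        # Concatenate one choice from each part, in every combination.
        return ["".join(combo) for combo in product(*parts)]

    results = parse_alternation()
    if pos != len(pattern):
        raise ValueError("unbalanced ')'")
    return results
```

For example, `expand("a(b|c[de])f")` yields `abf`, `acdf` and `acef`, a nesting that a position-restricted expander could not express.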

Create an open_memstream compatibility layer

On systems that do not support `open_memstream` a temporary file is created. While this works, it is slower than using the in-memory version. BSD-based systems (including Android and Mac OS)...

rhdunn

roadmap

expose a dictionary API to the tts engines

1

The engines API should have dictionary support that allows: - adding/updating a word to the dictionary; - reloading the dictionary set; - looking up the pronunciation of a given word....

rhdunn

roadmap

tts
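
The three operations listed in this issue could be shaped roughly as below. This is only a Python sketch of the interface, not the engines API itself: the class name, method names and the `loader` callback are all hypothetical, and pronunciations are treated as plain strings.

```python
from typing import Callable, Dict, Optional


class PronunciationDictionary:
    """Hypothetical dictionary interface for a tts engine."""

    def __init__(self, loader: Callable[[], Dict[str, str]]):
        # `loader` stands in for whatever reads the dictionary set from disk.
        self._loader = loader
        self._entries: Dict[str, str] = {}
        self.reload()

    def add_or_update(self, word: str, pronunciation: str) -> None:
        """Add a new word, or replace the pronunciation of an existing one."""
        self._entries[word] = pronunciation

    def reload(self) -> None:
        """Re-read the dictionary set, discarding in-memory changes."""
        self._entries = dict(self._loader())

    def lookup(self, word: str) -> Optional[str]:
        """Return the pronunciation of `word`, or None if it is unknown."""
        return self._entries.get(word)
```

In this sketch `reload` drops any unsaved additions; whether the real API should persist them first is a design question the issue leaves open.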

Make number parsing locale aware

At the moment `tts/context_analysis.cpp` only handles numbers of the form `nnnnn`. That is, it does not handle numbers of the form: ``` `n,nnn,nnn` -- e.g. US numbers `n nnn nnn`...

rhdunn

roadmap

tts
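
To illustrate the grouped forms mentioned in this issue, the sketch below parses an integer whose digits are grouped in threes by a locale-specific separator. It is a Python sketch rather than the logic in `tts/context_analysis.cpp`; `parse_grouped_integer` and its `group_separator` parameter are hypothetical, and only the comma and space forms quoted above are assumed.

```python
import re
from typing import Optional


def parse_grouped_integer(text: str, group_separator: str) -> Optional[int]:
    """Parse `nnnnn` or separator-grouped digits; return None if malformed."""
    sep = re.escape(group_separator)
    # Either 1-3 leading digits followed by groups of exactly 3 digits,
    # or a plain run of digits with no grouping at all.
    pattern = rf"\d{{1,3}}(?:{sep}\d{{3}})*|\d+"
    if re.fullmatch(pattern, text) is None:
        return None
    return int(text.replace(group_separator, ""))
```

A caller would pick `group_separator` from the active locale, for example `","` for US numbers and `" "` for the space-separated form.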

HTML processing should use an HTML to XML parser

Due to HTML quirks, the processing for HTML and XHTML content (including HTML without xmlns, but with an XML processing instruction) should: 1. Use the xmlreader class to read the...

rhdunn

roadmap

reader-api
