Incorrect English syllabification

Open adunning opened this issue 5 years ago • 5 comments

I have noticed that many English words are not being separated correctly:

  1. Visit https://bbloomf.github.io/jgabc/transcriber.html;
  2. With the language set to English, enter a word such as 'alleluia'.

At present this is displayed with two syllables instead of four. In other cases a word is given too many syllables; for example, 'ends' gets two.

adunning, May 08 '20 14:05

Related to this is whether final -ed is given its own syllable (de-liv-er-ed vs. de-liv-ered). I think sounding -ed as a separate syllable is common enough in adapting chant to make it the default, though perhaps it should be optional.
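To make the option concrete, here is a rough sketch of how a toggle for final -ed might work on a word that has already been split. This is not jgabc code: the function name `split_final_ed` and the `sing_ed` flag are made up for illustration, and the vowel/consonant check is only a crude heuristic to keep words like 'need' or 'hundred' from being split.

```python
VOWELS = "aeiouy"  # treating y as a vowel is a simplifying assumption


def split_final_ed(syllables, sing_ed=True):
    """Optionally give a final -ed its own syllable (hypothetical helper)."""
    result = list(syllables)
    if not sing_ed or not result:
        return result
    last = result[-1]
    if not last.lower().endswith("ed"):
        return result
    stem = last[:-2]
    # Only split when the remaining stem can stand as a syllable:
    # it must contain a vowel and end in a consonant.
    has_vowel = any(ch in VOWELS for ch in stem.lower())
    ends_in_consonant = bool(stem) and stem[-1].lower() not in VOWELS
    if has_vowel and ends_in_consonant:
        result[-1:] = [stem, last[-2:]]
    return result


print("-".join(split_final_ed(["de", "liv", "ered"])))
print("-".join(split_final_ed(["de", "liv", "ered"], sing_ed=False)))
```

With `sing_ed=True` as the default, the behaviour suggested above becomes the default while still being easy to switch off.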

adunning, May 08 '20 14:05

I think there are a lot of problems with English syllabification. Beyond the examples you give, many other words are being syllabified incorrectly, but unfortunately I don't have time to fix it right now.

bbloomf, May 08 '20 14:05

Ideally we would just create a database of all the words of the psalter, properly syllabified. I wrote a script that parses English syllables automatically using sed, but it still has a few errors too.
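As a rough illustration of the database idea (not the sed script, and not anything that exists in jgabc), a lookup table with a pluggable fallback could look something like the sketch below. The `SYLLABLES` table, the `syllabify` function and its `fallback` parameter are all hypothetical names, and the three entries are just the words mentioned in this thread.

```python
# Hypothetical hand-checked entries; a real table would cover the whole psalter.
SYLLABLES = {
    "alleluia": ["al", "le", "lu", "ia"],
    "ends": ["ends"],
    "delivered": ["de", "liv", "ered"],
}


def syllabify(word, table=SYLLABLES, fallback=None):
    """Look the word up in the table; otherwise defer to a fallback parser."""
    entry = table.get(word.lower())
    if entry is None:
        # Unknown word: use the rule-based fallback if one was supplied,
        # otherwise return the word unsplit so the gap is easy to spot.
        return fallback(word) if fallback else [word]
    # Re-slice the original word so its capitalisation is preserved.
    parts, start = [], 0
    for syllable in entry:
        parts.append(word[start:start + len(syllable)])
        start += len(syllable)
    return parts


print("-".join(syllabify("Alleluia")))
print("-".join(syllabify("ends")))
```

The point of the fallback argument is that an automatic parser can still handle words missing from the table, while every word that has been checked by hand always wins.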

ftherese, Sep 09 '20 20:09

How does the hyphenator at https://juiciobrennan.com/hyphenator/ do it? It's not absolutely perfect, but reasonably reliable.

Occasionally I have seen errors in the Latin as well – not sure if the version at http://gregorio-project.github.io/hyphen-la/ would improve it.

adunning, Sep 10 '20 20:09

@adunning The Latin hyphenator you linked to is what gets used when "Liturgical Latin" is selected, so if you see any errors with it, please report them there. It looks like there is already quite a list of issues, though, and there hasn't been much activity lately: https://github.com/gregorio-project/hyphen-la/issues

The English hyphenator at juiciobrennan.com uses a dictionary, and it is what jgabc had been using. I'm not sure why it stopped working from the transcriber tool.

bbloomf, Sep 10 '20 20:09