html5-audio-read-along
html5-audio-read-along copied to clipboard
See if WebVTT would be a good fit
Instead of marking up the text with each word, there could be a track file that has words with the time indexes.
http://www.whatwg.org/specs/web-apps/current-work/multipage/the-video-element.html#the-track-element