Splitting
Splitting copied to clipboard
Soft hyphens (­) are output as visible characters
We're character-splitting headlines containing ­
soft hyphens. Those are usually invisible and act as a hyphenation hint for the browser.
This is how Califor­nia Dreaming
is rendered, note the extra glyph (U+F8FF):
data:image/s3,"s3://crabby-images/1fd20/1fd2005968339e4c11ec880806f3c164b791c41e" alt="Screenshot 2022-01-05 at 13 50 47"
data:image/s3,"s3://crabby-images/b6ab2/b6ab2297f1a3f8fee3fd7b0255bb44698a71a153" alt="Screenshot 2022-01-05 at 13 50 39"
Different font, same issue (here ­
is rendered as a different glyph):
data:image/s3,"s3://crabby-images/26fa9/26fa9129f15afd13979e313416f90a4448ed9db9" alt="Screenshot 2022-01-05 at 13 56 13"
Is there a way to ignore the ­
s when processing the DOM?
Edit: This is Chrome/macOS. Seems fine on Firefox.
Simply checking for and stripping the offending unicode character (00AD
) from contents
inside the splitText
function works, but then of course the soft hyphen is thrown out completely as if it was never there.
Hm, the unprocessed ­
would need to be wedged back in between the generated <span class="char">
s..