[protocol spec] Generated PDFs have escape characters for ligatures in text content

Open stanislavkozlovski opened this issue 5 years ago • 4 comments

Greetings!

I was reading through your protocol specification PDF page and wanted to search for Difficulty. I noticed that my browser wouldn't catch all the places where difficulty was mentioned and when inspecting the word, I saw that it had some strange characters:

For example, "dif<U+001B>culty" is what I get when I copy the word from the PDF (notice the unrecognized character where the "fi" should be).

It seems to be the Unicode character U+001B (<control> ESCAPE [ESC]). When I search for dif<U+001B>culty, I find 23 occurrences in the specification document.
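To check copied text for this kind of stray character, something like the following minimal sketch could help. It assumes the text has already been copied or extracted from the PDF into a Python string; the `copied` sample and the `find_control_chars` helper are hypothetical and only illustrate the idea.

```python
import unicodedata


def find_control_chars(text):
    """Return (index, code point) pairs for control characters,
    ignoring ordinary newlines, carriage returns, and tabs."""
    allowed = {"\n", "\r", "\t"}
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(text)
        if unicodedata.category(ch) == "Cc" and ch not in allowed
    ]


# Hypothetical sample standing in for text copied from the PDF:
# the "fi" ligature has been replaced by ESC (U+001B).
copied = "dif\x1bculty adjustment and dif\x1bculty filter"

hits = find_control_chars(copied)
broken_words = copied.count("dif\x1bculty")

print(hits)          # positions and code points of the stray characters
print(broken_words)  # how many times the broken word appears
```

Running the same check on text copied from different viewers would show whether the problem is specific to one reader.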

stanislavkozlovski, Dec 04 '20 05:12

cc @daira

stanislavkozlovski, Dec 04 '20 05:12

This is viewer dependent. It doesn't happen on the PDF viewer I use (Atril 1.20.3 on Debian Buster). Nevertheless, this appears to be a well-known problem and there are some potential fixes here. What viewer are you using and on what platform, so that I can check whether the fix works?

daira, Jan 08 '21 01:01

I am using the built-in PDF reader of Google Chrome (Version 87.0.4280.66 (Official Build) (64-bit)) on Ubuntu 20.04.

stanislavkozlovski, Jan 15 '21 07:01

This works for me on Debian in Atril, Okular, and Firefox's PDF plugin. Please re-test with the latest version of the spec. (In any case, there isn't much else I can do about it if it still doesn't work in the reader you're using, @stanislavkozlovski.)

daira, Dec 20 '23 02:12