WebToEpub icon indicating copy to clipboard operation
WebToEpub copied to clipboard

Sometimes fails to fetch chapters that have URLs with UrlEncoded characters

Open dteviot opened this issue 4 years ago • 1 comments

Have seen this a couple of times, but was in a hurry and did not investigate further. (And now can't remember where I've seen it.)

From memory, happens when one of the chapters of the story has a URL that contains escaped characters. (I think it might need to be UTF-8 chars outside the ASCII range, but not sure). When try to fetch the chapter, site gives a 404 error. But if try to get chapter by directly putting URL into web browser's URL bar, chapter is retrieved.

If anyone sees an occurrence of this, please let me know. So I can investigate properly.

dteviot avatar Oct 30 '20 08:10 dteviot

I thought it was chapter 78 from page https://shirokuns.com/level-up-just-by-eating/. But that is not a UrlEncoding issue. It's an invalid URL. The URL in the ToC for the chapter is

https://shirokuns.com/2020/04/16/level-up-just-by-eating-chapter-78-post-fight-clean-up%e3%83%bbliesel/

The correct URL for chapter is

https://shirokuns.com/2020/04/16/level-up-just-by-eating-chapter-78-post-fight-clean-up%e3%83%bbriesel/

i.e. Correct is "riesel", incorrect is "liesel".

dteviot avatar Nov 02 '20 00:11 dteviot