
Please add site https://po18cs.com

Open • ghost opened this issue 11 months ago • 1 comment

Please note, I'm basically the only developer working on WebToEpub, and I'm not paid for doing this. (WebToEpub is completely free, and generates no money.) By asking to add a site, you're asking me to give you some of my limited free time. So, I think it's not unreasonable for me to ask you to do as much as you can to help me.

Provide URL for web page that contains Table of Contents (list of chapters) of a typical story on the site

Did you try using the Default Parser for the site? If not, why not?

Instructions for using the default parser can be found at https://github.com/dteviot/WebToEpub/wiki/FAQ#how-to-convert-a-new-site-using-the-default-parser

I tried it and it seemed to work, but when packing the EPUB, none of the chapter URLs seem to have been detected.

What settings did you use? What didn't work?

  • URL of first chapter
  • CSS selector for element holding content to put into EPUB: div.panel-body
  • CSS selector for element holding Title of Chapter: h1.readTitle
  • CSS selector for element(s) to remove

If the Default Parser did not work, if you have developer skills, did you try writing a new parser?

Instructions https://github.com/dteviot/WebToEpub/wiki/FAQ#how-to-write-a-new-parser

If you don't have developer skills, can you ask a friend who does have them if they can do it for you?

None

If you tried writing a parser and it doesn't work, attach the parser here.

ghost, Jan 21 '25 07:01

@ghost

At this time, supporting this site is not really feasible.

The Table of Contents pages on the site seem to be broken.
On the following ToC pages, only half the chapters have URLs.

  • https://www.po18cs.com/book/2/
  • https://www.po18cs.com/book/6/

Other notes for the future

  1. Looks like the pages might use GBK for character set encoding
  2. Story content is embedded in the chapter HTML. (So that at least is easy to extract.) The CSS selector for the content is "#htmlContent"; for the chapter title it is ".readTitle"
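
For anyone picking this up later, the two notes above can be sketched roughly as follows. This is not WebToEpub parser code (WebToEpub parsers are JavaScript); it is a minimal standalone Python illustration that assumes the pages really are GBK-encoded and that the "#htmlContent" and ".readTitle" selectors hold. The names `ChapterExtractor` and `extract_chapter` are made up for this example.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"br", "img", "hr", "input", "meta", "link"}


class ChapterExtractor(HTMLParser):
    """Collects the text inside .readTitle and #htmlContent (hypothetical helper)."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title_parts = []
        self.content_parts = []
        self._target = None  # list currently being filled, or None
        self._depth = 0      # nesting depth inside the captured element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            # Keep line breaks inside the story content.
            if tag == "br" and self._target is self.content_parts:
                self._target.append("\n")
            return
        if self._target is not None:
            self._depth += 1
            return
        attr = dict(attrs)
        classes = (attr.get("class") or "").split()
        if attr.get("id") == "htmlContent":
            self._target, self._depth = self.content_parts, 1
        elif "readTitle" in classes:
            self._target, self._depth = self.title_parts, 1

    def handle_endtag(self, tag):
        if self._target is None or tag in VOID_TAGS:
            return
        self._depth -= 1
        if self._depth == 0:
            self._target = None  # left the captured element

    def handle_data(self, data):
        if self._target is not None:
            self._target.append(data)


def extract_chapter(raw: bytes, encoding: str = "gbk"):
    """Decode raw page bytes (GBK is an assumption) and return (title, content)."""
    html = raw.decode(encoding, errors="replace")
    parser = ChapterExtractor()
    parser.feed(html)
    parser.close()
    title = "".join(parser.title_parts).strip()
    content = "".join(parser.content_parts).strip()
    return title, content
```

The function takes the raw response bytes rather than already-decoded text, because decoding GBK bytes as UTF-8 would garble the Chinese text before any selector is applied.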

dteviot, Jan 26 '25 06:01