WebToEpub icon indicating copy to clipboard operation
WebToEpub copied to clipboard

Please add site https://www.wisdomlib.org/

Open xeolod opened this issue 1 year ago • 5 comments

Please note, I'm basically the only developer working on WebToEpub, and I'm not paid for doing this. (WebToEpub is completely free, and generates no money.) By asking to add a site, you're asking me to give you some of my limited free time. So, I think it's not unreasonable for me to ask you to do as much as you can to help me.

Provide URL for web page that contains Table of Contents (list of chapters) of a typical story on the site

  • https://www.wisdomlib.org/buddhism/book/the-great-chronicle-of-buddhas

Did you try using the Default Parser for the site? If not, why not?

  • Yes, but couldn't make it work.
  • Error thrown - Fetch of URL '.ils_364343' failed with network error NetworkError when attempting to fetch resource..

What settings did you use? What didn't work?

  • URL of first chapter - .ils_364343
  • CSS selector for element holding content to put into EPUB - #scontent
  • CSS selector for element holding Title of Chapter - .h2
  • CSS selector for element(s) to remove - None

If the Default Parser did not work, if you have developer skills, did you try writing a new parser?

  • None.

If you don't have developer skills, can you ask a friend who does have them if they can do it for you?

  • None.

If you tried writing a parser, and it doesn't work. Attach the parser here.

xeolod avatar Feb 15 '24 09:02 xeolod

@xeolod

I think you got the "URL of first chapter" wrong. ".ils_364343" does not look like a URL.

I suggest the following:

URL of first chapter - https://www.wisdomlib.org/buddhism/book/the-great-chronicle-of-buddhas/d/doc364348.html CSS selector for element holding content to put into EPUB - #pageContent CSS selector for element holding Title of Chapter - None CSS selector for element(s) to remove - None

For my notes: 6 minutes work

dteviot avatar Feb 17 '24 01:02 dteviot

Sorry, but I mistakenly used css selector as URL for first chapter These are the correct values.

  • URL of first chapter - https://www.wisdomlib.org/buddhism/book/the-great-chronicle-of-buddhas/d/doc364343.html
  • CSS selector for element holding content to put into EPUB - #scontent
  • CSS selector for element holding Title of Chapter - .h2

If used #pageContent instead of #scontent, there are lot of unneeded elements and other unexpected things, like cover image in every page, so its ideal to use #scontent.


CSS selector for element(s) to remove - .footnotes

Footnotes

  • Please provide an option for this just like for webnovel to remove footnotes.
  • Css Selector - .footnotes

footnotes

xeolod avatar Feb 18 '24 05:02 xeolod

Also, the sub content is not downloaded.

https://github.com/dteviot/WebToEpub/assets/129925515/58da7c39-8b23-49ff-a2ee-a38ae88d187a

xeolod avatar Feb 18 '24 05:02 xeolod

Also, the sub content is not downloaded.

Maybe flattening the TOC while downloading would solve the issue.

xeolod avatar Feb 19 '24 03:02 xeolod

Hello just wanted to know if this is ongoing as the label is completed, but the issue is not resolved.

xeolod avatar Feb 26 '24 08:02 xeolod