rss-bridge icon indicating copy to clipboard operation
rss-bridge copied to clipboard

Bridge request for webtoons.com

Open svew opened this issue 1 year ago • 2 comments

Bridge request

Create new bridge for webtoons.com

General information

This is the same request as #915. That request was closed because technically a feed for webtoons.com already exists for each of their comics, however, the feed acts as little more than a notification that a new release has occurred because it barely shows any content of the new release, and to read the release you still have to navigate to their website. A more complete RSS feed would include all images of the comic for the reader to read.

Here's an example of the content given by webtoons.com's RSS feed for the webcomic Wind Breaker as shown on Feedly:

image

And here's an example of the issue. This only shows a single cut off panel of the comic before being given a link to the full content on their website:

image

  • Host URI for the bridge_ (i.e. https://github.com):

https://www.webtoons.com

  • Which information would you like to see?

The latest 3-however many releases for a specific web comic. Example URL for a webcomic release: https://www.webtoons.com/en/sports/wind-breaker/part-4-ep12/viewer?title_no=372&episode_no=390

Each release representing a specific article. For each release, data should contain all the images for the given release of the web comic. Each release is comprised of a series of images (anywhere from 10 to ~100?).

  • How should the information be displayed/formatted?

Each release is only made up of a list of images, tightly grouped next to each other to make the comic appear seamless/pageless. This is important because any margins will break up panels of the comic.

image

  • Which of the following parameters do you expect?

    • [X] Title
    • [X] URI (link to the original article)
    • [X] Author
    • [ ] Timestamp
    • [X] Content (the content of the article)
    • [X] Enclosures (pictures, videos, etc...)
    • [ ] Categories (categories, tags, etc...)

Options

  • [X] Limit number of returned items
    • Default limit: 5
  • [ ] Load full articles
    • Cache articles (articles are stored in a local cache on first request): yes
    • Cache timeout (max = 24 hours): 24 hours
  • [X] Balance requests (RSS-Bridge uses cached versions to reduce bandwith usage)
    • Timeout (default = 5 minutes, max = 24 hours): 5 minutes

svew avatar Aug 06 '22 21:08 svew

They have strict referrer checking: https://techdocs.akamai.com/property-mgr/docs/referrer-checking

I don't think this is feasible. Would have to proxy out for each image and spoof referrer.

$ curl -i -H 'referer: AAAAAA_pstatic.net' "https://webtoon-phinf.pstatic.net/20220722_20/16584544051656aMeq_JPEG/16584544051573723909.jpg?type=q90"
HTTP/2 403 
server: AkamaiGHost
mime-version: 1.0
content-type: text/html
content-length: 364
expires: Sat, 06 Aug 2022 23:53:22 GMT
date: Sat, 06 Aug 2022 23:53:22 GMT
access-control-expose-headers: Content-Length, Server, Cache-Control, Date, Accept-Ranges
access-control-allow-origin: *

<HTML><HEAD>
<TITLE>Referral Denied</TITLE>
</HEAD><BODY>
<H1>Referral Denied</H1>
You don't have permission to access *SNIP*

See also https://github.com/RSS-Bridge/rss-bridge/issues/2365

dvikan avatar Aug 06 '22 23:08 dvikan

Not so familiar with referral header restrictions, but I just tried a similar thing and had successful results (I'm using powershell):

> $Headers = @{
    referer = 'pstatic.net'
}
> Invoke-WebRequest -H $Headers -Uri "https://webtoon-phinf.pstatic.net/20220722_28/1658454392188oUIUC_JPEG/16584543921813723907.jpg?type=q90" -OutFile "C:\Users\...\Desktop\photo.jpg"

w/o -OutFile:

StatusCode        : 200
StatusDescription : OK
Content           : {255, 216, 255, 224...}
RawContent        : HTTP/1.1 200 OK
                    Connection: keep-alive
                    Access-Control-Expose-Headers: Content-Length, Server, Cache-Control, Date, Accept-Ranges
                    Access-Control-Allow-Origin: *
                    Accept-Ranges: bytes
                    Content-Length...
Headers           : {[Connection, keep-alive], [Access-Control-Expose-Headers, Content-Length, Server, Cache-Control, Date, Accept-Ranges],
                    [Access-Control-Allow-Origin, *], [Accept-Ranges, bytes]...}
RawContentLength  : 117026

This outputted a valid and correct .jpg file for me

svew avatar Aug 08 '22 02:08 svew

I would give another "no" but for another reason:

https://www.webtoons.com/en/action/the-god-of-high-school/ep-553/viewer?title_no=66&episode_no=556

That single issue has like 80 pictures in it and that was just a random selection, there could be comics with way more. Thats really not in focus for an rss feed. So I vote for a close :)

Bockiii avatar Aug 12 '22 13:08 Bockiii

This might not be what the rss feed was intended for when it was developed, but I also don't see there being any technical limitation for it, or a reason to veto because it because it's an irregular use. This whole project is an irregular use of RSS :)

At the same time, I just recently discovered that Tachiyomi exists, so this problem is already solved elsewhere with better support anyway, so therefore I'm going to close this. Thanks for the feedback/work all

svew avatar Aug 12 '22 21:08 svew