RSSHub icon indicating copy to clipboard operation
RSSHub copied to clipboard

The Initium 端传媒 router fails to fetch full text article behind paywall

Open mintyfrankie opened this issue 9 months ago • 8 comments

Routes

/theinitium/channel/:type?/:language?

Full routes

/theinitium/channel/latest/zh-hans

Related documentation

https://docs.rsshub.app/routes/new-media#duan-chuan-mei-zhuan-ti-lan-mu

What is expected?

With default ENV variable or personal variable like INITIUM_BEARER_TOKEN, the router should be able to fetch full-text article from the source.

What is actually happening?

Some articles failed to be fetched in full-text mode, showing for example :

CleanShot 2023-09-25 at 14 28 18

Deployment information

Self-hosted

Deployment information (for self-hosted)

No response

Additional info

No relative logs found regarding this issue.

This is not a duplicated issue

  • [X] I have searched existing issues to ensure this bug has not already been reported

mintyfrankie avatar Sep 25 '23 12:09 mintyfrankie

Searching for maintainers:
  • /theinitium/channel/:type?/:language?: @prnake

To maintainers: if you are not willing to be disturbed, list your username in scripts/workflow/test-issue/call-maintainer.js. In this way, your username will be wrapped in an inline code block when tagged so you will not be notified.

如果所有路由都无法匹配,issue 将会被自动关闭。如果 issue 和路由无关,请使用 NOROUTE 关键词,或者留下评论。我们会重新审核。 If all routes can not be found, the issue will be closed automatically. Please use NOROUTE for a route-irrelevant issue or leave a comment if it is a mistake.

github-actions[bot] avatar Sep 25 '23 12:09 github-actions[bot]

Maybe the initium changed some api, I’ll check this later

prnake avatar Sep 25 '23 16:09 prnake

Thanks! Happy to help if you need some more information to trace down the issue.

mintyfrankie avatar Sep 25 '23 18:09 mintyfrankie

The same issue. Please help to check.

shyrz avatar Oct 21 '23 12:10 shyrz

I think it's a known issue that The Initium disables access to the full text behind paywall via a fake (or real) search engine bot UA started from August, which means only paid accounts can read the full text. And I tested the full text accessing via configured password or token from a paid account works fine now.

prnake avatar Oct 22 '23 16:10 prnake

Hi @prnake,

Just want to give a quick update on the issue; withe either INITIUM_BEARER_TOKEN or env variables on login and password it seems to work fine now; however the INITIUM_IAP_RECEIPT doesn't work anymore.

I sniffed the iOS application and found that it uses a standard Authorization header, with its value as token xxxxx rather than Bearer xxxx. Maybe it can be interesting to update the script, as from my testings this authorization method can keep alive much longer than bearer token.

I'd be happy to contribute and raise a PR if you want to, or you can patch a quick fix right away.

mintyfrankie avatar Feb 16 '24 09:02 mintyfrankie

Also:

I think it's a known issue that The Initium disables access to the full text behind paywall via a fake (or real) search engine bot UA started from August, which means only paid accounts can read the full text. And I tested the full text accessing via configured password or token from a paid account works fine now.

Perhaps then we don't need to fake a search bot anymore, as it cannot bypass the paywall. Can be interesting to do a cleanup on the script to streamline the logic.

mintyfrankie avatar Feb 16 '24 09:02 mintyfrankie

Hi @prnake,

Just want to give a quick update on the issue; withe either INITIUM_BEARER_TOKEN or env variables on login and password it seems to work fine now; however the INITIUM_IAP_RECEIPT doesn't work anymore.

I sniffed the iOS application and found that it uses a standard Authorization header, with its value as token xxxxx rather than Bearer xxxx. Maybe it can be interesting to update the script, as from my testings this authorization method can keep alive much longer than bearer token.

I'd be happy to contribute and raise a PR if you want to, or you can patch a quick fix right away.

It’s great if you would like to PR for the IOS login method. And although the current fake UA is not working, it does not cause side effects, so I think it’s not necessary to replace it.

prnake avatar Feb 21 '24 09:02 prnake