rendertron icon indicating copy to clipboard operation
rendertron copied to clipboard

Question: What is the reason of cutting pathname in rendertron which is used in <base> as a page default URL

Open vcernomschi opened this issue 3 years ago • 1 comments

Could you please help to understand why rendertron cuts page URL which is used in tag as a page default URL ?

Reference: https://github.com/GoogleChrome/rendertron/blob/main/src/renderer.ts#L226

I ams asking as it leads to invalid behaviour when crawler tries to check empty link defined as <a href="#">Empty link URL</a>.

For example, potentially for the following URL: "https://developer.mozilla.org/en-US/docs/Web/API/URL" it will add the following base tag (which in my specific case leads to "Not Found" resource):

<base href="https://developer.mozilla.org/en-US/docs/Web/API">

Just wondering if it is only applied for use case when the pathname ends with slash "/".

vcernomschi avatar Oct 21 '21 09:10 vcernomschi

I'm having the same issue.

chimame avatar Nov 05 '21 07:11 chimame

In general, as we're deprecating the project, you should look into alternative approaches to rendering on the web.

AVGP avatar Oct 06 '22 07:10 AVGP