rendertron
rendertron copied to clipboard
Question: What is the reason of cutting pathname in rendertron which is used in <base> as a page default URL
Could you please help to understand why rendertron cuts page URL which is used in
Reference: https://github.com/GoogleChrome/rendertron/blob/main/src/renderer.ts#L226
I ams asking as it leads to invalid behaviour when crawler tries to check empty link defined as <a href="#">Empty link URL</a>
.
For example, potentially for the following URL: "https://developer.mozilla.org/en-US/docs/Web/API/URL" it will add the following base tag (which in my specific case leads to "Not Found" resource):
<base href="https://developer.mozilla.org/en-US/docs/Web/API">
Just wondering if it is only applied for use case when the pathname ends with slash "/".
I'm having the same issue.
In general, as we're deprecating the project, you should look into alternative approaches to rendering on the web.