Pages with different URLs and no rel=canonical tags are treated as identical
Steps to reproduce
- Open an article on getpocket.com
- Annotate it
- Navigate to another article and annotate it.
Expected behaviour
- annotation extension (more likely the server, see below) should detect the URL & title correctly
- as a result, annotations are split across multiple pages
- as a result, no annotations get orphaned
Actual behaviour
On step 3, the URL in the navbar & the title tag are changed by the Pocket JS code. But the problem persists even if you reload the page manually.
I couldn't intercept the extension's AJAX requests via the Network tab; I might try with Wireshark later.
Here is the interesting part: the extension seems to send the right URL to the server, but the server groups the annotations under a single URL & title anyway:
I suspected a rel=alternate or rel=canonical tag in the page source but didn't find any. There was nothing suspicious in the HTTP headers either, so perhaps Pocket does some DOM manipulation after loading that I can't reverse engineer.
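For anyone who wants to repeat the check on the page source, here is a minimal sketch using only the Python standard library. The helper name `find_link_rels` is hypothetical, and it only inspects the static HTML passed to it, so it would not see tags injected by JavaScript after load (those need to be checked in the live DOM via the browser's dev tools).

```python
from html.parser import HTMLParser


class LinkRelCollector(HTMLParser):
    """Collect (rel, href) pairs for <link> tags with rel=canonical or rel=alternate."""

    def __init__(self):
        super().__init__()
        self.found = []  # list of (rel, href) tuples, in document order

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        # rel may hold several space-separated tokens, e.g. rel="alternate nofollow"
        for rel in (attr.get("rel") or "").lower().split():
            if rel in ("canonical", "alternate"):
                self.found.append((rel, attr.get("href")))


def find_link_rels(html):
    """Return every canonical/alternate link declared in the given HTML string."""
    parser = LinkRelCollector()
    parser.feed(html)
    return parser.found
```

An empty result for the saved source of two different articles would be consistent with what is described above: no canonical hint in the markup that could explain the grouping.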
Browser/system information
macOS / Chrome / uBlock Origin
I can reproduce the same bug on this website: http://sens-public.org/?lang=fr
Indeed, there is no rel=canonical tag in the metadata of the pages on this domain.
All my private and public annotations are packed (as orphan annotations) into a single page of the domain (I'm not sure whether it is the last one I annotated or a random one from the same domain).
Here is the interesting part: the extension seems to send the right URL to the server but it groups it under a single URL & title anyway.
Indeed. However, the "Visit the annotation in context" button is correct and allows me to open the right page and retrieve all annotations on that page, which gives me hope.
The error occurs:
- from the Hypothes.is user page:
  - the same page of this domain is listed several times instead of the different pages of the domain
  - the title and URL are wrong
  - the "Visit annotations in context" link is correct
- from the domain itself, when launching Hypothes.is with the bookmarklet:
  - from the recurring page, Hypothes.is retrieves all the annotations I made on the domain
  - from the other pages I annotated, Hypothes.is doesn't retrieve anything.
Working with Ubuntu and Firefox.
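To see which URL the server actually stores for each annotation, one option is to look at the search results from the Hypothesis API and group them by URI. The sketch below is only an illustration: it assumes the public search endpoint at `https://api.hypothes.is/api/search` with a `uri` query parameter, and it assumes each returned row is a dict with `id` and `uri` keys. The helper names `search_url` and `group_by_uri` are hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlencode

# Assumed endpoint of the public Hypothesis search API.
API_SEARCH = "https://api.hypothes.is/api/search"


def search_url(page_url, limit=50):
    """Build a search URL asking for annotations made on page_url."""
    return API_SEARCH + "?" + urlencode({"uri": page_url, "limit": limit})


def group_by_uri(rows):
    """Map each stored URI to the ids of the annotations filed under it.

    `rows` is assumed to be the list of annotation dicts from a search response.
    """
    groups = defaultdict(list)
    for row in rows:
        groups[row.get("uri")].append(row.get("id"))
    return dict(groups)
```

If annotations made on several different pages all end up under one key in the result of `group_by_uri`, that would point to the grouping happening on the server side rather than in the client.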
Reviving this thread because I've recently come across this issue on certain websites:
When creating annotations or page notes on one paper, the annotation also shows up on a completely unrelated paper from the same domain (with a completely different URL).
When posting a public annotation on the paper "Suppressing high mobility group box-1 release alleviates morphine tolerance via the adenosine 5’-monophosphate-activated protein kinase/heme oxygenase-1 pathway",
the annotation also shows up on "Sodium selenite promotes neurological function recovery after spinal cord injury by inhibiting ferroptosis".
Page note added on first paper:
Same page note shows up on second paper after page refresh:
You can reproduce the same bug yourself on the two links provided. I've experienced similar issues on other URLs from this website.