web-archives icon indicating copy to clipboard operation
web-archives copied to clipboard

Automatically strip query string for pages that aren't found

Open huyz opened this issue 4 years ago • 3 comments

Is your feature request related to a problem? Please describe. Sometimes a URL isn't found in archive.is because of a superfuous query string

Describe the solution you'd like If there is no match inside of archive.is, then automatically strip the query string and try again. Or maybe just strip the query string before searching archive.is

Describe alternatives you've considered I edit the URL manually and try again.

Additional context Here's an example URL: https://www.economist.com/united-states/2020/06/11/meet-our-us-2020-election-forecasting-model?cid1=cust/ednew/n/bl/n/2020/06/11n/owned/n/n/nwl/n/n/NA/494260/n

huyz avatar Jun 11 '20 23:06 huyz

Thanks for the feature request, this issue also applies to the Wayback Machine.

On archive.is we could redirect to https://archive.is/https://www.economist.com/united-states/2020/06/11/meet-our-us-2020-election-forecasting-model* when no results are found. Note the asterisk at the end, that will also catch snapshots with different query strings.

dessant avatar Aug 29 '20 09:08 dessant

On Wayback Machine we can also redirect to a similar URL that catches pages archived with different query strings: https://web.archive.org/web/*/https://www.economist.com/united-states/2020/06/11/meet-our-us-2020-election-forecasting-model*

dessant avatar Aug 29 '20 09:08 dessant

I just would like to bump up/second this request, would be really great to add this to the context menu function.

newsjunkie247 avatar Jul 04 '22 00:07 newsjunkie247

Bumping up and seconding this request; getting negative answers from the respective archives thanks to the appended tracking string.

mollyrealized avatar Dec 21 '22 12:12 mollyrealized

Let's track this issue in https://github.com/dessant/web-archives/issues/76.

dessant avatar Feb 06 '24 09:02 dessant