waybackurls
[issue] URLs appended together
Some URLs in the output are concatenated with what seem to be URLs from different domains.
Test:
> echo "https://dominoweb.draco.res.ibm.com"|waybackurls |grep TCG
Notice .TCG.htmlhttp:/msdn.microsoft.com/en-us/library/ms171339.aspxhttp:/www.opensymphony
Lmao, dude it's part of path :D
I'm facing the same issue too. Is there any way to parse the output and split up such cases of appended URLs?
Yes, you can split the output in awk, using http/https as the separator.
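A minimal sketch of that awk approach, in case it helps. The function name `split_urls` is made up for illustration; it is not part of waybackurls. It starts a new line before every `http:/` or `https:/` it finds inside a line. Keep in mind the earlier reply: the embedded URL may really be part of the archived path, so splitting changes what was actually recorded, and the split-off pieces keep the single-slash `http:/` form seen in the output above.

```shell
# Hypothetical helper: read URLs on stdin and break any line that contains
# more than one "http:/" or "https:/" into one URL per line.
split_urls() {
  awk '{
    # Put a newline in front of every scheme occurrence ("&" is the match).
    gsub(/https?:\//, "\n&")
    # The first URL on the line also got a leading newline; drop it.
    sub(/^\n/, "")
    print
  }'
}
```

Usage would be to append it to the pipeline from the test above, e.g. `... | waybackurls | split_urls | grep TCG`.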