perma
Mark early failed captures as failed
Some captures from the first few months of Perma failed, and now show only an arbitrary JavaScript library instead of the page's HTML. The actual HTML was never recorded. These captures should be detected and marked as failed so that the screenshot is shown instead.
Examples:
https://perma.cc/0b8rEbzAeLt
https://perma.cc/0AJoRguAHH9
https://perma.cc/0FcUyeG1ueG
https://perma.cc/0XR6SHkae8Z
https://perma.cc/0VDEFpeV4hR
http://perma.cc/0JyHP2USEcP
http://perma.cc/0pAbVwjDQgJ
https://perma.cc/0qmeom2SYDX
https://perma.cc/0upUSSNF7rH
These WARCs appear to contain a lot of "resource" type records, rather than the usual "request"/"response" type records produced when recording from the web. That might prove a good way to detect them, though it would involve combing through all of Perma's WARCs.
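
A rough sketch of what that detection could look like, using only the Python standard library. This is not Perma's actual code: the function names, the hand-rolled header parser, and the 0.5 threshold are all assumptions made for illustration, and the right cutoff would need to be checked against the example captures above. A real pass over the archive would probably be better served by an established WARC library such as warcio.

```python
import gzip
from collections import Counter


def count_record_types(stream):
    """Count WARC-Type values in an uncompressed WARC byte stream."""
    counts = Counter()
    while True:
        line = stream.readline()
        if not line:
            break  # end of stream
        if not line.startswith(b"WARC/"):
            continue  # blank separator lines between records
        # Read the record's header block up to the first empty line.
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break
            name, _, value = line.partition(b":")
            headers[name.strip().lower()] = value.strip()
        rec_type = headers.get(b"warc-type", b"unknown")
        counts[rec_type.decode("ascii", "replace")] += 1
        # Skip the body so its contents are never mistaken for headers.
        stream.read(int(headers.get(b"content-length", b"0")))
    return counts


def looks_like_failed_capture(counts, threshold=0.5):
    """Hypothetical heuristic: mostly "resource" records means a bad capture."""
    resource = counts.get("resource", 0)
    web = counts.get("request", 0) + counts.get("response", 0)
    total = resource + web
    return total > 0 and resource / total > threshold


def check_warc(path):
    """Return True if the WARC at `path` looks like an early failed capture."""
    # gzip.open reads multi-member .warc.gz files as one continuous stream.
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rb") as f:
        return looks_like_failed_capture(count_record_types(f))
```

Running `check_warc` over each stored WARC would give a list of candidates, which could then be spot-checked against the examples before anything is marked as failed.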