Mat Kelly
Mat Kelly
Via @ibnesayeed in https://github.com/oduwsdl/ipwb/pull/391#issuecomment-399261574 To deal with various datetime formats, I would define the following functions: ```python def parseRFC1123(RFC1123_datetime_string): return native_datetime_object def parse14Digit(14Digit_datetime_string): return native_datetime_object def to14Digit(native_datetime_object): return "14Digit_datetime_string" def...
Related to #71.
In a previous ticket (which #?), @akavel (or maybe @ibnesayeed) mentioned having a contributors list. While code contributions would establish this on the GitHub interface, we are thankful for bug...
The [WARC/1.1](http://bibnum.bnf.fr/warc/WARC_ISO_28500_version1-1_latestdraft.pdf) spec (Section B.8) gives an example where a response record is segmented into multiple other smaller records. This changes the hash digests of the records both in the...
Per #369 and #375, this value is critical to replay the payload, especially when the HTTP header is not in IPFS.
Related to lack-of-header mitigation in #369 and the discussion in #375 via @ibnesayeed
Consider the scenario where a user indexes a WARC to ipwb and gets a CDXJ record with `locator` hashes H/P where H is the IPFS hash of the HTTP header...
It should only downcase the hostname and scheme. This has not yet been tested but might be a function of the parameters we are sending to the SURT module. A...
At one point, https://github.com/ipfs-search/ipfs-search allowed searching of the content of dereferenced payloads described by a set of IPFS hashes. Because IPFS hashes are describing non-changing content, being able to search...
Relevant to ServiceWorker usage on replay and general potential use cases of serving replay on HTTPS. Related #328