JustAnotherArchivist

Results 394 comments of JustAnotherArchivist

> First, the API isn't quite correct, it should really be: [using `http_headers`] Hmm, yeah, but there is even a test for a single payload with the headers included in...

I'd argue that #45 does not actually fix #38 at all. Reading a WARC record and then writing it again should produce identical record body data (though it may change...

Your reading of the documentation is correct. The value of `mode` in the filelock code is passed into `os.open`'s `flags` argument. This doesn't matter because `os.open` never sees the name...

I'd love to put the descriptions in annotations as well! Your proposed syntax has one major disadvantage though: it essentially ruins the flexibility of annotations. I'm not sure `Annotated` is...

1. Yes, I think that's effectively the issue. Escaping a literal `^` with a backslash should be sufficient. (`^^` won't work as that's used for 0x1e.) 2. Ah, thank you,...

I looked a bit into what it would take to ship stubs ourselves so we don't need to remember to send a PR to typeshed upon a release with API...

I've been thinking about this, and I'm not sure. It would make sense if snscrape returned a single JSON object (like yt-dlp does). Including it in every JSONL line seems...

> can you please comment if there's already an undocumented way to achieve this? There is not, unless you implement your own JSON serialisation stuff or do ugly monkeypatches. >...

This can't be done directly with the snscrape CLI. There are various reasons for this, but mostly the `--format` syntax just can't cover all the cases of multiple images, retweets,...

Yeah, probably, although I'm not familiar enough with `jq` to tell what incantations you'd need to use exactly. And yeah, might be preferable to a script.