JustAnotherArchivist
Instead of a crash with a `ValueError`, such tweets will now produce warnings like `Consistency errors in unified card on tweet 1602600450255163393: missing media`.
The Reddit scrapers return submissions and comments unless instructed otherwise. The absence of those fields means that there are no submissions in the time frame (since they aren't present on...
Well, yes, there is no `TelegramScraper`, only a `TelegramChannelScraper`.
@IvanTrendafilov Yes, it is safe to run multiple instances of the CLI at the same time. Or indeed to use the snscrape package/modules from multiple independent Python processes in parallel...
I've previously hinted at this possibility in , but that issue's really about something else, and comment extraction is a whole different beast than just more post metadata, so thanks...
@pukkandan That's only the size of the set object itself and does not include the size of the strings within. Each of those is 75 bytes per my testing (which...
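The distinction between the set object's own size and the size of the strings it holds can be sketched with `sys.getsizeof`, which reports only the container's own footprint. The helper name `shallow_and_deep_size` and the sample data are hypothetical, and the exact byte counts vary by Python version and platform, so none are shown here.

```python
import sys

def shallow_and_deep_size(strings):
    """Return (size of the set object alone, size including its string elements)."""
    s = set(strings)
    shallow = sys.getsizeof(s)  # the hash table only, not the elements it references
    deep = shallow + sum(sys.getsizeof(item) for item in s)
    return shallow, deep

# Hypothetical sample data: 1000 distinct 26-character ASCII strings.
sample = [f"video-id-{i:017d}" for i in range(1000)]
shallow, deep = shallow_and_deep_size(sample)
print(shallow, deep)
```

The second number is the one that matters when estimating the memory cost of keeping many strings in a set.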
I'm one of the operators of the channel that triggered the creation of this issue, and I'd like to give a bit more context on why this is needed. We have...
> In this case though, looking at the code, I see that dconv_d2s() is doing all the difficult stuff and we know the length of the string-ified decimal before...
I would argue that the dumping behaviour is correct. Numbers in JSON are not floats but arbitrarily long sequences of digits. For parsing, we should support `parse_float = decimal.Decimal` as...
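For illustration, this is how the `parse_float` hook behaves in Python's standard-library `json` module. It is only a sketch of the behaviour being asked for; whether the library under discussion exposes the same parameter is not stated above. The sample document is made up.

```python
import json
from decimal import Decimal

# A JSON number with more digits than a binary float can hold.
text = '{"price": 0.1000000000000000055511151231257827}'

# Default parsing converts the number to a float and drops the extra digits.
as_float = json.loads(text)["price"]

# With parse_float=Decimal, the digit string is passed to Decimal unchanged.
as_decimal = json.loads(text, parse_float=Decimal)["price"]

print(repr(as_float))
print(repr(as_decimal))
```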
I disagree. `decimal.Decimal` can be directly mapped to a JSON number. It's just a `-?\d+(\.\d+)?(E[+-]\d+)?` number in almost all cases. There are two special cases, `NaN` and `Infinity` (positive or...
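A rough sketch of that mapping, using the pattern quoted above: finite `Decimal` values are emitted via `str()`, and the special values are reported separately. The function name `decimal_as_json_number` is hypothetical, and returning `None` for the special cases is only a placeholder, since how they should be handled is a design decision not settled here.

```python
import re
from decimal import Decimal

# Pattern quoted in the comment above; fullmatch() anchors it to the whole string.
JSON_LIKE = re.compile(r"-?\d+(\.\d+)?(E[+-]\d+)?")

def decimal_as_json_number(value):
    """Return str(value) if it is finite and matches the pattern, else None."""
    text = str(value)
    if value.is_finite() and JSON_LIKE.fullmatch(text):
        return text
    return None  # NaN, Infinity, -Infinity: handling left open in this sketch

for sample in ("1.50", "-0", "1E+3", "0.00000001", "NaN", "-Infinity"):
    print(sample, decimal_as_json_number(Decimal(sample)))
```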