rgaudin comments

Results 846 comments of


                                            rgaudin

Use newer multithreaded youtube downloader from scraperlib

We definitely should but we shall keep an alternative method for S3 download/upload (IO bound) and ffmpeg (cpu bound)

`sw.js` has not been extracted from the ZIM by the time `load.js` tries to register it.

Please improve this ticket's description. The title makes little sense: _extracting_ file from the ZIM is not the responsibility of this scraper…

`sw.js` has not been extracted from the ZIM by the time `load.js` tries to register it.

Since it's extracted from the current URL, it would be good to have its value(s) https://github.com/openzim/warc2zim/blob/main/src/warc2zim/templates/load.js#L24

Revamp the UI to stick to original

> Is that a substainable solution? what do you mean? Do you mean you want to actually fetch the YT source code and remove/hide parts of it? That would be...

Revamp the UI to stick to original

> @rgaudin would know best for the darkmode part. Nothing to know here

Illustration does not respect the openZIM specification

It's a JPEG image…

Illustration does not respect the openZIM specification

Yes, the run would have failed ; and there are conversion functions to use

Cross-browser Integration test related to warc2zim and wombat

@wsdookadr thank you for this. I agree that given how much we are dependent on other projects in warc2zim (and even more with zimit), it would be a very useful...

New ZIM request: Marxist Internet Archive

Size do matter. Scraping over a TB off a third party website is resource intensive for us and for them. zimit is an _uncontrolled environment_ and we don't have tools...

New ZIM request: Marxist Internet Archive

Not sure I fully understand the question but creating, storing and uploading a TB large ZIM file is possible, yes. I think you're referring to manioc.org ZIM.