browsertrix-crawler
browsertrix-crawler copied to clipboard
Video on kiwix.org homepage is not retrieved
Basically, when running the crawler with official 0.12.2 Docker image on https://kiwix.org/fr/, the Youtube video on the home page is not in the WARCs:
docker run --rm -it -v ${PWD}/output:/output webrecorder/browsertrix-crawler:0.12.2 crawl --depth 0 --url https://kiwix.org/fr/ --cwd /output/.tmph919m5n3
For more details, see https://github.com/openzim/zimit/issues/247