gitreich
gitreich
Hi, thanks for your anwer; the exact executed Command is stored in the metadata folder with the name container_info.log, in the tranfered crawl it was: docker run -d --name ONB_Btrix_parlament_politik_daily_20240306082001...
I made now 78 Crawls where prevously the validation error occured and it never occured The only difference from the general setup was that the cronjob was now triggered with...
I started: docker run -d --name ONB_Btrix_invalid_urls_20240507090214 -e NODE_OPTIONS="--max-old-space-size=32768" -p 9397:9397 -p 12157:12157 -v /home/antares/Schreibtisch/Docker/browsertrix/crawls/:/crawls/ webrecorder/browsertrix-crawler:1.1.1 crawl --screencastPort 9397 --seedFile /crawls/config/invalid_urls_seeds.txt --scopeType prefix --depth 3 --extraHops 0 --workers 1 --healthCheckPort...
Without --restartOnFail I receive ExitCode 17 docker run -d --name ONB_Btrix_invalid_urls_20240507090634 -e NODE_OPTIONS="--max-old-space-size=32768" -p 9961:9961 -p 13181:13181 -v /home/antares/Schreibtisch/Docker/browsertrix/crawls/:/crawls/ webrecorder/browsertrix-crawler:1.1.1 crawl --screencastPort 9961 --seedFile /crawls/config/invalid_urls_seeds.txt --scopeType prefix --depth 3 --extraHops...
With 1.1.2 I receive ExitCode 1 for invalid seeds with parameter --failOnFailedSeed Great! Only thing I am not sure is --failOnInvalidStatus alone ( not combined with failOnFailedSeed) Case URL does...
No, I also don't think any code changes are necessary In my case i have a random port usuage and check if the port is available on the localhost, but...
Hi; At the WAC24 @ikreymer brought up the idea to make a parameter for adding the location of the CDXIndex (for DeDup via writing revisit entries) If this feature would...