gitreich

Results 8 comments of gitreich

Hi, thanks for your anwer; the exact executed Command is stored in the metadata folder with the name container_info.log, in the tranfered crawl it was: docker run -d --name ONB_Btrix_parlament_politik_daily_20240306082001...

I made now 78 Crawls where prevously the validation error occured and it never occured The only difference from the general setup was that the cronjob was now triggered with...

I started: docker run -d --name ONB_Btrix_invalid_urls_20240507090214 -e NODE_OPTIONS="--max-old-space-size=32768" -p 9397:9397 -p 12157:12157 -v /home/antares/Schreibtisch/Docker/browsertrix/crawls/:/crawls/ webrecorder/browsertrix-crawler:1.1.1 crawl --screencastPort 9397 --seedFile /crawls/config/invalid_urls_seeds.txt --scopeType prefix --depth 3 --extraHops 0 --workers 1 --healthCheckPort...

Without --restartOnFail I receive ExitCode 17 docker run -d --name ONB_Btrix_invalid_urls_20240507090634 -e NODE_OPTIONS="--max-old-space-size=32768" -p 9961:9961 -p 13181:13181 -v /home/antares/Schreibtisch/Docker/browsertrix/crawls/:/crawls/ webrecorder/browsertrix-crawler:1.1.1 crawl --screencastPort 9961 --seedFile /crawls/config/invalid_urls_seeds.txt --scopeType prefix --depth 3 --extraHops...

With 1.1.2 I receive ExitCode 1 for invalid seeds with parameter --failOnFailedSeed Great! Only thing I am not sure is --failOnInvalidStatus alone ( not combined with failOnFailedSeed) Case URL does...

No, I also don't think any code changes are necessary In my case i have a random port usuage and check if the port is available on the localhost, but...

Hi; At the WAC24 @ikreymer brought up the idea to make a parameter for adding the location of the CDXIndex (for DeDup via writing revisit entries) If this feature would...

thank you for your very fast response, actually there is no time pressure at all