gitreich

Results 7 comments of gitreich

Hi, thanks for your anwer; the exact executed Command is stored in the metadata folder with the name container_info.log, in the tranfered crawl it was: docker run -d --name ONB_Btrix_parlament_politik_daily_20240306082001...

I made now 78 Crawls where prevously the validation error occured and it never occured The only difference from the general setup was that the cronjob was now triggered with...

I started: docker run -d --name ONB_Btrix_invalid_urls_20240507090214 -e NODE_OPTIONS="--max-old-space-size=32768" -p 9397:9397 -p 12157:12157 -v /home/antares/Schreibtisch/Docker/browsertrix/crawls/:/crawls/ webrecorder/browsertrix-crawler:1.1.1 crawl --screencastPort 9397 --seedFile /crawls/config/invalid_urls_seeds.txt --scopeType prefix --depth 3 --extraHops 0 --workers 1 --healthCheckPort...

Without --restartOnFail I receive ExitCode 17 docker run -d --name ONB_Btrix_invalid_urls_20240507090634 -e NODE_OPTIONS="--max-old-space-size=32768" -p 9961:9961 -p 13181:13181 -v /home/antares/Schreibtisch/Docker/browsertrix/crawls/:/crawls/ webrecorder/browsertrix-crawler:1.1.1 crawl --screencastPort 9961 --seedFile /crawls/config/invalid_urls_seeds.txt --scopeType prefix --depth 3 --extraHops...

With 1.1.2 I receive ExitCode 1 for invalid seeds with parameter --failOnFailedSeed Great! Only thing I am not sure is --failOnInvalidStatus alone ( not combined with failOnFailedSeed) Case URL does...

No, I also don't think any code changes are necessary In my case i have a random port usuage and check if the port is available on the localhost, but...

Hi; At the WAC24 @ikreymer brought up the idea to make a parameter for adding the location of the CDXIndex (for DeDup via writing revisit entries) If this feature would...