js-crawler issues

Crawler is not a function

3

Hi, I am trying to setup a basic crawler script but I am getting an error: ` new Crawler().configure({depth: 3}) ^ TypeError: Crawler is not a function at Object. (/var/www/user/test2.js:3:1)...

atomixstar

stop crawling

5

Is it possible to force crawler to stop its crawling. I have condition that only 500 pages should be crawled when that condition is met ti want to stop this...

Muneem

enhancement

question

Link crawling gets stuck in Wordpress sites

If we try to crawl websites that is WP then link gets stuck after crawling few links and nothing happens after that, crawler just gets stalled. Can you suggest me...

sandysh

Crawler completes then cancels the output of "crawledUrls"?

I found when crawling a site with the depth set to 2, it will finish, and console.log(crawledUrls) correctly. But when using a higher depth like 4 or 6 (which of...

sbr2567

How to deal with basic auth?

1

Dear developers, I am crafting a tool that let me automatically crawl a few sites. However, they are protected by a username and password (that I have). Which is the...

pittersnider

Crawler stopped without reason and any error

1

It stops working in some urls for no reason, even without any non-standard configuration. domains he stops: paraleloiluminacao.com.br tcengenhariaeletrica.com.br kplojista.com.br bsgrafo.com.br

rafaelwdornelas

freeze and defrost for saving and resuming a big crawl? enhancement

5

Would be awesome to have the already visited urls saveable so that you can restart a crawl later and not revisit links, to start where you left off.

Shane-Neeley

enhancement

Crawler stopped without reason and any error

I am trying to crawl a big website (arezzo.com.br), however, after ~1700 URLs crawled, it simply stopped. No errors printed, and also the `finished` callback wasn't called. ![image](https://user-images.githubusercontent.com/13719228/64491922-2876e380-d244-11e9-992a-4e952cae58cc.png) Can someone...

pittersnider