Lukáš Křivka
**Describe the bug** This is somewhere between a bug and a feature request. When you use RequestList and RequestQueue together, all requests are first taken from the list before any...
When you need to load or write a larger amount of data, you always have to persist your offset state. Otherwise, you may - Repeat the same load after every...
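The offset-persistence idea above can be sketched roughly as follows. This is only an illustration under stated assumptions: a local JSON file stands in for whatever storage the actor would really use (for example a key-value store record), and `STATE_FILE`, `loadOffset`, `saveOffset`, `processAll` and `handleBatch` are hypothetical names, not part of any SDK.

```javascript
const fs = require('fs');
const os = require('os');
const path = require('path');

// Hypothetical state location; a real actor would more likely use a key-value store record.
const STATE_FILE = path.join(os.tmpdir(), 'offset-state-example.json');

// Read the last persisted offset, or start from 0 when no readable state exists yet.
function loadOffset(stateFile = STATE_FILE) {
    try {
        return JSON.parse(fs.readFileSync(stateFile, 'utf8')).offset;
    } catch (err) {
        return 0;
    }
}

// Persist the offset so a restarted run can resume from it.
function saveOffset(offset, stateFile = STATE_FILE) {
    fs.writeFileSync(stateFile, JSON.stringify({ offset }));
}

// Process `items` in batches and persist the offset after every finished batch.
// `handleBatch` is a hypothetical callback; if it throws, the offset of the
// failed batch is not saved, so the next run retries that batch.
function processAll(items, batchSize, handleBatch, stateFile = STATE_FILE) {
    let offset = loadOffset(stateFile);
    while (offset < items.length) {
        const batch = items.slice(offset, offset + batchSize);
        handleBatch(batch, offset);
        offset += batch.length;
        saveOffset(offset, stateFile);
    }
    return offset;
}
```

With this shape, a run that is interrupted mid-way resumes at the first unfinished batch instead of repeating the whole load. A batch that failed part-way is retried in full, so `handleBatch` should tolerate seeing the same items twice.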
New (and sometimes experienced) users often fight with the allowed characters for different APIs. As far as I know, there are 2 types of limited character APIs: - Key-value store...
For example, this page - http://nvdmc.org/feed/ is parseable with Cheerio so our crawler should just crunch it. Then it will automatically propagate to Cheerio Scraper that we can use for...
Honestly, I don't know what the expected behavior is. I can see 2 general approaches: 1) Autoscaling just helps a bit to slow things down but doesn't do so very...
I have already implemented this for a few customers (only with in-browser `document`). You cannot simply extract keywords from `$('body').text()` because it will splash all words together. So the solution...
Or at least recognize it and show a reasonable message.
https://www.linkedin.com/company/delegatus https://www.facebook.com/pages/category/Lawyer---Law-Firm/Delegatus-services-juridiques-inc-131011223614905/ https://www.facebook.com/pages/KinEssor-Groupe-Conseil/208264345877578 Reproduce: ``` (new RegExp('(?
The use case is that a regular crawler crawls homepage -> categories -> detail pages. If the homepage fails, the whole crawler fails. If the category fails, a lot of pages are missing,...
https://apify.com/page-analyzer is broken; the UI spins forever. In the network, this request returns a 404: https://apifier-key-value-store-prod.s3.amazonaws.com/7fMXPPEAuFdaWCvQt/OUTPUT?AWSAccessKeyId=AKIAJTQHBVH6QKNNBOIQ&Expires=1595574947&Signature=TQPjEY%2BVpXeqlC%2FrRAllX2kdQAc%3D