Caleb Peffer
I've had a few prospects/customers ask me if we could allow them to run Javascript and actions on the page before we extract the data, such as scrolling the page...
A customer reached out about https://www.clinikally.com/blogs/news. They were trying to crawl it with the parameter "Include only paths: blogs/news/*". The results were inconsistent, sometimes giving 9 links and sometimes giving more...
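To check independently which discovered links an include pattern like `blogs/news/*` should keep, something like the following could be used. This is only a sketch of glob-style path matching, not how the crawler itself applies "Include only paths"; the function name `matches_include` and the sample URLs are made up for illustration.

```python
from fnmatch import fnmatchcase
from urllib.parse import urlparse


def matches_include(url: str, patterns: list[str]) -> bool:
    """Return True if the URL path matches any glob-style include pattern.

    Hypothetical helper: assumes patterns are written relative to the
    site root without a leading slash, e.g. "blogs/news/*".
    """
    path = urlparse(url).path.lstrip("/")
    # fnmatchcase is case-sensitive and its "*" also matches "/" characters.
    return any(fnmatchcase(path, pattern) for pattern in patterns)


if __name__ == "__main__":
    include = ["blogs/news/*"]
    candidates = [
        "https://www.clinikally.com/blogs/news/example-post",  # made-up path
        "https://www.clinikally.com/blogs/news",
        "https://www.clinikally.com/collections/all",  # made-up path
    ]
    for url in candidates:
        print(url, matches_include(url, include))
```

Note that under this reading the index page `blogs/news` itself does not match `blogs/news/*`, which is worth ruling out when link counts differ between runs.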
https://www.liveflow.io/product-guides/how-to-disable-links-to-quickbooks. It's a Loom video; funnily enough, there are other videos in the same format that seem to work on the site. Tried: * adding a timeout of 2000 *...
I've had a few customers who are concerned with the few failures they experience during long crawl jobs. Some have even implemented their own automatic retry functionality. If we handle...
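For reference, the kind of client-side retry customers have been writing probably looks roughly like this. It is a generic sketch with exponential backoff, not an existing feature of ours; `retry_with_backoff` and its parameters are hypothetical names, and `fetch` stands in for whatever call scrapes a single page.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    fetch: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fetch() until it succeeds or max_attempts is reached.

    Waits base_delay, 2 * base_delay, 4 * base_delay, ... between attempts.
    The sleep function is injectable so the helper can be tested without waiting.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fetch()
        except Exception:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error to the caller
            sleep(base_delay * 2 ** (attempt - 1))
    raise ValueError("max_attempts must be at least 1")
```

Retrying every exception is a simplification; a real client would likely retry only timeouts and 5xx-style failures.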
PDF parsing is not great on this
When using the /scrape endpoint on https://www.mccarthy.com/craft/search?jobviteiframe=job%2FoYD3tfwR, the job description information doesn't appear on the page. [ ] Notify Will on Crisp once completed
Hey, https://refact.ai/ on /scrape is timing out, not sure why. Could this have something to do with fire-engine? @tomkosm CCing customers @nyacg @danny-hunt for visibility
https://dlp.dubai.gov.ae/en/Pages/OfficialGazette.aspx @mogery
A customer who is using the linksOnPage field noticed that it still includes links from headers and footers, even though they have been removed from the content. Move the URL...
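As a rough illustration of the behavior the customer expects, the sketch below collects links while skipping anything inside `<header>` or `<footer>`. It uses only Python's standard `html.parser` and is an assumption about the desired outcome, not the way linksOnPage is actually computed; the class name `BodyLinkCollector` and the sample HTML are invented.

```python
from html.parser import HTMLParser


class BodyLinkCollector(HTMLParser):
    """Collect href values from <a> tags that are outside <header>/<footer>."""

    SKIPPED = {"header", "footer"}

    def __init__(self) -> None:
        super().__init__()
        self.skip_depth = 0  # > 0 while inside a skipped element
        self.links: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self.skip_depth += 1
        elif tag == "a" and self.skip_depth == 0:
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self.skip_depth > 0:
            self.skip_depth -= 1


if __name__ == "__main__":
    sample = (
        '<header><a href="/home">Home</a></header>'
        '<main><a href="/post">Post</a></main>'
        '<footer><a href="/legal">Legal</a></footer>'
    )
    collector = BodyLinkCollector()
    collector.feed(sample)
    print(collector.links)  # ['/post']
```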
**Describe the Bug** For some odd reason, using the crawl endpoint on this link [https://www.tripadvisor.com/Restaurant_Review-g60763-d4418144-Reviews-Reichenbach_Hall-New_York_City_New_York.html](https://www.tripadvisor.com/Restaurant_Review-g60763-d4418144-Reviews-Reichenbach_Hall-New_York_City_New_York.html) returns a single page on the playground, but returns nothing via curl request...