Caleb Peffer

Results 45 issues of Caleb Peffer

On some pages, content is stored within an iframe. I recommend using Playwright to handle this. This depends on the upcoming fc-scraper release @nickscamara @rafaelsideguide

stale

The issue is that some PDFs are returning bad text. On top of fixing the root cause, it may be worth returning the original PDF so the user can parse with their...

bug

**Describe the Bug** Firecrawl fails to gather all the data available on documentation portals powered by Readme. Specifically, it's not getting information such as query parameters in code snippets. **To...

bug

**Problem Description** An image of the page is sometimes useful for vision models. **Proposed Feature** Return a screenshot of the page for vision models. **Implementation Suggestions** Having a parameter that...

enhancement
high-priority
Customer Request

**Problem Description** For the sake of flexibility and robustness, it would be good if we passed back the raw PDFs we grab as objects so people can parse them with their...

enhancement
Customer Request

**Problem Description** In certain circumstances, it's useful to grab computed styles from a page. Currently, this is impossible via Firecrawl. **Proposed Feature** Firecrawl returns a computed styles object when...

enhancement
blocked
low priority
Customer Request
stale

Customer problem: "When I crawl websites with firecrawl I'll sometimes get essentially the same links eg https://site.com/, https://www.site.com/, https://www.site.com/, when in reality they're all the same page" It would be...
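One way to approach the duplicate-link problem described above is to compare links by a normalized key. The following is a minimal sketch, not Firecrawl's implementation: `normalize_url` and `dedupe_links` are hypothetical helpers, and the sketch assumes that the `www.` host and the bare domain serve the same page and that a trailing slash is not significant. Query strings and fragments are left untouched.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url: str) -> str:
    """Return a comparison key for a URL (hypothetical helper)."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    # Assumption: "www.site.com" and "site.com" serve the same page.
    if host.startswith("www."):
        host = host[4:]
    # Preserve an explicit port if the URL has one.
    if parts.port is not None:
        host = f"{host}:{parts.port}"
    # Assumption: a trailing slash does not change the page.
    path = parts.path.rstrip("/") or "/"
    return urlunsplit(
        (parts.scheme.lower(), host, path, parts.query, parts.fragment)
    )


def dedupe_links(links):
    """Keep the first link seen for each normalized key, in order."""
    seen = set()
    unique = []
    for link in links:
        key = normalize_url(link)
        if key not in seen:
            seen.add(key)
            unique.append(link)
    return unique
```

With these assumptions, `https://site.com/`, `https://www.site.com/`, and `https://www.site.com` all map to the same key, so only the first one is kept.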

https://gamelandia.fun/featured-events/#start_date=2024-08-17;end_date=2024-09-16;featured=1 Doesn't work on /scrape

bug
Broken Link

```
data {
  content: '',
  markdown: '',
  html: '',
  linksOnPage: [],
  metadata: { sourceURL: 'https://www.solvhealth.com/privacy', pageStatusCode: 200 }
}
success true
error No page found
```
looks like bardia fixed...

bug
high-priority

Sometimes, customers want to remove any boilerplate on pages. One potential strategy on the crawl endpoint is to remove any content that is present on all pages. For example, if...

Customer Request
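The "remove content present on all pages" strategy mentioned in the last issue could be sketched roughly as follows. This is an illustration under stated assumptions, not Firecrawl's implementation: `strip_shared_lines` is a hypothetical helper, pages are assumed to be plain text or markdown keyed by URL, and boilerplate is assumed to repeat as whole identical lines.

```python
def strip_shared_lines(pages: dict[str, str]) -> dict[str, str]:
    """Remove non-blank lines that occur on every page (hypothetical helper)."""
    # With fewer than two pages there is nothing to compare against.
    if len(pages) < 2:
        return dict(pages)

    # Collect the set of distinct non-blank lines for each page.
    line_sets = [
        {line.strip() for line in text.splitlines() if line.strip()}
        for text in pages.values()
    ]
    # Lines found on every page are treated as boilerplate.
    shared = set.intersection(*line_sets)

    return {
        url: "\n".join(
            line for line in text.splitlines() if line.strip() not in shared
        )
        for url, text in pages.items()
    }
```

A line-level intersection is deliberately crude: it would miss boilerplate that varies slightly per page, and it could drop a legitimate line that happens to appear everywhere, so a real crawl would probably need a threshold or a DOM-aware comparison.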