crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Created comprehensive crawler scripts for batdongsan.com.vn to extract Vietnamese real estate listings using crawl4ai. Features:

- Full-featured crawler class with pagination support
- Simple script for quick usage and customization...
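A minimal sketch of what such a paginated crawl could look like with crawl4ai's `arun_many`; the listing-page URL pattern below is a hypothetical placeholder, not the script's actual path scheme:

```python
import asyncio
from crawl4ai import AsyncWebCrawler, CrawlerRunConfig, CacheMode

# Hypothetical listing-page pattern; real batdongsan.com.vn paths may differ.
PAGE_URLS = [f"https://batdongsan.com.vn/nha-dat-ban/p{n}" for n in range(1, 6)]

async def crawl_listing_pages():
    run_config = CrawlerRunConfig(cache_mode=CacheMode.BYPASS)
    async with AsyncWebCrawler() as crawler:
        # Fetch the paginated listing pages concurrently.
        results = await crawler.arun_many(PAGE_URLS, config=run_config)
        for result in results:
            if result.success:
                print("crawled:", result.url)

asyncio.run(crawl_listing_pages())
```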
### crawl4ai version

N/A

### Expected Behavior

The documentation should detail the ability to use `cdp_url` to connect to remote browser instances.

### Current Behavior

There is no mention of...
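A minimal sketch of what such a documentation section could show, assuming `cdp_url` is passed through `BrowserConfig` and points at an already-running browser's CDP endpoint (the endpoint URL is a placeholder):

```python
import asyncio
from crawl4ai import AsyncWebCrawler, BrowserConfig, CrawlerRunConfig

async def crawl_via_remote_browser():
    # Placeholder CDP endpoint of a browser started elsewhere,
    # e.g. launched with --remote-debugging-port=9222.
    browser_config = BrowserConfig(cdp_url="http://localhost:9222")
    async with AsyncWebCrawler(config=browser_config) as crawler:
        result = await crawler.arun("https://example.com", config=CrawlerRunConfig())
        print(result.success)

asyncio.run(crawl_via_remote_browser())
```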
### crawl4ai version

0.7.4

### Expected Behavior

I noticed that the documentation for [arun](https://docs.crawl4ai.com/api/async-webcrawler/#22-manual-start-close) and [arun_many](https://docs.crawl4ai.com/api/arun_many/) suggests the return types are CrawlResult and Union[List[CrawlResult], AsyncGenerator[CrawlResult, None]] respectively, which is incorrect,...
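For context, a minimal sketch of the two `arun_many` modes behind that Union type, assuming the documented `stream` flag on `CrawlerRunConfig` (URLs are placeholders):

```python
import asyncio
from crawl4ai import AsyncWebCrawler, CrawlerRunConfig

URLS = ["https://example.com", "https://example.org"]

async def main():
    async with AsyncWebCrawler() as crawler:
        # Batch mode: a list of results once all URLs are done.
        batch = await crawler.arun_many(URLS, config=CrawlerRunConfig(stream=False))
        for result in batch:
            print("batch:", result.url, result.success)

        # Streaming mode: an async generator yielding results as they finish.
        stream = await crawler.arun_many(URLS, config=CrawlerRunConfig(stream=True))
        async for result in stream:
            print("stream:", result.url, result.success)

asyncio.run(main())
```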
### crawl4ai version

0.7.4

### Expected Behavior

Suppose I want to crawl a website using `AsyncHTTPCrawlerStrategy` and pass the proxy configuration. It should start crawling the website by using the...
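A sketch of what the report appears to be attempting, assuming the proxy is passed via `proxy_config` on `CrawlerRunConfig`; the proxy server and URL are placeholders, and whether the HTTP-only strategy actually applies this configuration is exactly what the issue questions:

```python
import asyncio
from crawl4ai import AsyncWebCrawler, CrawlerRunConfig
from crawl4ai.async_crawler_strategy import AsyncHTTPCrawlerStrategy

async def crawl_through_proxy():
    # Placeholder proxy settings.
    run_config = CrawlerRunConfig(
        proxy_config={"server": "http://proxy.example.com:8080"}
    )
    async with AsyncWebCrawler(crawler_strategy=AsyncHTTPCrawlerStrategy()) as crawler:
        result = await crawler.arun("https://example.com", config=run_config)
        print(result.success, result.status_code)

asyncio.run(crawl_through_proxy())
```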
This PR introduces Firecrawl as an optional backend for crawl4ai.

**Updates**

- Added FirecrawlBackend wrapper around Firecrawl's SDK.
- Extended CLI with --backend option (default | firecrawl).
- Enabled output in...
## Summary

There is an error in the docstring of AsyncWebCrawler.arun: the parameter is called `config`, not `crawler_config`.

## List of files changed and why

crawl4ai/async_webcrawler.py - see summary

##...
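For reference, the call the corrected docstring describes (the URL is a placeholder):

```python
import asyncio
from crawl4ai import AsyncWebCrawler, CrawlerRunConfig

async def main():
    async with AsyncWebCrawler() as crawler:
        # The keyword argument is `config`, not `crawler_config`.
        result = await crawler.arun("https://example.com", config=CrawlerRunConfig())
        print(result.success)

asyncio.run(main())
```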
### crawl4ai version

0.7.4

### Expected Behavior

I expect to be able to discover all the URLs from https://www.fastighetsvarlden.se using URL seeding.

### Current Behavior

An error occurs during...
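A minimal sketch of URL seeding, assuming the `AsyncUrlSeeder`/`SeedingConfig` API and the sitemap discovery source; the domain is the one from the report:

```python
import asyncio
from crawl4ai import AsyncUrlSeeder, SeedingConfig

async def discover_urls():
    async with AsyncUrlSeeder() as seeder:
        # "sitemap" is one documented discovery source; Common Crawl is another.
        config = SeedingConfig(source="sitemap")
        urls = await seeder.urls("fastighetsvarlden.se", config)
        print(f"discovered {len(urls)} URLs")

asyncio.run(discover_urls())
```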
## Installing Claude Code GitHub App

This PR adds a GitHub Actions workflow that enables Claude Code integration in our repository.

### What is Claude Code?

[Claude Code](https://claude.com/claude-code) is...
### crawl4ai version

0.7.4

### Expected Behavior

Hello, I am having an issue using target_elements to only save certain content to markdown while still allowing it to view all links...
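A sketch of the setup being described, assuming `target_elements` on `CrawlerRunConfig` restricts the markdown to matching elements while links are still collected from the whole page; the selector and URL are placeholders:

```python
import asyncio
from crawl4ai import AsyncWebCrawler, CrawlerRunConfig

async def main():
    # Limit markdown to the main article body (placeholder selector);
    # link collection is expected to stay page-wide.
    run_config = CrawlerRunConfig(target_elements=["article.main-content"])
    async with AsyncWebCrawler() as crawler:
        result = await crawler.arun("https://example.com", config=run_config)
        print(len(result.links.get("internal", [])), "internal links found")

asyncio.run(main())
```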
### crawl4ai version

1.7.4

### Expected Behavior

If a webpage is built like http://www.example.com/whatever/you/want/9123/, crawl4ai turns it into http://www.example.com/whatever/you/want/9123, which leads to a 404. I monkey patched this as a workaround (very dirty...
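A minimal reproduction sketch of the reported behavior, using the example URL from the report; which result field reflects the rewritten URL may vary by version:

```python
import asyncio
from crawl4ai import AsyncWebCrawler

async def main():
    async with AsyncWebCrawler() as crawler:
        result = await crawler.arun("http://www.example.com/whatever/you/want/9123/")
        # Reportedly the trailing slash is dropped before the request,
        # so the final URL and status reflect the 404 described above.
        print(result.url, result.status_code)

asyncio.run(main())
```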