crawl4ai
crawl4ai copied to clipboard
ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
### crawl4ai version 0.6.1 ### Expected Behavior Crawl4AI CLI work using -b parameter ### Current Behavior   ### Is this reproducible? Yes ### Inputs Causing the Bug ```bash ```...
## Summary Improved utils.normalize_url functionality: 1. Can handle more edge cases; 2. Add the function to remove tracking parameters, enhancing privacy and anti-crawling capabilities. ## Checklist: - [β ] My...
### crawl4ai version 0.6.3 ### Expected Behavior Extraction sends the preferred content (in my case markdown) and url for extraction. ### Current Behavior https://github.com/unclecode/crawl4ai/blob/main/crawl4ai/async_webcrawler.py#L610 When using raw html, the url...
### crawl4ai version current as of 5-15-2025 - pip install ### Expected Behavior trying to use openrouter for processing ### Current Behavior trying to use openrouter and the application hangs...
### crawl4ai version 0.5.0.post8 ### Expected Behavior * The crawl4ai crawler should consistently work across multiple API calls in the FastAPI application. * The browser context should remain active and...
### crawl4ai version 0.6.1 ### Expected Behavior Crawl4AI should perform deep crawling when using CrawlerRunConfig with a BFSDeepCrawlStrategy, even if the Crawl4AI server is running in a Docker container. I...
## Summary Fixes #1066 This PR fixes the error that happens when you send a request to /crawl with `stream: true` in the `crawler_config`. **Whatβs the fix?** If a request...
β¦re Object ## Summary `Fixes #1056` ## List of files changed and why crawl4ai/__init__.py ## How Has This Been Tested? ``` "crawler_config": { "type": "CrawlerRunConfig", "params": { "chunking_strategy": { "type":...
### crawl4ai version 0.6.0 ## Expected Behavior The crawler should either: 1. Successfully crawl the page and convert to markdown, or 2. Provide a clear error message about the browser...
### crawl4ai version # crawl4ai/_version.py __version__ = "0.5.0.post8" ### Expected Behavior Here is metadata object should show title as below: "metdata": { **"title": "Where's my refund? | Internal Revenue Service",**...