crawl4ai icon indicating copy to clipboard operation
crawl4ai copied to clipboard

๐Ÿš€๐Ÿค– Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Results 541 crawl4ai issues
Sort by recently updated
recently updated
newest added

โ€ฆexecution ## Summary Refactor extraction and completion functions to support asynchronous execution Fixes #1055 ## List of files changed and why - async_webcrawler.py - content_filter_stratery.py - extraction_strategy.py - utils.py ##...

## Summary This PR introduces a new feature for cache management - content change detection. Users can now specify whether they want to check if web content has changed before...

Models & Pricing | DeepSeek API Docs The above html code does not get the title ## Summary by CodeRabbit - **Bug Fixes** - Improved extraction of page titles from...

## Summary In extract() of LLMExtractionStrategy, `response` is not overridden. Thus if some error is met in later part of this `try`, in `except` we'll get error triggering response.choices[0].message.content: 'str'...

### crawl4ai version 0.6.2 ### Expected Behavior [FETCH]... โ†“ http://ayacd.yrdcpcn.org.cn/upload/2023/0807/b316d... | Status: False | Time: 0.28s [SCRAPE].. โ—† http://ayacd.yrdcpcn.org.cn/upload/2023/0807/b316d... | Time: 0.0s [ERROR]... ร— http://ayacd.yrdcpcn.org.cn/upload/2023/0807/b316d... | Error: โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” โ”‚ ร—...

๐Ÿž Bug
๐Ÿฉบ Needs Triage

### crawl4ai version 0.6.3 ### Expected Behavior Getting the data of each link correctly and not getting the Failed on navigating ACS-GOTO with multiple URLs crawl. I tried using a...

โ“ Question

### crawl4ai version 0.6.3 ### Expected Behavior NA ### Current Behavior When running with use_managed_browser=True, the crawler crashes with an IndexError: list index out of range at line 967 in...

๐Ÿž Bug
โš™๏ธ In-progress
๐Ÿ“Œ Root caused

This commit introduces a new example script, `prompt_driven_crawler.py`, located in `docs/examples/`. The script enables you to perform a recursive crawl starting from a given URL. It uses an LLM (OpenAI...

## Summary If no credentials are provided, the token gets never verified, even if jwt is enabled. This PR throws a 401 exceptions then. ## List of files changed and...

### crawl4ai version 0.7.7 ### Expected Behavior When I use the latest version of Docker deployment and set 'max_dession_permit' to 1, an error message will appear, as follows: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File...

๐Ÿž Bug
๐Ÿฉบ Needs Triage