Scrapegraph-ai icon indicating copy to clipboard operation
Scrapegraph-ai copied to clipboard

Python scraper based on AI

Results 147 Scrapegraph-ai issues
Sort by recently updated
recently updated
newest added

When I run this code `from scrapegraphai.graphs import SmartScraperGraph import nest_asyncio graph_config = { "llm": { "model": "ollama/mistral", "temperature": 0, "format": "json", # Ollama needs the format to be specified...

**Describe the bug** I'm getting an error when running the Groq example on the repo. I confirmed my Groq key works when making a normal request. *Code* ```python from scrapegraphai.graphs...

I have seen in logs that before content pass to llm it does cleanup and send clean text. is there any way to send html as it is ( not...

KeyError Traceback (most recent call last) [/usr/local/lib/python3.10/dist-packages/scrapegraphai/graphs/abstract_graph.py](https://localhost:8080/#) in _create_llm(self, llm_config, chat) 181 try: --> 182 self.model_token = models_tokens["claude"][llm_params["model"]] 183 except KeyError as exc: KeyError: 'claude-3-haiku-20240307' The above exception was the...

model : gpt-4o I have html where there are many attachment (around 110 but it got only 69) so i use prompt like "Extract title, abstract, published date, all attachments...

**Describe the bug** I tried to change the batch size to a different number (4) from the default (16) by specifying batch size in the graph configuration. However, when I...

**Describe the bug** I followed the example of bedrock https://github.com/VinciGit00/Scrapegraph-ai/blob/main/examples/bedrock/smart_scraper_bedrock.py It was working in the first place. Then after I replace the url from source="https://perinim.github.io/projects/", to source="https://www.seek.com.au/jobs?page=1&sortmode=ListedDate", I got the...