Scrapegraph-ai
Scrapegraph-ai copied to clipboard
Python scraper based on AI
When I run this code `from scrapegraphai.graphs import SmartScraperGraph import nest_asyncio graph_config = { "llm": { "model": "ollama/mistral", "temperature": 0, "format": "json", # Ollama needs the format to be specified...
**Describe the bug** I'm getting an error when running the Groq example on the repo. I confirmed my Groq key works when making a normal request. *Code* ```python from scrapegraphai.graphs...
I have seen in logs that before content pass to llm it does cleanup and send clean text. is there any way to send html as it is ( not...
KeyError Traceback (most recent call last) [/usr/local/lib/python3.10/dist-packages/scrapegraphai/graphs/abstract_graph.py](https://localhost:8080/#) in _create_llm(self, llm_config, chat) 181 try: --> 182 self.model_token = models_tokens["claude"][llm_params["model"]] 183 except KeyError as exc: KeyError: 'claude-3-haiku-20240307' The above exception was the...
model : gpt-4o I have html where there are many attachment (around 110 but it got only 69) so i use prompt like "Extract title, abstract, published date, all attachments...
**Describe the bug** I tried to change the batch size to a different number (4) from the default (16) by specifying batch size in the graph configuration. However, when I...
**Describe the bug** I followed the example of bedrock https://github.com/VinciGit00/Scrapegraph-ai/blob/main/examples/bedrock/smart_scraper_bedrock.py It was working in the first place. Then after I replace the url from source="https://perinim.github.io/projects/", to source="https://www.seek.com.au/jobs?page=1&sortmode=ListedDate", I got the...