HDembinski.github.io
HDembinski.github.io copied to clipboard
posts/parsing_webpages_with_llm
From unstructured to structured: Parsing webpages with a Large Language Model (LLM) – Hans Dembinski’s blog
https://hdembinski.github.io/posts/parsing_webpages_with_llm.html
Update: There is a new library from the authors of the popular pydantic type validation library, called PydanticAI, which looks very promising. It is designed to simplify the conversion of unstructured into structured data.