lambdasoup
Functional HTML scraping and rewriting with CSS in OCaml
https://developer.mozilla.org/en-US/docs/Web/CSS/:is In lieu of #15, `:is()` would cover it as well as many other use cases while keeping the selector syntax minimal.
Hello, I don't know CSS, but it looks like your library can do what I need. I would like some simple data extraction example. If I have such an HTML...
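A minimal sketch of the kind of extraction asked about here, assuming the `lambdasoup` package is installed. The HTML from the question is cut off, so the markup and the `li.book a` selector below are made up for illustration; only `parse`, `$$`, `to_list`, `R.leaf_text` and `R.attribute` come from the `Soup` module.

```ocaml
(* Requires the lambdasoup package. *)
open Soup

(* Hypothetical input: the HTML from the original question is not shown. *)
let html = {|
<ul id="books">
  <li class="book"><a href="/a">First</a></li>
  <li class="book"><a href="/b">Second</a></li>
</ul>
|}

(* Collect (link text, href) pairs for every <a> inside an li.book. *)
let links : (string * string) list =
  parse html
  $$ "li.book a"
  |> to_list
  |> List.map (fun a -> (R.leaf_text a, R.attribute "href" a))

let () =
  List.iter (fun (text, href) -> Printf.printf "%s -> %s\n" text href) links
```

The `R.` variants raise an exception when the text or attribute is missing; the plain `leaf_text` and `attribute` functions return an `option` instead, which may be the safer choice on messy pages.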
I discovered that lambdasoup/markup.ml takes many times more heap space to parse a file than the file size. Consider a trivial test program: ```ocaml let src = Soup.read_file (Sys.argv.(1)) let...
`Soup.insert_before` only works if the node is a child of an element node, but it fails when the node is at the root of the element tree. ``` utop #...
Hi, I started using Lambda Soup and found that it does not seem to support selector lists, like `".bg1, .bg3"`. I need to parse an HTML document with various ``...
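If the installed version rejects selector lists, one possible workaround is to run each selector separately and concatenate the results. The sketch below assumes the `lambdasoup` package; `select_any` is a hypothetical helper, not part of the library, and it splits naively on commas, so it will mishandle selectors that contain a comma inside an attribute value.

```ocaml
(* Requires the lambdasoup package. *)
open Soup

(* Hypothetical helper, not part of Lambda Soup: evaluate each
   comma-separated selector on its own and concatenate the matches. *)
let select_any (selectors : string) (root : _ node) : element node list =
  String.split_on_char ',' selectors
  |> List.map String.trim
  |> List.map (fun sel -> root $$ sel |> to_list)
  |> List.concat

(* Made-up sample markup for illustration. *)
let soup =
  parse {|<p class="bg1">a</p><p class="bg2">b</p><p class="bg3">c</p>|}

let texts = select_any ".bg1, .bg3" soup |> List.map R.leaf_text

let () = List.iter print_endline texts
```

Results come back grouped by selector rather than in document order, and an element matching more than one selector appears more than once, so this is not a drop-in replacement for real selector-list support.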
Is there a reason for throwing away namespace prefixes? https://github.com/aantron/lambda-soup/blob/master/src/soup.ml#L122-L124
In some HTML manipulation tasks, you need to create a copy of a node without deleting the original. For example, if you are making a page ToC, you need to...
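One way to approximate a copy without a dedicated function is to serialize the element and parse it again. This is only a sketch, assuming the `lambdasoup` package: `copy_element` is a hypothetical helper, and it relies on `parse` accepting the serialized fragment and on the copy being the first element with the same tag name in the parsed result.

```ocaml
(* Requires the lambdasoup package. *)
open Soup

(* Hypothetical helper, not part of Lambda Soup: deep-copy an element by
   round-tripping it through to_string and parse. *)
let copy_element (e : element node) : element node =
  parse (to_string e) |> R.select_one (name e)

(* Made-up sample markup: copy a heading into a table-of-contents div. *)
let soup = parse {|<div id="toc"></div><h2 id="intro">Intro</h2>|}
let heading = soup $ "h2"

let () = append_child (soup $ "#toc") (copy_element heading)

let () = print_endline (to_string soup)
```

The round trip costs a full serialize and parse per copy, and the copy keeps attributes such as `id`, so a duplicated `id` may need to be removed or renamed afterwards.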
I am making use of Lambda Soup by way of the static site generator Soupault. I'm using Kramdown to convert plain text into HTML. Most of the output has HTML entities,...
Right now, if lambdasoup encounters a non-standard element in ``, it moves that element to ``. ``` utop # Soup.parse {| |} |> Soup.to_string ;; - : string = "...
Pretty much what the title says. I have added a few test cases all of which pass with flying colours. Let me know if anything needs to be done. One...