jsoup
jsoup copied to clipboard
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
after updating dependency jsoup from 1.13.1 to 1.15.2 I noticed some of our QA tests failing: mailText= Jsoup.parse(mailContent).wholeText(); The tests expect in mailText the lines to end with "\r\n" but...
Hey, the [ns|* selector](https://www.w3.org/TR/selectors-3/#univnmsp) unfortunately doesn't find any elements. ```kotlin @Test fun `Can find all ac namespace elements`() { val html = """Hello, World!p in ac namespaceimg in ac namespacepage...
Hi all, When upgrading from 1.15.1 to 1.15.2 I appear to have encountered unexpected insertion of spaces - is this a bug or desired behaviour? In this test with a...
Would it be possible to include the HTML5 Tags `details` and `summary` in `Safelist.relaxed`? https://developer.mozilla.org/de/docs/Web/HTML/Element/details Or are there reasons why those two tags cannot be considered safe?
I'm trying to adopt `Jsoup.clean(String, Safelist)` to secure my code. Most things I throw at v1.15.2 work as expected... but not the following: ``` Assertions.assertEquals(Jsoup.clean("Hallo", Safelist.relaxed()), "Hallo"); ``` It results...
Some websites that have tools to try out code online, like for CSS for example, you could find a feature that prints out the code that was used to make...
Hey I have an issue when trying to use the same connection after submitting the form. The method `newRequest()` makes copy `data` params and it leads that this request preserve...
We are facing the issue in parsing HTML with attr values that look JSON format with Jsoup1.14.3.jar . But it's parsing fine with jsoup1.13.1.jar Jsoup version: 1.14.3 ## source.html file...
# Before ``` Benchmark Mode Cnt Score Error Units JMHBenchmark.benchmarkSmall avgt 5 7.382 ± 0.940 us/op JMHBenchmark.benchmarkSmall:·gc.alloc.rate avgt 5 8572.806 ± 1082.658 MB/sec JMHBenchmark.benchmarkSmall:·gc.alloc.rate.norm avgt 5 72952.001 ± 0.001 B/op...
This might be overkill and we might be able to decrease the size of this buffer for small strings to be parsed. This should thus reduce memory allocation. Here is...