Jonathan Hedley
Jonathan Hedley
Hi there, I can't repro it. Here's a simple example: ```java import org.jsoup.*; import org.jsoup.nodes.Document; class MavenDirCheck { public static void main(String[] args) throws Exception { String url = "https://repo1.maven.org/maven2/org/apache/cxf/cxf-rt-rs-service-description-common-openapi/4.0.6/";...
OK I think I have a repro based on how the server doesn't return a charset in the header and we try to fast-path the detection parse. May be an...
Ok, I believe I have found and fixed the root cause. These Maven directory pages do not send a charset header, so jsoup tries to be efficient about issuing a...
Great, thank you for the report and the confirmation. This one was pernicious so I'm glad we've found it
Thanks, fixed.
Please ping me (jsoup author) if you hit any rough edges in jsoup when implementing this, or if you have ideas on how the library can be improved, or have...
Hi @acornall, conceptually I definitely think it would be good to build out coverage for the Cleaner. As it stands this fuzzer could just about be a unit test though...
@sjamesr thanks for maintaining this library. Is there a path to accepting this PR, or to publish as an mjar with a real module-info.java? It would make adoption much simpler....