crux
crux copied to clipboard
Preserve <br>?
I tried extracting https://us13.campaign-archive.com/?u=67bd06787e84d73db24fb0aa5&id=c3e998f811&e=7bc177b38a
and then rendering it. Looks broken since the
are extracted. Can I send a PR to add them back? What's the best way to do this?
Sure, PRs are always welcome! See CONTRIBUTIONS for how to send one.
At a minimum, please add a new test with the content you are trying to parse, and then modify the source so that your new text is parsed correctly (while ensuring that existing tests also continue to pass.)
Decided to use mozilla/readability instead since it's a bit more robust. Thanks!
True, it’s been around longer & has more maintainers.
Though, it’s written in JavaScript, which precludes many usages of it, especially in Android apps. Crux’s predecessor, Snacktory, started as a Java clone of Readability.
In Android I'm injecting it into a WebView and extracting output with webview.evaluateJavascript 😅