webpage-rs
Small Rust library to fetch info about a web page: title, description, language, HTTP info, RSS feeds, OpenGraph, Schema.org, and more.
For example, a URL like **http://äläomista.fi/** is legal and works with a browser. There is a catch: the web client should properly translate DNS names containing non-ASCII (UTF-8) characters,...
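The translation mentioned above is usually done by converting each non-ASCII label of the host name to Punycode (RFC 3492) and prefixing it with `xn--`. The following is a minimal sketch of that step using only the standard library. The function names `punycode_encode` and `host_to_ascii` are hypothetical and not part of webpage-rs, and the sketch assumes the host is already lowercase and normalized; full IDNA processing (case mapping, normalization, validity checks, overflow handling for very long input) is omitted.

```rust
// Punycode parameters from RFC 3492, section 5.
const BASE: u32 = 36;
const T_MIN: u32 = 1;
const T_MAX: u32 = 26;
const SKEW: u32 = 38;
const DAMP: u32 = 700;
const INITIAL_BIAS: u32 = 72;
const INITIAL_N: u32 = 128;

/// Maps a digit value to its Punycode character:
/// 0..=25 become 'a'..='z', 26..=35 become '0'..='9'.
fn digit(d: u32) -> char {
    if d < 26 {
        (b'a' + d as u8) as char
    } else {
        (b'0' + (d - 26) as u8) as char
    }
}

/// Bias adaptation function from RFC 3492, section 6.1.
fn adapt(mut delta: u32, num_points: u32, first_time: bool) -> u32 {
    delta /= if first_time { DAMP } else { 2 };
    delta += delta / num_points;
    let mut k = 0;
    while delta > ((BASE - T_MIN) * T_MAX) / 2 {
        delta /= BASE - T_MIN;
        k += BASE;
    }
    k + ((BASE - T_MIN + 1) * delta) / (delta + SKEW)
}

/// Hypothetical helper: Punycode-encodes one label (without the "xn--" prefix).
/// No overflow checks, so it is only suitable for short DNS labels.
fn punycode_encode(label: &str) -> String {
    let input: Vec<u32> = label.chars().map(|c| c as u32).collect();
    // Basic (ASCII) code points are copied through first, in order.
    let mut output: String = label.chars().filter(|c| c.is_ascii()).collect();
    let basic_len = output.len() as u32;
    if basic_len > 0 {
        output.push('-');
    }
    let (mut n, mut delta, mut bias, mut handled) = (INITIAL_N, 0u32, INITIAL_BIAS, basic_len);
    while (handled as usize) < input.len() {
        // Smallest code point that has not been encoded yet.
        let m = input.iter().copied().filter(|&c| c >= n).min().unwrap();
        delta += (m - n) * (handled + 1);
        n = m;
        for &c in &input {
            if c < n {
                delta += 1;
            }
            if c == n {
                // Emit delta as a generalized variable-length integer.
                let mut q = delta;
                let mut k = BASE;
                loop {
                    let t = if k <= bias {
                        T_MIN
                    } else if k >= bias + T_MAX {
                        T_MAX
                    } else {
                        k - bias
                    };
                    if q < t {
                        break;
                    }
                    output.push(digit(t + (q - t) % (BASE - t)));
                    q = (q - t) / (BASE - t);
                    k += BASE;
                }
                output.push(digit(q));
                bias = adapt(delta, handled + 1, handled == basic_len);
                delta = 0;
                handled += 1;
            }
        }
        delta += 1;
        n += 1;
    }
    output
}

/// Hypothetical helper: converts every non-ASCII label of a host name to "xn--" form.
fn host_to_ascii(host: &str) -> String {
    host.split('.')
        .map(|label| {
            if label.is_ascii() {
                label.to_string()
            } else {
                format!("xn--{}", punycode_encode(label))
            }
        })
        .collect::<Vec<_>>()
        .join(".")
}
```

Calling `host_to_ascii` on the host part of a URL before the DNS lookup yields an ASCII-only name that resolvers accept. In real code an established IDNA implementation (for example the `idna` crate) is likely the safer choice, since it also handles normalization and validation.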
According to reports from our users, webpage-rs is unable to parse info from the https://oko.press/ website. I was able to reproduce this by putting the site URL into the from_url test case....
Updates the requirements on [html5ever](https://github.com/servo/html5ever) to permit the latest version. Dependabot will resolve any conflicts with this PR as long as you don't...
It would be useful to cap the size of the retrieved HTTP body, for example at 1 MB.
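One way such a cap could work is to wrap the response stream in `std::io::Read::take`, which stops reading after a fixed number of bytes. The sketch below is only an illustration under that assumption: `read_limited` is a hypothetical helper, not an existing webpage-rs function, and it works on any `Read` source rather than on the HTTP client the library actually uses.

```rust
use std::io::{self, Read};

/// Hypothetical helper: reads at most `max_bytes` from `reader`.
/// The returned flag is true when the source held more data than the limit.
fn read_limited<R: Read>(reader: R, max_bytes: u64) -> io::Result<(Vec<u8>, bool)> {
    let mut body = Vec::new();
    // Request one byte beyond the limit so that a body of exactly
    // `max_bytes` can be told apart from one that was cut off.
    let mut limited = reader.take(max_bytes.saturating_add(1));
    limited.read_to_end(&mut body)?;
    let truncated = body.len() as u64 > max_bytes;
    if truncated {
        body.truncate(max_bytes as usize);
    }
    Ok((body, truncated))
}
```

With a 1 MB cap the call would be `read_limited(response_stream, 1024 * 1024)`, where `response_stream` stands for whatever `Read` implementation delivers the body. Returning the truncation flag lets the caller decide whether to parse the partial HTML or report an error.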