Feature improvement in HTML reports
The default HTML report is excellent, but even on medium-sized sites it can easily run to 20 MB or so. I wonder whether a more limited 'management overview' option would be useful. For instance, it could consist of just the existing 'Summary' page.
Hi @DavidMatthewson,
thank you for the suggestion. Yes, the HTML report can be really big, especially for large websites.
The crawler parameter --analyzer-filter-regex lets you select only specific analyzers to run. However, the analyzers you exclude are then not executed at all, so the recommendations they would produce do not appear even in the summary.
I was wondering how we could make it possible to define what should/should not be displayed in an HTML report.
What makes the most sense to me is an optional parameter that lists which parts of the HTML report to include. By default everything would stay as before, but you could set e.g. --html-report-options="summary,visited-urls,best-practices,accessibility,security" and only those sections would be included in the HTML report.
What do you think about this?
Apart from this, the JavaScript files are also bulky, and they are not formatted. The HTML file is big because it combines all the inline CSS and JavaScript into one file.
Thanks for the prompt and helpful feedback. Using flags such as "summary" would work well for me. If it revealed something 'odd', I could then rerun the job without the flag to dig deeper. I'll also try the regex filters, which I have not yet used.
Thanks!
Hi @DavidMatthewson,
thank you for the feature request. I have implemented the --html-report-options parameter as I suggested. Even if you have a large website whose full HTML report would be several MB, a report that includes only some sections will be just tens of kB.
I will strive to release version 1.0.9 including this improvement in the next few hours.
From README.md:
| Parameter | Description |
|---|---|
| `--html-report-options=<sections>` | Comma-separated list of sections to include in HTML report. Available sections: summary, seo-opengraph, image-gallery, video-gallery, visited-urls, dns-ssl, crawler-stats, crawler-info, headers, content-types, skipped-urls, caching, best-practices, accessibility, security, redirects, 404-pages, slowest-urls, fastest-urls, source-domains. Default: all sections. |
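
If you generate the option value from a script, a typo in a section name is easy to make. Below is a minimal sketch in Python of a helper that checks the names against the README list above and builds the argument string. Note that `build_html_report_option` and `AVAILABLE_SECTIONS` are hypothetical names for illustration only and are not part of the crawler; how the crawler itself reacts to an unknown section name is not covered here.

```python
# Section names copied from the README excerpt above.
AVAILABLE_SECTIONS = [
    "summary", "seo-opengraph", "image-gallery", "video-gallery",
    "visited-urls", "dns-ssl", "crawler-stats", "crawler-info",
    "headers", "content-types", "skipped-urls", "caching",
    "best-practices", "accessibility", "security", "redirects",
    "404-pages", "slowest-urls", "fastest-urls", "source-domains",
]


def build_html_report_option(sections):
    """Return the --html-report-options argument for the given sections.

    Hypothetical helper, not part of the crawler: it only validates the
    names against the README list and joins them with commas.
    """
    unknown = [s for s in sections if s not in AVAILABLE_SECTIONS]
    if unknown:
        raise ValueError(f"Unknown report sections: {', '.join(unknown)}")
    return "--html-report-options=" + ",".join(sections)


if __name__ == "__main__":
    # A small "management overview" style selection.
    print(build_html_report_option(["summary", "best-practices", "security"]))
```

The returned string can be appended to your usual crawler command line; leaving the parameter out keeps the default of all sections.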
Many thanks - I'm currently testing it and will report back as soon as I can.