Add parameter to exclude certain resources
Currently, Browsertrix crawler parameters give us fine-grained control over which pages are fetched into the ZIM.
However, all resources found on the page are fetched.
This can be a problem on some websites, where it produces a very big ZIM file and at least a "lite" version would be preferred.
See https://github.com/openzim/zim-requests/issues/774#issuecomment-1880996879 for a real-world example.
Should we consider adding a feature to exclude certain resources from the ZIM?
Ideally this would be done in the crawler, so that we do not produce big temporary WARC files for nothing, but we could also do it on our own side (in warc2zim). This is hence an "overview" ticket, which will have to be linked to a specific issue in the crawler or warc2zim repo if we agree that this makes sense.
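
To make the idea more concrete, here is a minimal sketch of what such a filter could look like if it were applied on the warc2zim side. Everything in it is an assumption for discussion: the function name `should_exclude`, the two kinds of rules (MIME type prefixes and URL regexes), and the "lite" profile are hypothetical and do not correspond to any existing crawler or warc2zim option.

```python
import re
from typing import Iterable, Pattern


def should_exclude(
    url: str,
    mime_type: str,
    url_patterns: Iterable[Pattern[str]],
    mime_prefixes: Iterable[str],
) -> bool:
    """Return True if the resource should be left out of the ZIM.

    Hypothetical helper: neither the name nor the rule format exists
    in warc2zim or the crawler today.
    """
    # Normalise the MIME type: drop parameters such as "; charset=utf-8".
    base_mime = mime_type.split(";", 1)[0].strip().lower()
    if any(base_mime.startswith(prefix) for prefix in mime_prefixes):
        return True
    # Fall back to URL-based rules (e.g. file extensions).
    return any(pattern.search(url) for pattern in url_patterns)


# Hypothetical "lite" profile: drop video, audio and PDF downloads.
LITE_URL_PATTERNS = [re.compile(r"\.pdf(\?|$)", re.IGNORECASE)]
LITE_MIME_PREFIXES = ["video/", "audio/"]
```

A check like this could be run on each WARC record before it is added to the ZIM. Doing the equivalent in the crawler would additionally avoid downloading the resource at all, which is why the crawler side is the preferred place.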