contao icon indicating copy to clipboard operation
contao copied to clipboard

Extended docs for the Crawler in the back end

Open Toflar opened this issue 1 year ago • 6 comments

Draft because the to.contao.org link is up for discussion: https://github.com/contao/to.contao.org/pull/80

Toflar avatar Nov 09 '23 13:11 Toflar

This is ready now.

Toflar avatar Nov 10 '23 12:11 Toflar

The translation file should have a %s wildcard and the URL should be added using sprintf().

https://github.com/contao/contao/blob/d50e67901a36692f57f1a27680dced916ce6ddd6/core-bundle/contao/languages/en/default.xlf#L1205-L1207

https://github.com/contao/contao/blob/d50e67901a36692f57f1a27680dced916ce6ddd6/core-bundle/contao/controllers/BackendMain.php#L187

leofeyer avatar Dec 11 '23 15:12 leofeyer

Ah right, that's why it was blocked. Updated - while doing that I also migrated to the translator because we had to do this anyway at some point.

Toflar avatar Dec 11 '23 16:12 Toflar

I would have preferred it if you had just changed the language string. 🙈

leofeyer avatar Dec 13 '23 11:12 leofeyer

Moreover, make sure you analyze the debug log to see which URLs are crawled exactly. You can ask Contao to skip links from being crawled. Please refer to the Contao docs for more information.

What are you trying to tell the user? Is it the crawler might crawl fewer URLs than you expect, so check the debug log or is it you can exclude links from crawling? I don‘t quite understand the intention here.

leofeyer avatar Jan 19 '24 16:01 leofeyer

you can exclude links from crawling

This one. If the log contains URLs you don't want to have crawled (aka it takes too long), you can exclude them.

Toflar avatar Jan 26 '24 14:01 Toflar