
Improve Sitecrawl to Exclude paths

Open adamsonwalter opened this issue 1 year ago • 0 comments

I pulled in /blog/, but I want to exclude /blog/category and /blog/archives.

When I tried this, it did not work: the crawler seemed confused by being told to include /blog/ while excluding, for example, /blog/category.

There was also a big difference between including /blog and including /blog/.

The former only included one page, whereas the latter crawled all the pages containing /blog/.

Because I cannot exclude /blog/category and /blog/archives, I have hundreds of unnecessary pages. These take up space on your servers and are also included when crawling for answers.

Can this be fixed so that paths can be excluded?
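
To make the request concrete, here is a minimal Python sketch of the matching behaviour I would expect. It is only an illustration under my own assumptions, not WebWhiz's actual crawler code: the names `should_crawl`, `include`, and `exclude` are hypothetical, and `example.com` is a placeholder. The assumptions are that exclude rules take precedence over include rules, and that /blog and /blog/ are treated as the same prefix, matched on whole path segments.

```python
from urllib.parse import urlsplit


def _matches(path: str, prefix: str) -> bool:
    """Return True if path equals prefix or lies beneath it.

    Trailing slashes are ignored, so "/blog" and "/blog/" behave the same,
    and matching is done on whole segments, so "/blog" does not match
    "/blogroll".
    """
    prefix = prefix.rstrip("/") or "/"
    path = path.rstrip("/") or "/"
    if prefix == "/":
        return True
    return path == prefix or path.startswith(prefix + "/")


def should_crawl(url: str, include: list[str], exclude: list[str]) -> bool:
    """Hypothetical filter: exclude rules win over include rules."""
    path = urlsplit(url).path or "/"
    if any(_matches(path, p) for p in exclude):
        return False
    # An empty include list is assumed to mean "crawl everything not excluded".
    return not include or any(_matches(path, p) for p in include)


include = ["/blog/"]
exclude = ["/blog/category", "/blog/archives"]

for url in (
    "https://example.com/blog/my-post",
    "https://example.com/blog/category/news",
    "https://example.com/blog/archives/2023",
    "https://example.com/about",
):
    print(url, should_crawl(url, include, exclude))
```

With rules like these, a post under /blog/ would be kept, while anything under /blog/category or /blog/archives would be skipped, regardless of whether the include path was entered with or without the trailing slash.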

[Screenshots attached: 2023-10-13 at 9:52:47 am and 2023-10-13 at 9:54:21 am]

adamsonwalter, Oct 12 '23 23:10