python-sitemap

Mini website crawler to make sitemap from a website.

Results 26 python-sitemap issues

Hi! Could you please tell me how to exclude web pages with meta robots "noindex"?
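One possible approach to the question above, as a minimal sketch only: detect a robots meta tag with the standard-library `html.parser` and skip such pages before they are written to the sitemap. The names `RobotsMetaParser` and `is_noindex` are hypothetical and are not part of python-sitemap.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values may be None (e.g. <meta name>), so default to "".
        attr_map = {key.lower(): (value or "") for key, value in attrs}
        if attr_map.get("name", "").lower() == "robots":
            for part in attr_map.get("content", "").split(","):
                self.directives.add(part.strip().lower())


def is_noindex(html):
    """Return True if the page declares 'noindex' in a robots meta tag."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives
```

A crawler could call `is_noindex(page_source)` after fetching a page and leave the URL out of the output when it returns `True`. This sketch does not cover the `X-Robots-Tag` HTTP header.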

For large sitemaps, it's tough to see what changed when you rebuild it. That's because the URLs are not written in alphabetical order, so the diff doesn't work to see...
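As an illustration of the idea in this issue, the sketch below deduplicates and sorts URLs before writing them, so that two builds of the same site produce a stable, diff-friendly file. The function name `build_sitemap` is hypothetical and does not reflect how python-sitemap writes its output.

```python
from xml.sax.saxutils import escape


def build_sitemap(urls):
    """Return sitemap XML with unique URLs in alphabetical order."""
    lines = [
        '<?xml version="1.0" encoding="UTF-8"?>',
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">',
    ]
    # sorted(set(...)) removes duplicates and fixes the order between runs.
    for url in sorted(set(urls)):
        lines.append(f"  <url><loc>{escape(url)}</loc></url>")
    lines.append("</urlset>")
    return "\n".join(lines) + "\n"
```

Sorting requires holding all URLs in memory until the crawl finishes, which is a trade-off against writing each URL as soon as it is discovered.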

Iframes are sometimes used to have parts of sites controlled by a CMS. Here we add the option of inspecting the iframe's content and for any links that are to...

Web apps that use React, Vue.js, Svelte, etc. and client-side rendering don't work, because this package does not execute the JS, and so doesn't see all the content, thereby missing...

Iframes are sometimes used to have parts of sites controlled by a CMS. It would be nice to have the option of inspecting the iframe's content and for any links that...

If I use the `--parserobots` argument the crawler simply crawls the main domain but doesn't follow any links despite the allow directives. You can try it on `https://google.com` to see...
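To help reproduce this kind of report offline, here is a small sketch that checks URLs against robots.txt rules using the standard-library `urllib.robotparser`. Whether python-sitemap evaluates rules this way is an assumption, and `make_checker` is a hypothetical helper.

```python
from urllib.robotparser import RobotFileParser


def make_checker(robots_txt, user_agent="*"):
    """Build a function that reports whether a URL may be fetched."""
    parser = RobotFileParser()
    # parse() accepts the robots.txt content as an iterable of lines,
    # so no network request is needed.
    parser.parse(robots_txt.splitlines())

    def allowed(url):
        return parser.can_fetch(user_agent, url)

    return allowed


# Example rules, invented for illustration.
rules = "User-agent: *\nDisallow: /private/\n"
allowed = make_checker(rules)
```

With these example rules, `allowed("https://example.com/public")` is permitted and `allowed("https://example.com/private/page")` is not. The standard-library parser applies the first matching rule in file order, which can differ from crawlers that prefer the most specific match, so results for files that mix `Allow` and `Disallow` are worth checking by hand.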