metafacture-core
                                
                                
                                
                                    metafacture-core copied to clipboard
                            
                            
                            
                        Add SitemapReader originally developed in OERSI
Reads sitemap from URL, sends each loc URL to the receiver.
e.g. "https://hoou.de/sitemap.xml" | read-sitemap | open-http ... in a Flux workflow to process every document linked in the sitemap.
Supports paging via from= query string parameter in the sitemap URL.
Assigning @dr0i for code review due to the (albeit loose) paging relation to #464.
We don't have a dedicated issue for this, maybe @TobiasNx could do functional review here?
Discussed in our planning meeting: we're putting this on hold to investigate if we actually need this kind of specific module for reading sitemaps, or if we can build something based on existing modules and the upcoming paging support (https://github.com/metafacture/metafacture-core/issues/464).
Kudos, SonarCloud Quality Gate passed!    
 
 0 Bugs
 
 0 Vulnerabilities
 
 0 Security Hotspots
 
 1 Code Smell
 No Coverage information
 0.0% Duplication