thsm-kb

Results 7 issues of thsm-kb

It would be great if it was possible to activate "continous mode" and get all links from all pages visited, until "continous mode" is switched of.

Great tool - thank you! Suggestion: The possibility to add OTHER as a language. Lets say I want to find English and French in a multi-language set. I want to...

enhancement

### Context From a european point of view cookies are troublesome. Most sítes are forced to ask the user to accept cookies due to the ePrivacy Directive. And we don't...

enhancement
frontend crawler settings

### Browsertrix Cloud Version v1.9.0-beta.2-896c3cc ### What did you expect to happen? What happened instead? some urls inside javascript is not extracted. M3r ### Step-by-step reproduction instructions Crawling http://holbergsskrifter.dk/holberg-public/view?docId=skuespill%2FJeppe%2FJeppe.page;toc.depth=1;brand=&chunk.id=start ###...

bug

### Browsertrix Cloud Version v1.8.0-beta.2-3aebf2e ### What did you expect to happen? What happened instead? If I use https://www.sn.dk/sitemaps/term/Place.Sitemap.0.xml as a seed, it is not crawled. I do not get...

enhancement

### Browsertrix Cloud Version v1.8.0-b6f8c96 ### What did you expect to happen? What happened instead? Harvesting http://midtfjordradio.dk/ produces odd results regarding link with danish char ø. http://midtfjordradio.dk/St%C3%B8t.html finished crawl here:...

bug

Feature request: Add the possibility to have a notification when a scheduled job suddenly drops to a much lower size than the last.