SingleFile
Auto-settings rule based on "category"
Discussed in https://github.com/gildas-lormeau/SingleFile/discussions/986
Originally posted by thecheesemo June 23, 2022
I have been using SingleFile extensively to manually save pages for later reference. I do this mainly for archiving purposes and because I have a bad memory. I've since realized that I always want to save every news article I read. So I've started to define Auto-settings rules with a specific SingleFile profile for each URL of the news sites I visit. This profile auto-saves pages, and that works fine. However, like most people on the web now, I don't go to just one news source; I get most of my news from link aggregators like Google News and Reddit. I have to manually add each news website's top-level domain (TLD) to the Auto-settings rule with my news autosave profile.
Would it be possible to create an option in the Auto-settings that would let you define a profile based on the "category" of the TLD?
I don't even know if such a list exists that can be referenced, or how one would go about making one other than through a manual process. But if such a list does exist, or can be created easily enough, that would be a pretty great feature.
I did some searching, and it looks like all the APIs to look up categorized webpages are very limited (throttled) or require payment.
You could implement one yourself: https://github.com/domantasm96/URL-categorization-using-machine-learning. There are a number of projects on GitHub for URL classification that might suit this purpose.
I figured that there would at least be a pre-classified list of the top 1000 sites, but I'm having a hard time finding that, too.
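In the absence of a ready-made list, the manual approach discussed above could be sketched roughly as follows. This is only an illustration of the idea, not something SingleFile supports today: the domain list, the category names, the profile names, and the functions `category_for` and `profile_for` are all hypothetical.

```python
from urllib.parse import urlsplit

# Hypothetical hand-maintained list: domain -> category.
DOMAIN_CATEGORIES = {
    "bbc.com": "news",
    "reuters.com": "news",
    "github.com": "development",
}

# Hypothetical mapping from a category to a profile name.
CATEGORY_PROFILES = {"news": "news-autosave"}

DEFAULT_PROFILE = "default"


def category_for(url):
    """Return the category of the URL's host, or None if it is not listed."""
    host = (urlsplit(url).hostname or "").lower()
    labels = host.split(".")
    # Try the full host first, then each parent domain
    # (www.bbc.com, then bbc.com), but never the bare last label.
    for i in range(len(labels) - 1):
        candidate = ".".join(labels[i:])
        if candidate in DOMAIN_CATEGORIES:
            return DOMAIN_CATEGORIES[candidate]
    return None


def profile_for(url):
    """Pick the profile for the URL's category, falling back to the default."""
    return CATEGORY_PROFILES.get(category_for(url), DEFAULT_PROFILE)
```

With a lookup like this, a single rule per category would replace one rule per site: any URL whose host falls under a listed news domain resolves to the news autosave profile, and everything else keeps the default profile. The list itself would still have to be built by hand or generated by a classifier such as the one linked above.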