httrack
httrack copied to clipboard
Are ttf files not downloaded by default?
I see that my downloaded site is fetching
https://fonts.gstatic.com/s/sourcecodepro/v6/mrl8jkM18OlOQN8JLgasD9zbP97U9sKh0jjxbPbfOKg.ttf
but all other stuff is downloaded. I'm using -n -B
, and that seems to fetch everything (jquery from jquery.com, etc) and saves locally. However, not this ttf file.
Do I need to explicitly include the ttf file type filter in the command? Or is there some way to make it just download everything?
It's just a ttf file, so I can probably just easily add some post processing to get that one file and modify the URL. Good thing it's not like a gazillion files everywhere.
That depends greatly on what you mean by "deafault"
Most user think they can just put in a base URL and then HTTrack wll download just what the want.
When you do that HTTrack assumes a default rule for that link and every bage below it and will give a scan rule of
+*.png +*.gif +*.jpg +*.jpeg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar
This does not include .tiff
Now I recommend that you always delete that and use
-* +thesiteiwant.com*
and add more rules from there. This includes every thing including .tff that are found on the site.
Also HTTrack only gets URLs from known tag options, so if the url of that tff is in something unsupported the link wont be found.