httrack icon indicating copy to clipboard operation
httrack copied to clipboard

Are ttf files not downloaded by default?

Open trusktr opened this issue 7 years ago • 2 comments

I see that my downloaded site is fetching

https://fonts.gstatic.com/s/sourcecodepro/v6/mrl8jkM18OlOQN8JLgasD9zbP97U9sKh0jjxbPbfOKg.ttf

but all other stuff is downloaded. I'm using -n -B, and that seems to fetch everything (jquery from jquery.com, etc) and saves locally. However, not this ttf file.

Do I need to explicitly include the ttf file type filter in the command? Or is there some way to make it just download everything?

trusktr avatar May 30 '17 20:05 trusktr

It's just a ttf file, so I can probably just easily add some post processing to get that one file and modify the URL. Good thing it's not like a gazillion files everywhere.

trusktr avatar May 30 '17 20:05 trusktr

That depends greatly on what you mean by "deafault"

Most user think they can just put in a base URL and then HTTrack wll download just what the want.

When you do that HTTrack assumes a default rule for that link and every bage below it and will give a scan rule of +*.png +*.gif +*.jpg +*.jpeg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar This does not include .tiff

Now I recommend that you always delete that and use -* +thesiteiwant.com* and add more rules from there. This includes every thing including .tff that are found on the site.

Also HTTrack only gets URLs from known tag options, so if the url of that tff is in something unsupported the link wont be found.

mpheyse avatar Sep 06 '17 01:09 mpheyse