After using the Huggingface integration solution, will a model occupy twice the disk space?

Open yetone opened this issue 1 year ago • 4 comments

I'm very happy to see the Hugging Face integration. I'm curious whether it will cause Hugging Face models to occupy double the disk space (Dragonfly cache directory + Hugging Face cache directory)?

https://d7y.io/docs/next/operations/integrations/hugging-face/

yetone avatar May 15 '24 02:05 yetone

Yes. You can reduce the cache size by tuning Dfdaemon's GC configuration. Refer to https://d7y.io/docs/next/reference/configuration/dfdaemon.

gaius-qi avatar May 15 '24 03:05 gaius-qi

Thanks, I'll look into it!

yetone avatar May 15 '24 03:05 yetone

@gaius-qi Will there be any adverse effects if I completely use dragonfly's cache and disable hugging face's own cache? Specifically, our goal is to have different containers utilize the same model cache.

yetone avatar May 15 '24 05:05 yetone

If the model is already cached in Dragonfly (d7y), the Hugging Face SDK will download it through the HTTP proxy. That is slower, but it reduces disk usage.

gaius-qi avatar May 16 '24 02:05 gaius-qi
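
To see how much space the two caches actually take on a given host, one option is to measure both directories. The following is a minimal sketch, not something taken from the thread: the helper names `dir_size_bytes` and `report` are hypothetical, and the Dragonfly path is an assumption that should be replaced with the storage directory configured for your dfdaemon. The Hugging Face path is the default hub cache location when `HF_HOME` and `HF_HUB_CACHE` are not set.

```python
import os
from pathlib import Path


def dir_size_bytes(root: Path) -> int:
    """Sum the sizes of regular files under root, skipping symlinks."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = Path(dirpath) / name
            # The Hugging Face hub cache links snapshots/ entries to blobs/,
            # so skipping symlinks avoids counting the same blob twice.
            if path.is_symlink():
                continue
            total += path.stat().st_size
    return total


def report(paths: dict[str, Path]) -> dict[str, int]:
    """Return the size in bytes of each cache directory that exists."""
    return {label: dir_size_bytes(p) for label, p in paths.items() if p.is_dir()}


if __name__ == "__main__":
    caches = {
        # Default hub cache unless HF_HOME or HF_HUB_CACHE is set.
        "huggingface": Path.home() / ".cache" / "huggingface" / "hub",
        # Assumption: replace with the storage directory of your dfdaemon.
        "dragonfly": Path("/var/lib/dragonfly"),
    }
    for label, size in report(caches).items():
        print(f"{label}: {size / 1024**3:.2f} GiB")
```

Running it before and after changing the GC settings gives a rough picture of how much space each cache takes; directories that do not exist are left out of the report.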