Tim Esler
Tim Esler
@xiaoiker @chungyau97 there is a pytorch conversion of the two pretrained inception resnet v1 models (vggface and casia-web face) from davidsandberg/facenet here: https://github.com/timesler/facenet-pytorch Disclaimer: it is my repo.
I have not been able to reproduce this issue. Can you provide a complete, self-contained example?
Fixes #60
Hi @aakashg1999, does this only happen when there are no faces found in the frame?
@yueyihua can you let us know what versions are you using for: * facenet-pytorch * pytorch * torchvision * python
@OAfzal could the random output be because fastchat is loading the model with dtype `float16` rather than `bfloat16`?
@OAfzal I just saw that official support has been added for Falcon in https://github.com/huggingface/text-generation-inference, so you may be able to glean some insight there about how to get streaming working
Does the memory accumulate with each loop iteration or is it a one off jump of 300MB?
What do you need 128-dim embeddings for? It will probably be easier to either: 1. Add a single linear layer at the end of the model to do the mapping...
This feature would be amazing for CLI LLM chat apps!