FuryofV

Results 1 issues of FuryofV

I have an interest in this project, but my system VRAM is small so I prefer to use the [llama.cpp](https://github.com/ggerganov/llama.cpp)-based toolchain(ollama etc.) and GGUF quantization. However, the dual encoder architecture...