Fred Turkington
Results
22
comments of
Fred Turkington
your error is > flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZN3c104cuda9SetDeviceEi Check out https://github.com/Dao-AILab/flash-attention/issues/451
Would love this for image captioning with quantized speedup. The `kosmos-2` model from Microsoft would be another good candidate