Ovis
Ovis copied to clipboard
How to inference on V100 without Flash Attention?
I only have V100S GPU, and I got "RuntimeError: FlashAttention only supports Ampere GPUs or newer.". How to inference on V100 without Flash Attention?
The same problem, I hope ovis2.5 can use eager.