Alex Grishin
Results
1
comments of
Alex Grishin
Is the problem still there? I have the same speed with 0.4.0.post1 and 0.4.1. In both cases I have flash_attn installed and in both cases when openai server starts it...