Alex Grishin

Results: 1 comment by Alex Grishin

Is the problem still there? I get the same speed with 0.4.0.post1 and 0.4.1. In both cases I have flash_attn installed, and in both cases when the OpenAI server starts it...