SpkArtZen
Yes, I use the default model, Llama 3.1 7B.
Full logs: [logs.txt](https://github.com/user-attachments/files/17569720/logs.txt). I send a single request from the Python SDK. The behavior is the same with Postman and curl.
The main problem is that when I send a single request, even through Postman, the response is generated multiple times and degrades with each repetition. This happens with both the SDK and Postman. Also,...