SpkArtZen

Results 3 comments of SpkArtZen

Yes, i use default model llama 3.1 7B ![Знімок екрана 2024-10-30 102834](https://github.com/user-attachments/assets/691ed60f-27b1-4c11-9ff1-849c605d7ea8)

Full logs: [logs.txt](https://github.com/user-attachments/files/17569720/logs.txt) I send single request from python sdk. It works the same with postman and curl

The main problem is that when I send a request, even through Postman, the response is generated multiple times and degrades each time. The same with sdk and Postman. Also,...