Bao 一
Bao 一
@zifeng-radxa I can run the Llama3 int4 provided in the documentation but it gives the above issue when trying to run the in8 model converted as per the documentation. I...
@samchen8008 I tried v1.8.0-ee, v1.8.0-ce and v1.9.0-ee, and they all encountered the same problem.helmchart need to make additional configuration?
I tried to use sdk to upload, and observed the upload traffic, but returned a 500 error.: ``` python3 upload.py fail to upload model-00163-of-000163.safetensors with response code: 500, error: 500...