ZhangHaoQ
Results
1
comments of
ZhangHaoQ
@gangmuk Hi, thanks for the follow-up. Here are the details: Load (RPS): around 20 QPS (≈20 requests/sec) LLM model: nemo-12b GPU model: NVIDIA 5090 The crash usually happens under this...