Seungsu Baek
Seungsu Baek
Hi! I have a question about perf_analyzer and tritonserver's trace.json. --- SERVER ``` $ tritonserver --model-repository=/triton_dev/models --trace-file /triton_dev/experiments/trace/trace.json --trace-level TIMESTAMPS --trace-rate 1 ``` --- CLIENT ``` $ perf_analyzer -b 1...
Hi. I want to use pipeline parallelism and tensor parallelism or combination of it. In the examples directory, only internLM supports two parallelism and other models support just one of...
Hi. I am interested in your nice work. I want to get a parallel configuration for my server. I read your codes but it is hard to find some documents...