Katherine Yang
Katherine Yang
I meant that how did you get 300-500ms? But I agree, the metrics should behave the same way as it would have with Fastertransformer backend.
@krishung5 can you take a look? I can review sometimes next week. @du00cs can you send us the CLA? ref: https://github.com/triton-inference-server/server/blob/main/CONTRIBUTING.md
Closing issue due to lack of activity. Please re-open the issue if you would like to follow up with this issue
ping @rmccorm4 ?
Closing issue due to lack of activity. Please re-open the issue if you would like to follow up with this issue
Closing issue due to lack of activity. Please re-open the issue if you would like to follow up with this issue
Closing issue due to lack of activity. Please re-open the issue if you would like to follow up with this issue.
Closing issue due to lack of activity. Please re-open the issue if you would like to follow up with this issue
Closing issue due to lack of activity. Please re-open the issue if you would like to follow up with this issue
Closing issue due to lack of activity. Please re-open the issue if you would like to follow up with this issue