sagemaker-inference-toolkit
Default output function encodes results to JSON, which seems to add to response latency.
Describe the bug
By default, the accept type in the inference container appears to be application/json. The default encoder, which converts results to JSON, seems to add significantly to response latency. Is there a way to reduce the latency of the default response path?
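One way to cut the serialization cost is to return a binary format instead of JSON when the client asks for one. Below is a minimal sketch of a custom output_fn placed in an inference.py handler script, assuming a framework serving container that picks up user-provided handlers and a model that returns a NumPy-compatible array; the content_types and encoder helpers are from sagemaker_inference.

```python
# inference.py -- minimal sketch of a custom output handler.
# Assumes the serving container loads user-provided handler functions
# and that the prediction is array-like (both are assumptions here).
import numpy as np
from sagemaker_inference import content_types, encoder


def output_fn(prediction, accept):
    """Serialize the prediction, skipping JSON when the client requests a binary format."""
    if accept == content_types.NPY:
        # Raw NumPy bytes are much cheaper to produce than JSON for large arrays.
        return encoder.encode(prediction, content_types.NPY)
    if accept == content_types.CSV:
        return encoder.encode(prediction, content_types.CSV)
    # Fall back to the default JSON behaviour for everything else.
    return encoder.encode(np.asarray(prediction), content_types.JSON)
```

On the client side you would then invoke the endpoint with Accept set to application/x-npy (and deserialize the bytes accordingly); whether this noticeably reduces latency depends on how large the responses are.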
Bumping this up. Did you ever find a solution?