multi-model-server

Multi Model Server is a tool for serving neural net models for inference.

Results 105 multi-model-server issues

https://github.com/awslabs/mxnet-model-server/blob/master/plugins/endpoints/src/main/java/software/amazon/ai/mms/plugins/endpoint/ExecutionParameters.java#L26 'max_request_size' seems to be measured in bytes, not MB.

bug
good first issue
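The unit mismatch described above can be illustrated with a minimal sketch. It assumes the limit is compared against a payload length in bytes; the helper names `mb_to_bytes` and `exceeds_limit` are hypothetical and are not part of Multi Model Server.

```python
# Hypothetical helpers, not part of the MMS code base.
BYTES_PER_MB = 1024 * 1024


def mb_to_bytes(size_mb: float) -> int:
    """Convert a size given in megabytes to a whole number of bytes."""
    return int(size_mb * BYTES_PER_MB)


def exceeds_limit(payload: bytes, max_request_size: int) -> bool:
    """Return True if the payload is larger than a limit expressed in bytes."""
    return len(payload) > max_request_size


# A value of 6 read as bytes rejects almost everything;
# the same value read as MB allows payloads up to 6291456 bytes.
limit_if_mb = mb_to_bytes(6)
```

If the setting is indeed in bytes, documenting it that way (or converting once at load time, as above) avoids a limit that is off by a factor of 1048576.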

My input image has shape [batch, 3, None, None]. How can I represent an arbitrary size in the signature.json file? I think one solution is to remove image.resize(img_arr, w, h) in mxnet_vision_servion_service.py. But...

question

I'm getting bad inference results from ResNet ONNX models. The image of the cat used in the example is reported as a pool table when using ResNet v1 or v2....

question

Currently, the MMS_METRICS that are logged include CPUUtilization and memory usage. It would be great if they also included GPU utilization and GPU memory. This would make it...

enhancement
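One way such GPU metrics could be gathered is sketched below, assuming `nvidia-smi` is installed and on the PATH. The metric names (`GPUUtilization`, `GPUMemoryUsedMiB`, `GPUMemoryTotalMiB`) and both function names are hypothetical and do not reflect how MMS emits metrics.

```python
import subprocess

# Fields requested from nvidia-smi; output is one CSV row per GPU.
QUERY_FIELDS = "utilization.gpu,memory.used,memory.total"


def parse_gpu_metrics(csv_text):
    """Parse 'util, used, total' rows (no header, no units) into dicts."""
    metrics = []
    for index, line in enumerate(csv_text.strip().splitlines()):
        try:
            util, used, total = (float(v.strip()) for v in line.split(","))
        except ValueError:
            # Skip rows with unsupported or malformed fields.
            continue
        metrics.append({
            "gpu": index,
            "GPUUtilization": util,      # percent (hypothetical metric name)
            "GPUMemoryUsedMiB": used,    # hypothetical metric name
            "GPUMemoryTotalMiB": total,  # hypothetical metric name
        })
    return metrics


def read_gpu_metrics():
    """Query nvidia-smi; return an empty list if it is unavailable or fails."""
    try:
        result = subprocess.run(
            ["nvidia-smi", f"--query-gpu={QUERY_FIELDS}",
             "--format=csv,noheader,nounits"],
            capture_output=True, text=True, check=True, timeout=10,
        )
    except (OSError, subprocess.SubprocessError):
        return []
    return parse_gpu_metrics(result.stdout)
```

Separating parsing from the subprocess call keeps the parser testable on machines without a GPU.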

Having worked in a number of AI startups, I find the biggest limitation of the server is that it can only manage models on GPUs that are on a single server implementation....

feature request

Follow-up to https://github.com/awslabs/mxnet-model-server/issues/732#issuecomment-470214725 and https://github.com/awslabs/mxnet-model-server/issues/732#issuecomment-470224017

#### Short Background

Currently, there are 2 ways to register models:
- POSTing the model to the management API
- Writing the model to...

feature request

Check whether there are better JVM config values that can be used. Oracle publishes guidelines for JVM GC configuration in production setups: https://docs.oracle.com/cd/E40972_01/doc.70/e40973/cnf_jvmgc.htm#autoId0

enhancement

It would be helpful if model-archiver could exclude directories or files when creating an archive. Consider the use case given this file structure: `models/ test/ mms_server.py build/ ` If I...

enhancement
feature request
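The exclusion behaviour requested above could look roughly like the following sketch. It is not model-archiver's API: `collect_files` and its `exclude` parameter are hypothetical, and patterns are assumed to match file or directory base names only.

```python
import fnmatch
import os


def _is_excluded(name, patterns):
    """Return True if a base name matches any glob pattern."""
    return any(fnmatch.fnmatch(name, pattern) for pattern in patterns)


def collect_files(root, exclude=()):
    """List files under root (relative paths), skipping excluded names."""
    selected = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune excluded directories in place so os.walk never descends into them.
        dirnames[:] = sorted(d for d in dirnames if not _is_excluded(d, exclude))
        for name in sorted(filenames):
            if not _is_excluded(name, exclude):
                full_path = os.path.join(dirpath, name)
                selected.append(os.path.relpath(full_path, root))
    return selected
```

For the file structure in the issue, calling `collect_files(".", exclude=("test", "build"))` would leave out the `test/` and `build/` directories while keeping `models/` and `mms_server.py`.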

Jetsons do not use nvidia-smi, so the GPU count is reported inaccurately: 0 instead of 1.

bug
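A possible workaround for the miscount is sketched below: count GPUs with `nvidia-smi -L` when it is available, and fall back to a caller-supplied value when it is not. The function names and the `fallback` parameter are hypothetical; this is not how MMS currently detects GPUs, and how the fallback value should be determined on a Jetson is left open.

```python
import subprocess


def parse_gpu_list(output):
    """Count the 'GPU <n>: ...' lines printed by `nvidia-smi -L`."""
    return sum(1 for line in output.splitlines() if line.startswith("GPU "))


def count_gpus(smi_cmd="nvidia-smi", fallback=0):
    """Return the GPU count, or `fallback` if the tool is missing or fails."""
    try:
        result = subprocess.run(
            [smi_cmd, "-L"],
            capture_output=True, text=True, check=True, timeout=10,
        )
    except (OSError, subprocess.SubprocessError):
        # Tool not installed (as on Jetson boards) or it exited with an error.
        return fallback
    count = parse_gpu_list(result.stdout)
    return count if count > 0 else fallback
```

On a device without `nvidia-smi`, `count_gpus(fallback=1)` returns the supplied fallback instead of 0.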

It would be great to have a Swagger UI client to manage the number of workers, etc. It is fairly easy to integrate. See: https://swagger.io/tools/swagger-ui/

enhancement
question