multi-model-server issues

Execution parameters bug for MaxPayloadinMB

1

https://github.com/awslabs/mxnet-model-server/blob/master/plugins/endpoints/src/main/java/software/amazon/ai/mms/plugins/endpoint/ExecutionParameters.java#L26 'max_request_size' seems to refer to bytes, not mb.

ericangelokim

bug

good first issue

How to represent any size in the signature.json file

7

My input image is [batch, 3, None, None], how to represent any size in the signature.json file. I think a solution is to remove image.resize(img_arr, w, h) in mxnet_vision_servion_service.py. But...

wangce888

question

bad inference prediction results from ResNet50 ONNX models

3

I'm getting bad inference results from ResNet ONNX models. The image of the cat used in the example is reported as a pool table when using ResNet v1 or v2....

tahouse

question

Add GPU metrics to metrics logging

1

Currently MMS_METRICS that are logged include things like CPUUtilization and Memory usage. It would be great if these also included things like GPU usage and memory. This would make it...

mikeobr

enhancement

Multi-node awareness

1

Having worked in a number of AI startups the biggest limitation of the server is that it can only manage modes on GPUS which are on a single server implementation....

fish-finger

feature request

Batch Configuration In Model Archive

14

Follow up to https://github.com/awslabs/mxnet-model-server/issues/732#issuecomment-470214725 and https://github.com/awslabs/mxnet-model-server/issues/732#issuecomment-470224017 #### Short Background Currently, there are 2 ways to register models: - POSTing the model to the management API - Writing the model to...

erandagan

feature request

Production JVM server configuration

Check if there are better JVM config values that can be used. There are guidelines published by oracle for JVM GC configurations in production setup https://docs.oracle.com/cd/E40972_01/doc.70/e40973/cnf_jvmgc.htm#autoId0

vdantu

enhancement

Model Archive should allow excluding dirs/files

It would be helpful if model-archiver could exclude directories or files when creating an archive. Consider the use case given this file structure: `models/ test/ mms_server.py build/ ` If I...

mikeobr

enhancement

feature request

getAvailableGpu() inaccurate on Jetson

3

Jetsons do not use nvidia-smi, hence inaccurate gpu count, 0 instead of one.

ThomasDelteil

bug

Server a swagger client UI on 8081 to make it easier to manage

It would be great to have a swagger client UI to manage the number of workers etc. It is fairly easy to integrate. See: https://swagger.io/tools/swagger-ui/

ThomasDelteil

enhancement

question

multi-model-server
multi-model-server copied to clipboard

Metadata

Execution parameters bug for MaxPayloadinMB

How to represent any size in the signature.json file

bad inference prediction results from ResNet50 ONNX models

Add GPU metrics to metrics logging

Multi-node awareness

Batch Configuration In Model Archive

Production JVM server configuration

Model Archive should allow excluding dirs/files

getAvailableGpu() inaccurate on Jetson

Server a swagger client UI on 8081 to make it easier to manage

← Metadata

Owner

Metadata

multi-model-server multi-model-server copied to clipboard

Metadata

← Metadata

Owner

Metadata

multi-model-server
multi-model-server copied to clipboard