Serve, optimize and scale PyTorch models in production

432 serve issues, sorted by recently updated

Please have a look at the [FAQs](../../docs/FAQs.md) and the [Troubleshooting guide](../../docs/Troubleshooting.md); your query may already be addressed. Your issue may already be reported! Please search the [issue tracker](https://github.com/pytorch/serve/issues) before creating one....

enhancement
docker

Hi! I have WireGuard on my machine and a few other devices connected to it. Let's say my WireGuard IP is 10.0.0.1; then, in the `config.properties` file, I change...
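
A minimal sketch of what that `config.properties` change might look like, assuming the goal is to bind TorchServe's listeners to the WireGuard address (10.0.0.1 comes from the issue; the ports are TorchServe's defaults):

```properties
# Bind the inference/management/metrics APIs to the WireGuard interface
inference_address=http://10.0.0.1:8080
management_address=http://10.0.0.1:8081
metrics_address=http://10.0.0.1:8082
```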

help wanted
support

Add an example demonstrating video inference, where a video payload is sent and predictions are made for each frame, e.g. activity recognition across a video stream.
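
No such example exists in the repo yet; the sketch below shows one possible shape for it, assuming OpenCV for frame decoding and a per-frame classification model. The `VideoHandler` name, the temp-file decode step, and the minimal preprocessing are all illustrative, not an official TorchServe example.

```python
# Hypothetical handler sketch: decode an uploaded video and run the model
# on every frame. Illustrative only, not an official TorchServe example.
import tempfile

import cv2
import torch
from ts.torch_handler.base_handler import BaseHandler


class VideoHandler(BaseHandler):
    def preprocess(self, data):
        # The video arrives as raw bytes in the request body.
        video_bytes = data[0].get("data") or data[0].get("body")
        # cv2.VideoCapture wants a path, so spill the bytes to a temp file.
        with tempfile.NamedTemporaryFile(suffix=".mp4", delete=False) as f:
            f.write(video_bytes)
            path = f.name
        frames = []
        cap = cv2.VideoCapture(path)
        ok, frame = cap.read()
        while ok:
            # HWC uint8 BGR -> CHW float; a real handler would also
            # resize/normalize to match the model's expected input.
            frames.append(torch.from_numpy(frame).permute(2, 0, 1).float())
            ok, frame = cap.read()
        cap.release()
        return torch.stack(frames)

    def postprocess(self, inference_output):
        # One prediction per frame, returned as a single JSON-serializable list.
        return [inference_output.argmax(dim=1).tolist()]
```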

enhancement
help wanted

I have two models, model0 and model1, and two GPUs, gpu0 and gpu1. I want to pin model0 to gpu0 and model1 to gpu1, so that model0's work always runs on gpu0 and model1...
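
TorchServe assigns each worker a `gpu_id` round-robin across visible GPUs rather than per model. One possible workaround, sketched below under the assumption that a custom handler is acceptable, is to override the assigned device by model name (the `PinnedHandler` class and `PINNED_DEVICE` mapping are hypothetical):

```python
# Workaround sketch: pin each model to a fixed GPU from inside its handler,
# overriding the gpu_id TorchServe assigned to the worker. Illustrative only.
import torch
from ts.torch_handler.base_handler import BaseHandler

# Hypothetical mapping for this issue's setup: model0 -> gpu0, model1 -> gpu1.
PINNED_DEVICE = {"model0": "cuda:0", "model1": "cuda:1"}


class PinnedHandler(BaseHandler):
    def initialize(self, context):
        super().initialize(context)  # loads the model on TorchServe's choice
        pinned = PINNED_DEVICE.get(context.model_name)
        if pinned is not None:
            self.device = torch.device(pinned)
            self.model.to(self.device)  # move the model to the pinned GPU
```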

enhancement

Symptom: an `ab` test sent out 11,000 requests. The last inference request never got a response, even though the backend processed it successfully. Root cause: [applyAsync](https://github.com/pytorch/serve/blob/master/frontend/server/src/main/java/org/pytorch/serve/workflow/WorkflowManager.java#L381) causes the order of thread completion...
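
The hazard itself is not Java-specific: under asynchronous dispatch, completion order need not match submission order, so logic that assumes the last-submitted request finishes last can lose it. A small Python illustration of the same effect (TorchServe's frontend is Java; this is only an analogy):

```python
# Completion order != submission order under async dispatch; analogous to
# the applyAsync ordering described above, shown here with a thread pool.
import random
import time
from concurrent.futures import ThreadPoolExecutor, as_completed


def handle(i):
    time.sleep(random.random() / 100)  # simulate variable backend latency
    return i


with ThreadPoolExecutor(max_workers=4) as pool:
    futures = [pool.submit(handle, i) for i in range(10)]
    completed = [f.result() for f in as_completed(futures)]

print(completed)  # almost never [0, 1, ..., 9]
```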

bug

## Context Hi! I have trained Mask R-CNN with detectron2 and exported a TorchScript model (model.ts) following [these](https://github.com/facebookresearch/detectron2/tree/main/tools/deploy) instructions (I used scripting). Then I used [this](https://github.com/pytorch/serve/tree/master/examples/object_detector/maskrcnn) example to start a...
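
When debugging this kind of failure it can help to first confirm the exported `model.ts` loads and runs outside TorchServe; a minimal sanity-check sketch (the dummy input shape and dtype are guesses; detectron2's scripted export defines the real signature):

```python
# Sanity check: load and run the scripted model outside TorchServe before
# debugging the serving side. The input format below is illustrative only.
import torch

model = torch.jit.load("model.ts", map_location="cpu")
model.eval()

dummy = torch.randint(0, 256, (3, 480, 640), dtype=torch.uint8)  # CHW image
with torch.no_grad():
    out = model(dummy)
print(type(out))
```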

bug

## Context I'm trying to increase the configuration parameter `default_response_timeout` to a value higher than 120 seconds, but it seems that neither the `default_response_timeout` parameter nor the environment variable `TS_DEFAULT_RESPONSE_TIMEOUT` are...
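
For reference, the two mechanisms the report says are being ignored look like this (the 300-second value is illustrative):

```properties
# config.properties — raise the per-request response timeout (seconds)
default_response_timeout=300
```

or, equivalently, exporting `TS_DEFAULT_RESPONSE_TIMEOUT=300` in the environment before starting TorchServe.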

bug
workflow

## Is your feature request related to a problem? Please describe. Currently, TorchServe adds headers that prevent caching of the inference results: https://github.com/pytorch/serve/blob/30f83500b0850e26ec55581f48a9307b1986f9f9/frontend/server/src/main/java/org/pytorch/serve/util/NettyUtils.java#L187-L190 This prevents some reverse proxies like `nginx` from...
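
A quick way to see the headers in question on a running server (the model name and input file below are placeholders):

```python
# Inspect the cache-related response headers on an inference call.
# "mnist" and sample.png stand in for a registered model and a real input.
import requests

with open("sample.png", "rb") as f:
    resp = requests.post("http://localhost:8080/predictions/mnist", data=f)

for name in ("Cache-Control", "Pragma", "Expires"):
    print(name, "=>", resp.headers.get(name))
```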

enhancement
help wanted

Please have a look at the [FAQs](../../docs/FAQs.md) and the [Troubleshooting guide](../../docs/Troubleshooting.md); your query may already be addressed. Your issue may already be reported! Please search the [issue tracker](https://github.com/pytorch/serve/issues) before creating one....

bug
workflow

Please have a look at the [FAQs](../../docs/FAQs.md) and the [Troubleshooting guide](../../docs/Troubleshooting.md); your query may already be addressed. Your issue may already be reported! Please search the [issue tracker](https://github.com/pytorch/serve/issues) before creating one....

bug
docker