serving
serving copied to clipboard
A flexible, high-performance serving system for machine learning models
Currently when the configured model list is updated via a call to `handleReloadConfigRequest`, the request thread blocks until any newly added models become available. Their availability however depends on the...
In direct_session.cc https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/common_runtime/direct_session.cc#L1514, it always emplaces key to executors_, then a lot of keys are added to map, which leads to a lot of memory usage. If 10 input tensors,...
addresses this https://github.com/tensorflow/serving/issues/795 I think it's a feature the community really wants
OP_REQUIRES failed at xla_ops : UNIMPLEMENTED: Could not find compiler for platform CUDA: NOT_FOUND
## Bug Report Does Tensorflow Serving support XLA compiled SavedModels ? or am I doing something wrong ? ### System information - **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**:...
# Feature Request Add a health check to the Dockerfile ## Describe the problem the feature is intended to solve I'm using a docker container on an edge device and...
**System information** operating system: `win11 64bit` tensorflow/serving version: `2.14.1 ` TFserving Deployment mode: `RESTful API` python requirements.txt: `tensorflow-cpu == 2.3.0 pyqt5 pillow opencv-python matplotlib` **Describe the problem** Hello everyone, my...
Trying to understand why TF uses one CUDA compute stream? Is there a metric which shows if ops are waiting to be scheduled on that one compute stream? I want...
## Feature Request May I know the expected release date of TensorFlow Serving with TensorFlow Runtime? For the background, recently, I found that tensorflow runtime is integrated into tensorflow/serving codes....
## Bug Report If this is a bug report, please fill out the following form in full: ### System information - **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**: -...
Does TF Serving support CUDA graphs?