fil_backend
fil_backend copied to clipboard
FIL backend for the Triton Inference Server
…endency file The issue occurs during fil_backend dockerfile build. We will get the `301` result in actual if trying to request "https://developer.download.nvidia.cn/compute/cuda/repos/ubuntu2004/x86_64/cuda-keyring_1.0-1_all.deb" directly. So the dpkg installer just reports a...
I deployed multiple models on the Triton server(docker), BERT models are using GPU, and XGBoost models are using CPU. Now I want to limit the number of CPU cores used...
Hello, The [FIL backend installation instructions](https://github.com/triton-inference-server/fil_backend/blob/66543e5ea2710e44ba36f2b4b1d20fcc28eefa04/docs/install.md) indicate that: > The FIL backend is a part of Triton and can be installed via the methods described in the [main Triton documentation](https://github.com/triton-inference-server/server#build-and-deploy)....
root@2ff024ed2346:/opt/tritonserver/tmp/simple-xgboost# python3 sample.py Test Accuracy: 51.24 /usr/local/lib/python3.10/dist-packages/xgboost/core.py:160: UserWarning: [09:16:55] WARNING: /workspace/src/c_api/c_api.cc:1240: Saving into deprecated binary model format, please consider using `json` or `ubj`. Model format will default to JSON in...
* Use `use_experimental_optimizations` flag to selectively enable the new FIL * Enable the new FIL for both CPU and GPU inference workload. Note: requires https://github.com/rapidsai/cuml/pull/5559 to function. * Implement common...
This PR moves the model generation and loading into pytest. This achieves a few things: - Generation of configurations and models on the fly by pytest - Testing prediction results...
When setting threshold=0.9, cuml models for CPU or GPU do not appear to observe the threshold. Other model types seem to be working.
Multiclass models must have the paramers predict_proba=true AND output_class=true in order to predict probabilities, otherwise we get the following backend error: ``` terminate called after throwing an instance of 'raft::exception'...
In the case of a binary classification model from sklearn we expect the output for both positive and negative classes (this would be consistent with the normal prediction output). As...