Riley Hun
We are trying to decrease the latency of our BERT model prediction service that is deployed using FastAPI. The predictions are called through the `/predict` endpoint. We looked into the...
**What happened**: I am trying to set up Dask Gateway on an EMR cluster following the guidelines specified [here](https://gateway.dask.org/install-hadoop.html). The only difference is that I am not sure how to...
I understand that it is recommended to use TLS in a production environment as per the docs, so I'm trying to set that up. Here are the steps I followed...
Hello, we are using an internal TCP load balancer to expose the Traefik proxy for security purposes. Our users are able to create a client connection to the cluster generated...
I am getting the following error when running a grid search on the Dask distributed back-end. The error does not occur when running the sklearn grid search on a single-core local machine. I don't...
**Describe the bug** I really need OpenDistro running in a production environment for a project, but I'm having a lot of trouble getting my Kibana server to start up when...
Is there a way to authenticate to HANA using JWT? We are currently using basic authentication, but want to transition to OAuth because we don't want to enter username/password...
I am noticing my fluentd pods keep restarting. They are collecting the logs and sending them to elasticsearch, so the workflow isn't broken per se, but in the last 13...
Hello all, Unfortunately, I ran out of space training the 3B model. I'm using a `p3.16xlarge` instance and it ran out of space on epoch 0.3. Any advice on how...
I am training using the `EleutherAI/pythia-2.8b` model and I'm using a `p3.16xlarge`. I tried the instructions for training on smaller instances, but still got a `CUDA out of memory` error...