amybachir

Results 9 issues of amybachir

I'm using great_expectations version 0.13.33. I'm running great_expectation in an Azure Databricks cluster with [ADLS Credential Passthrough](https://docs.microsoft.com/en-us/azure/databricks/security/credential-passthrough/adls-passthrough) enabled. All my stores are `TupleFilesystemStoreBackend` that are Databricks DBFS mounts with Azure...

help wanted
community
core-engineering-queue
feature

I'm using [aws-virtual-gpu-device-plugin](https://github.com/awslabs/aws-virtual-gpu-device-plugin) which is a solution built on top of Multi-Process Service(MPS) to expose arbitrary number of virtual GPUs on GPU nodes in a kubernetes cluster. Occasionally and randomly,...

Our inference service pods occasionally run into an init error so the storage-initializer container restarts and gets stuck in an Init:CrashLoopBackOff when attempting to run the following: ``` [I 220726...

kfserving/storage

I'm trying out kubeflow 1.1.0 with kfctl_istio_dex.v1.1.0.yaml manifest installed on Azure kubernetes cluster. Before 1.1.0, I would customize the installation by adding my custom parameters directly in kfctl_istio_dex.v1.1.0.yaml configuration file...

kind/bug
platform/azure
priority/p2
area/kfctl

Is there a way to log vgpu utilization metrics and monitor with aws-virtual-gpu-device-plugin? I currently use nvml library with datadog but it is not aware of the virtual GPUs so...

### Describe the bug When running `bentoml containerize` with a custom docker base_image, the command throws an error ``` => => transferring context: 47.64MB 0.3s => ERROR [base-container 2/8] RUN...

bug

I have a milvus standalone deployment through milvus helm chart deployed on kubenretes cluster with s3 backend - chart version is 4.0.34, app version is 2.2.14 The milvus pod throws...

### What is the version? 3.3.5-3.4.1 ### What happened? Upgraded from k8s 1.24 to 1.25 and dcgm-exporter from 3.3.3-3.3.1 to 3.3.5-3.4.1. The dcgm-exporter pod is now in Crashloopbackoff with this...

bug

The kubeflow models web app (v0.6.0) does not display models although it gets a 200 response for `/models/api/namespaces/namespace/inferenceservices` ``` { "inferenceServices":[ { "apiVersion":"serving.kubeflow.org/v1beta1", "kind":"InferenceService", "metadata":{ "annotations":{ "autoscaling.knative.dev/maxScale":"5", "autoscaling.knative.dev/minScale":"1", "kubectl.kubernetes.io/last-applied-configuration":"{""apiVersion"":"serving.kubeflow.org/v1beta1",""kind"":""InferenceService"",""metadata"":{""annotations"":{""autoscaling.knative.dev/maxScale"":"5",""autoscaling.knative.dev/minScale"":"1"},""labels"":{""app"":""iris-amy"",""app.kubernetes.io/instance"":""iris-amy"","controller-tools.k8s.io":"1.0",""env"":"mlk8s-dev",""sidecar.istio.io/inject"":"false",""version"":"0.0.20"},""name"":""iris-amy"",""namespace"":""abachir""},""spec"":{""predictor"":{""containers"":[{""env"":[{""name"":""PATH_TO_MODEL"",""value"":""/mnt/models/iris/models/2021-05-05T18":"47":18.359991/"},{""name"":""STORAGE_URI"",""value"":""pvc":"},{""name"":""MODEL_VERSION"",""value"":"0.0.20"}],""image"":""iris-kfserving":0.0.20",""name"":""kfserving-container"",""resources"":{""limits"":{""cpu"":"1",""memory"":"6Gi"},""requests"":{""cpu"":"250m",""memory"":"4Gi"}}}]}}} "...