Results 47 comments of Nobuto Murata

I think a previously opened issue in https://github.com/canonical/microk8s/issues/2575 might have the same root cause, but at that point a solution was to use 1.22, which doesn't support Kubeflow in this...

Ah, okay. You are talking about [this change](https://github.com/canonical/microk8s/commit/1b55a80ba01659dc46524436dfef462d1e4208fa#diff-56759910381a014fecfd7556dd72ddd68c747d922a5b7df2044b9ce7c552f5f5).

Okay, at least the theory has been confirmed; the patchset below can make GPU enablement step work. https://github.com/canonical/microk8s/compare/1.21...nobuto-m:1.21-gpu

> > Okay, at least the theory has been confirmed; the patchset below can make GPU enablement step work. [1.21...nobuto-m:1.21-gpu](https://github.com/canonical/microk8s/compare/1.21...nobuto-m:1.21-gpu) > > @nobuto-m , Can you pls help me how...

> I tried to reproduce your issue but in my case the trials get completed without patching the katib deployment: Hmm, interesting. I had the timeout 100% so far. >...

Hmm, `trial-resources=Workflow.v1alpha1.argoproj.io` might be a red herring. The issue is still reproducible in my environment, but after adding `trial-resources=Workflow.v1alpha1.argoproj.io` and removing it again, the trial jobs complete. So the key...

> So the key to workaround the issue might be restarting/recreating the katib-controller pod. It's puzzling why it's reproducible on my testbed but not on the other environment though. `microk8s...

I've reproduced it successfully in a clean environment. The steps are almost identical with the one in the description. Hope it helps for you to reproduce it on your end,...

> I deployed kubeflow once more using your instructions and the trials still succeed, provided that the notebook's CPU and memory values are increased. > > When using the default...

This was tricky since `juju refresh seldon-controller-manager --channel edge` didn't solve the problem. It required a fresh redeployment or manual edit to the service definition. ``` $ juju refresh seldon-controller-manager...