xiashenzhen
My error is the same as the one above. Could someone take a look?

[image]

**katib-db-manager log:**

E0827 04:22:23.644805 1 mysql.go:78] Ping to Katib db failed: dial tcp 10.96.249.11:3306: connect: connection refused
E0827 04:22:28.668739 1 mysql.go:78] Ping to Katib db failed: dial tcp...
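For anyone hitting the same `connection refused` lines, a small sketch like the one below may help confirm which address katib-db-manager is trying to reach. The function name `extract_db_endpoint` is made up for illustration, and the `kubeflow` namespace in the usage note is an assumption, not something stated in this thread.

```shell
# Hypothetical helper: read katib-db-manager log lines on stdin and print
# each distinct host:port that was refused.
extract_db_endpoint() {
  sed -n 's/.*dial tcp \([0-9.]*:[0-9]*\): connect: connection refused.*/\1/p' | sort -u
}
```

One possible use, assuming the deployment is named `katib-db-manager` and lives in the `kubeflow` namespace: `kubectl -n kubeflow logs deployment/katib-db-manager | extract_db_endpoint`. The printed address can then be compared with the cluster IP shown by `kubectl -n kubeflow get svc katib-mysql`. If they match, the service exists and the refusal most likely means the mysql container behind it is not accepting connections yet.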
> **All other pods start normally, but the database-related pods, katib-db-manager and katib-mysql, report errors. Their logs are as follows:**
>
> * katib-db-manager:
>
> E0827 03:18:05.755835 1 mysql.go:78] Ping to Katib db failed: dial tcp 10.96.67.181:3306: connect: connection refused
> E0827 03:18:10.758696 1 mysql.go:78] Ping to...
> > > **All other pods start normally, but the database-related pods, katib-db-manager and katib-mysql, report errors. Their logs are as follows:**
> > >
> > > * katib-db-manager:
> > >
> > > E0827 03:18:05.755835 1 mysql.go:78] Ping to Katib db failed: dial tcp 10.96.67.181:3306:...
> @xiashenzhen @WMeng1 Please check whether there is a problem with your PVCs:
>
> ```shell
> kubectl get pvc -A
> ```
>
> This mysql application is very simple. The errors may come from an earlier failed installation that was never deleted. For details on this mysql, see
>
> https://github.com/shikanon/kubeflow-manifests/blob/50ee9f1e0aef5f69620db89c9ae2f81c9b2d96e3/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml#L616

Thanks for the reply. I checked, and the storage is fine. I don't know why, but only these two pods have problems.

[image]

Deleting the pods so that they restart does not help either. I'm not sure whether it is a version issue; I'm using kubectl 1.20.5.
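As a rough aid for the PVC check suggested above, the sketch below filters the listing down to claims that are not `Bound`. The function name `unbound_pvcs` is hypothetical, and it assumes the default column order of `kubectl get pvc -A` (NAMESPACE, NAME, STATUS, ...).

```shell
# Hypothetical helper: read `kubectl get pvc -A --no-headers` output on stdin
# and print namespace/name plus status for every claim that is not Bound.
unbound_pvcs() {
  awk '$3 != "Bound" { print $1 "/" $2 ": " $3 }'
}
```

It could be run as `kubectl get pvc -A --no-headers | unbound_pvcs`. No output would suggest that every claim is bound, in which case storage is probably not the cause.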
> kubeflow-manifests/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml

Mine is resolved now. I simply deleted it and created it again:

kubectl delete -f kubeflow-manifests/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml

kubectl apply -f kubeflow-manifests/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml
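The delete-and-recreate fix above could be wrapped as in the following sketch, which also waits for the database to come up before returning. The function name `reinstall_katib` is hypothetical, and both the `kubeflow` namespace and the `katib-mysql` deployment name are assumptions based on the pod names mentioned in this thread.

```shell
# Hypothetical helper: delete a manifest, apply it again, then wait for the
# Katib MySQL deployment to finish rolling out.
reinstall_katib() {
  manifest="$1"
  namespace="${2:-kubeflow}"  # assumption: Katib is installed in "kubeflow"
  # --ignore-not-found keeps the delete from failing on a partial install.
  kubectl delete -f "$manifest" --ignore-not-found &&
  kubectl apply -f "$manifest" &&
  # assumption: the database deployment is named katib-mysql
  kubectl -n "$namespace" rollout status deployment/katib-mysql --timeout=300s
}
```

For example: `reinstall_katib kubeflow-manifests/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml`. Keep in mind that deleting a manifest removes everything it defines, which may include the MySQL PVC and its data.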
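> > > kubeflow-manifests/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml
> >
> > Mine is resolved now. I simply deleted it and created it again:
> >
> > kubectl delete -f kubeflow-manifests/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml
> >
> > kubectl apply -f kubeflow-manifests/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml
>
> After I deleted and recreated it, the PVCs were all Bound. However, the two pods that connect to the database are in the Running state but show READY 0/1, and `describe` shows that they still cannot reach the database.

Run the contents of patch first: delete once, then apply, and finally delete and recreate.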
> > > > kubeflow-manifests/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml
> > >
> > > Mine is resolved now. I simply deleted it and created it again:
> > >
> > > kubectl delete -f kubeflow-manifests/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml
> > >
> > > kubectl apply -f kubeflow-manifests/manifest1.3/019-katib-installs-katib-with-kubeflow-cert-manager.yaml
>...