kongjibai

Results 17 comments of kongjibai

> Caffe doesn't need the contiguous function. How can I get through it?
> It's almost done for my model. Thanks for the help!

Hi, I ran into a similar situation, did you...

> Please check the status of volcano components
>
> ```shell
> kubectl get po -n volcano-system
> ```
>
> If possible, please attach the logs of the volcano...

> `kubectl describe vcjob job-1`

Sorry for the late reply. The output is below; it reports that the pod group is not ready. This works in volcano-release-1.5 but fails in volcano-release-1.6...

> This output provides little useful information for debugging. Have you described the podgroup for more details or searched the logs?

The podgroup description is below, it...

> Please post the yaml file of the job

```
apiVersion: batch.volcano.sh/v1alpha1
kind: Job
metadata:
  # name: lm-hvd-job-tf-mnist
  name: lm-horovod-job
  # namespace: vc-horovod-test
  # namespace: default
  labels:
    "volcano.sh/job-type": Horovod
spec:...
```

> Please try delaying the start of the master.
>
> https://github.com/volcano-sh/volcano/blob/2cfce7a1305e4ad6d3dcb1a11bf3dc528aee0701/example/task-start-dependency/mpi.yaml#L34
>
> You can try using dependsOn.

I have tried using `dependsOn`, but there is no master job and the worker...

> You may need to reinstall volcano again after deleting these webhook configurations.

Thanks, I have uninstalled it using `kubectl delete -f volcano-development.yaml` and reinstalled it. but maybe uninstall completely,...

> If you use volcano-development.yaml, the normal way to reinstall might be:
>
> ```
> $ kubectl delete -f volcano-development.yaml
> $ kubectl delete validatingwebhookconfigurations volcano-admission-service-pods-validate volcano-admission-service-jobs-validate volcano-admission-service-queues-validate
> $...
> ```

> Please check
>
> https://github.com/volcano-sh/volcano/tree/master/example/integrations

Thanks, but it only has argo, mpi, mxnet, paddlepaddle and tensorflow; it does not have **_pytorch_** or _horovod_.