kongjibai
> Caffe doesn't need the contiguous function. How can I get through it?
> It's almost done for my model. Thanks for the help!

Hi, I ran into a similar situation, did you...
> Please check the status of volcano components
>
> ```shell
> kubectl get po -n volcano-system
> ```
>
> If possible, please attach the logs of the volcano...
> `kubectl describe vcjob job-1`

Sorry for the late reply. The output is below; it reports that the pod group is not ready. The job runs normally on volcano-release-1.5 but fails on volcano-release-1.6....
> This output provides little useful information for debugging. Have you described the podgroup for more details or searched the logs?

The podgroup is described below, it...
> Please post the yaml file of the job

```
apiVersion: batch.volcano.sh/v1alpha1
kind: Job
metadata:
  # name: lm-hvd-job-tf-mnist
  name: lm-horovod-job
  # namespace: vc-horovod-test
  # namespace: default
  labels:
    "volcano.sh/job-type": Horovod
spec:...
```
> Please try delayed start master.
>
> https://github.com/volcano-sh/volcano/blob/2cfce7a1305e4ad6d3dcb1a11bf3dc528aee0701/example/task-start-dependency/mpi.yaml#L34
>
> You can try using dependsOn.

I have tried using `dependsOn`, but there is no master job and the worker...
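For reference, a minimal sketch of what a task-level `dependsOn` can look like in a Volcano job, modeled on the linked `mpi.yaml` example. The job name, task names, images, and `minAvailable` value are placeholders chosen for illustration, not taken from the job discussed here, so treat it as an assumption-laden outline rather than a working Horovod setup.

```yaml
apiVersion: batch.volcano.sh/v1alpha1
kind: Job
metadata:
  name: depends-on-sketch          # hypothetical job name
spec:
  schedulerName: volcano
  minAvailable: 1                  # assumption; tune for your own job
  tasks:
    - name: worker                 # hypothetical task name
      replicas: 2
      template:
        spec:
          restartPolicy: OnFailure
          containers:
            - name: worker
              image: example.com/worker:latest   # placeholder image
    - name: master                 # hypothetical task name
      replicas: 1
      dependsOn:
        name:
          - worker                 # start this task only after the "worker" task
      template:
        spec:
          restartPolicy: OnFailure
          containers:
            - name: master
              image: example.com/master:latest   # placeholder image
```

The names listed under `dependsOn.name` have to match the `name` of another task in the same job.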
> You may need to reinstall the volcano again after deleting these webhook configurations.

Thanks, I uninstalled it using `kubectl delete -f volcano-development.yaml` and reinstalled, but maybe uninstall completely,...
> if you use volcano-development.yaml, the normal reinstall way might be:
>
> ```
> $ kubectl delete -f volcano-development.yaml
> $ kubectl delete validatingwebhookconfigurations volcano-admission-service-pods-validate volcano-admission-service-jobs-validate volcano-admission-service-queues-validate
> $...
> ```
> please check
>
> https://github.com/volcano-sh/volcano/tree/master/example/integrations

Thanks, but it only has argo, mpi, mxnet, paddlepaddle and tensorflow examples; there are none for **_pytorch_** or _horovod_.