Anindya Saha

Results 5 issues of Anindya Saha

I am testing out TFJob and MPIJob operators from kubeflow/manifests-v1.1.0 branch on AWS EKS K8s=1.14. I am able to schedule TFJob and MPIJob successfully and these jobs also complete fine....

I have applied the following MPI Job Yaml. I observe that when I run the workers with only the GPU specified in the resources section the TF2 Job proceeds very...

Hello Team, I find the Pod Level propagation of Annotations and Labels to be very confusing. Below is an example how am I setting the annotations and labels at each...

bug

Hello Team, I have used Kubeflow MPI Job Operator before and I am evaluating Polyaxon Operators. One issue that I faced in the past that when I applied a similar...

stale

He Team, I am trying to use the Pytorch Operator to spawn distributed Pytorch Jobs. I see the image mentioned in https://github.com/kubeflow/pytorch-operator/blob/6293efc19503078953acf04df03a1204fd265e35/manifests/kustomization.yaml#L13 to be `809251082950.dkr.ecr.us-west-2.amazonaws.com/pytorch-operator`. However, that repo is not...

kind/bug