mxnet-operator issues

update-codegen.sh doesn't work after code-generator upgrading to 1.19

``` ➜ mxnet-operator git:(master) go build -o defaulter-gen /Users/jiaxin/go/pkg/mod/k8s.io/[email protected]/cmd/defaulter-gen -> 1.16.10 work ➜ mxnet-operator git:(master) ✗ go build -o defaulter-gen /Users/jiaxin/go/pkg/mod/k8s.io/[email protected]/cmd/defaulter-gen go: directory ../../../../pkg/mod/k8s.io/[email protected]/cmd/defaulter-gen outside available modules -> 1.19.x doesn't...

Jeffwan

add dependabot config script

3

Inspired by https://github.com/kubeflow/pipelines/issues/4682 I created a script that will create a config file for depandabot so that it knows what directories to scan. It will scan the repository for files...

davidspek

size/L

do-not-merge/hold

python sdk for mxnet-operator

6

There are python sdks for tf-operator and pytorch-operator, why doesn't mxnet-operator have one?

everett-moon

Move presubmit jobs to AWS CI

Community is asking different WG to own their infra and community won't provide a common shared testing infra anymore. Sees kubeflow/testing#752 for more details. Pytorch migration works well and here's...

Jeffwan

Change to follow kubeflow/common convention and reuse implementation

2

Currently, mxnet operator API doesn't follow `kubeflow/common` convention and but controller does. It imports tf-operator implementation `https://github.com/kubeflow/mxnet-operator/blob/c718707b304dc1ed0210a740c8efe7071d4ebb3e/pkg/controller.v1/mxnet/controller.go#L42` To graduate mxnet-operator to v1, we'd like to migrate to follow `kubecom/common` convention....

Jeffwan

Graduate mxnet-operator to v1

1

Jeffwan

[Feature Request] mxnet-operator not available by default in kubeflow

2

Hello, Thank you for working on mxnet-operator for kubeflow. I see that mxnet-operator is not installed by default in Kubeflow. If it is available by default, it would help users...

sandeep-krishnamurthy

the status of worker-0 is error, but the status of mxjob is Succeeded

2

**kubeflow version**: 0.5.0 **mxnet-operator version**: v1beta1 **kubernetes dashboard display**： ![image](https://user-images.githubusercontent.com/3971265/58082021-c472d580-7be8-11e9-8240-8880dfda3f70.png) **worker-0 log**: INFO:root:start with arguments Namespace(add_stn=False, batch_size=64, data_dir='/admin/public/model/mxnet_distributed/data', disp_batches=10, dtype='float32', gc_threshold=0.5, gc_type='none', gpus='0', image_shape='1, 28, 28', initializer='default', kv_store='dist_device_sync', load_epoch=None, loss='',...

magicmopper

kind/bug

Mxnet operator v1 API

6

There are couple of minor api changes that are suggested. We can incorporate all these changes in the next API version. Related: kubeflow/tf-operator#935 - [x] Requires support of Status subresource...

wackxu

mxnet-operator
mxnet-operator copied to clipboard

Metadata

update-codegen.sh doesn't work after code-generator upgrading to 1.19

add dependabot config script

python sdk for mxnet-operator

Move presubmit jobs to AWS CI

Change to follow kubeflow/common convention and reuse implementation

Graduate mxnet-operator to v1

[Feature Request] mxnet-operator not available by default in kubeflow

the status of worker-0 is error, but the status of mxjob is Succeeded

Mxnet operator v1 API

← Metadata

Owner

Metadata

mxnet-operator mxnet-operator copied to clipboard

Metadata

← Metadata

Owner

Metadata

mxnet-operator
mxnet-operator copied to clipboard