xgboost-operator icon indicating copy to clipboard operation
xgboost-operator copied to clipboard

Incubating project for xgboost operator

Results 22 xgboost-operator issues
Sort by recently updated
recently updated
newest added

Hello, everyone. Does XGBoost-operator have a python client? Just like Pytorch-operator, can XGBoost-operator be run by pythonSDK?

When I started to deploy xgboost-operator on my kubeflow cluster, I referred to https://github.com/kubeflow/xgboost-operator/blob/master/config/samples/xgboost-dist/utils.py#L47 to implement my own version to read my own data. It's very common I follow this...

when we submit a Job and assign a namespace, it cannot work, submit like this: " kubectl create -f xgboost-operator/config/samples/xgboost-dist/xgboostjob_v1_iris_train.yaml -n aisys " and the error message like this: "...

``` kubectl kustomize base | kubectl apply -f - Error: json: unknown field "envs" error: no objects passed to apply ``` kubectl version ``` "v1.20.4" ```

I ran distributed training on k8s. The `rank` number was got by `extract_xgbooost_cluster_env()` as in https://github.com/kubeflow/xgboost-operator/blob/master/config/samples/xgboost-dist/train.py#L29 However, `xgb.rabit.get_rank()` got another `rank` number as in https://github.com/kubeflow/xgboost-operator/blob/master/config/samples/xgboost-dist/train.py#L57. There are two things confusing...

Inspired by https://github.com/kubeflow/pipelines/issues/4682 I created a script that will create a config file for depandabot so that it knows what directories to scan. It will scan the repository for files...

size/L
do-not-merge/hold

Community is asking different WG to own their infra and community won't provide a common shared testing infra anymore. Sees kubeflow/testing#752 for more details. Pytorch migration works well and here's...

kind/feature

this is xgboost job python sdk, developers can import package to deploy xgboost on kubernetes(like tfjob)

size/XXL
needs-ok-to-test

Rank allocation to -1 when the underlying node is not ready Typically when running the xgboost operator in autoscale mode and more nodes are required, the operator allocates a negative...

kind/bug
area/front-end
area/operator

I am trying to deploy operator by kustomize. But looks like the pod is not running, and the status is CrashLoopBackOff. Then look into detail by using describe command, I...

kind/bug
area/operator