Jiaxin Shan
**Describe the bug** I notice that even when I choose a single output feature, `combined.loss` != `output_feature.loss`. I thought that with only one output feature these should be exactly the same. Do I...
**Is your feature request related to a problem? Please describe.** Yes. I would like to know how many trials are pending/running/finished. How can I get the status externally? **Describe...
Currently, I am using KubeRay to start a distributed Ray cluster on Kubernetes and get the endpoint, which can be accessed externally using either LoadBalancer or NodePort. Then I...
**Is your feature request related to a problem? Please describe.** I can only find the `init_config` subcommand, which is equivalent to `ludwig.automl.create_auto_config`, but I'd like to have a command to kick off...
**Is your feature request related to a problem? Please describe.** We are moving a game to the Agones architecture. A single GameServer container needs to start multiple processes internally, and they...
**Is your feature request related to a problem? Please describe.** I feel that a monolithic YAML file is not easy to manage. For example, if I'd like to update images allocator or...
My project already has some documents and README.md files, and I would like to reuse them directly. My current folder structure is like below. ``` ├── docs │ ├── best-practice...
I notice `mpioperator/mpi-operator:latest` is from `rongou/mpi-operator`, which is sometimes delayed. 1. Could you help release the latest MPI operator image? I'd like to use a feature from the latest version. I can use...
Currently, as I understand it, the operator doesn't support scaling down specific pods. A replica delta won't give us that granular control. If we plan to use spot instances, when the instance...
To leverage different training operators in Kubeflow Pipelines, it would be better to provide high-level launcher components as an abstraction for invoking training jobs. `katib-launcher` and `launcher`...