Jiaxin Shan issues

Results: 271 issues of Jiaxin Shan

combined.loss is not equal to loss of output feature?

**Describe the bug** I noticed that even when I choose only one output feature, `combined.loss` != `output_feature.loss`. I thought that with a single output feature these should be exactly the same? Do I...

Get trial status from outside the program

**Is your feature request related to a problem? Please describe.** Yes. I would like to know how many trials are pending/running/finished. How can I get the status from outside the program? **Describe...

feature

waiting for answer

[discussion] What's the best practice to run ludwig job with a remote Ray cluster on Kubernetes?

Currently, I am using KubeRay to start a distributed Ray cluster on Kubernetes and get the endpoint, which can be accessed externally using either LoadBalancer or NodePort. Then I...

documentation

Any reason not to have `automl` subcommand supported?

**Is your feature request related to a problem? Please describe.** I can only find the `init_config` subcommand, which is equivalent to `ludwig.automl.create_auto_config`, but I'd like to have a command to kick off...

feature

release-0.6

Multiple UE4Scene GameServers in one container

**Is your feature request related to a problem? Please describe.** We are moving a game to the Agones architecture; a single GameServer container internally needs to start multiple processes and they...

kind/feature

Change monolithic yaml to kustomization

**Is your feature request related to a problem? Please describe.** I feel the monolithic YAML is not that easy to manage. For example, if I'd like to update images allocator or...

kind/feature

area/operations

Is there a way to support reference link for soft link files

My project already has some documents and README.md files and I would like to reuse them directly. My current folder structure is like below. ``` ├── docs │ ├── best-practice...

Continuous building docker images

I notice `mpioperator/mpi-operator:latest` is from `rongou/mpi-operator`, which is sometimes delayed. 1. Could you help release the latest mpi-operator image? I'd like to use features from the latest version. I can use...

kind/feature

graduation

Support target pod deletion in elastic training scenario

Currently, as I understand it, the operator doesn't support scaling down specific pods. The replica delta won't give us granular control. If we plan to use spot instances, when the instance...

kind/feature

Add pipeline launcher components for other distributed training jobs

In order to leverage different training operators in Kubeflow Pipelines, it would be better to provide high-level launcher components as an abstraction to invoke training jobs. `katib-launcher` and `launcher`...

area/sdk/dsl

status/triaged

lifecycle/stale