Simon_CQK

Results 22 issues of Simon_CQK

```python class Storage(object): SUPERBLOCK_SIZE = 4096 INTEGER_FORMAT = "!Q" INTEGER_LENGTH = 8 ``` I know the INTEGER_FORMAT , but the other two really confuse me.

#### What would you like to be added: Introduce a new filed in `podGroup.spec` named with `waitTimeoutSeconds` or sth similar, then users are able to configure `waitTimeoutSeconds` dynamically. #### Why...

kind/feature
lifecycle/stale

# KubeDL 2022 Annual Review ## Table of Contents ## Backgroud KubeDL is a suite of Kubernentes controllers that enable running machine learning workloads on Kubernentes, such as model training...

Signed-off-by: SimonCqk ### Ⅰ. Describe what this PR does remove `launcherRunsWorkload` global startup flag, which can be inferred by whether has `Launcher` role in job spec. ### II. Does this...

**What would you like to be added**: In k8s `>=1.20`, align with `controller-runtime >=0.8.x`, list/watch transport protocol of built-in resources(pods, services...) has upgraded from `json` to `protobuf`, which brings considerable...

enhancement
dependencies

**What would you like to be added**: 1. a job-level queuing and orchestration module that admit job to be scheduled by some strategies. 2. refactor skeleton codes to make control...

enhancement

# Alibaba Summer of Code (ASOC) 2022 Welcome to the open source world! If you haven't planned how to spend this summer, come to the Alibaba Summer of Code and...

asoc2022
community

**Background**: As the [official portal](https://pytorch.org/docs/stable/distributed.elastic.html) introduced, torch-elastic has been upstreamed to pytorch >=1.9. KubeDL manages the lifecycle of jobs and orchestrate their resources, it is critical to implement torch-elastic distributed...

enhancement
asoc2022
community

**What would you like to be added**: the logging in stdout is mixed with multiple formats for we using logging libraries mixture across different packages and controller implementations, consolidating loggers...

refactor

**What would you like to be added**: as we discussed in https://github.com/kubedl-io/kubedl/issues/180#issuecomment-942907449, the global flag launcherRunsWorkload can be removed, mpiReplicaSpecs with Launcher role indicates that mpijob will be driven by...