Volcano performance verification

Open sharphu opened this issue 1 year ago • 9 comments

Volcano claims that with 10,000 nodes it can schedule 2,000 pods/s. How was this figure obtained? If I want to verify it now, what do I need to do? Is there a verification guide? https://github.com/volcano-sh/volcano/blob/master/docs/Volcano%20-%20A%20Kubernetes%20Native%20Batch%20System.pdf

sharphu avatar Nov 15 '23 07:11 sharphu

I plan to write a mock scheduler for Volcano to test scheduling performance with large numbers of nodes and pods.

lowang-bh avatar Nov 16 '23 07:11 lowang-bh

How would that be done?

sharphu avatar Nov 22 '23 01:11 sharphu

Just use a fake client in place of the apiserver when binding pods to nodes. The rest of the scheduling logic stays the same as the original.
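
As a rough illustration of that idea, here is a minimal Go sketch of a binder that records bindings in memory instead of calling the apiserver. The `Binder` interface, `FakeBinder` type, and method names are hypothetical and chosen for this example; they are not copied from the Volcano source, so check the real scheduler cache interfaces before adapting it.

```go
package main

import (
	"fmt"
	"sync"
)

// Binder is a hypothetical stand-in for the interface the scheduler uses
// to bind a pod to a node. It is NOT taken from the Volcano code base.
type Binder interface {
	Bind(podKey, nodeName string) error
}

// FakeBinder records bindings in memory instead of talking to the apiserver.
type FakeBinder struct {
	mu       sync.Mutex
	bindings map[string]string // pod key -> node name
}

// NewFakeBinder returns an empty FakeBinder.
func NewFakeBinder() *FakeBinder {
	return &FakeBinder{bindings: make(map[string]string)}
}

// Bind stores the pod-to-node assignment and rejects double binding.
func (b *FakeBinder) Bind(podKey, nodeName string) error {
	b.mu.Lock()
	defer b.mu.Unlock()
	if prev, ok := b.bindings[podKey]; ok {
		return fmt.Errorf("pod %s is already bound to %s", podKey, prev)
	}
	b.bindings[podKey] = nodeName
	return nil
}

// Count reports how many pods have been bound so far.
func (b *FakeBinder) Count() int {
	b.mu.Lock()
	defer b.mu.Unlock()
	return len(b.bindings)
}

func main() {
	// The scheduler would only see the Binder interface.
	fake := NewFakeBinder()
	var binder Binder = fake

	for i := 0; i < 3; i++ {
		pod := fmt.Sprintf("default/pod-%d", i)
		if err := binder.Bind(pod, "fake-node-0"); err != nil {
			fmt.Println("bind failed:", err)
		}
	}
	fmt.Println("bound pods:", fake.Count())
}
```

Because the scheduler depends only on the interface, the fake can be swapped in without touching the scheduling logic, which is the property the comment above relies on.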

lowang-bh avatar Nov 26 '23 03:11 lowang-bh

Do you have more detailed information about this plan? I'd like to see if I can help, because we are also planning to test Volcano's performance.

bysph avatar Dec 01 '23 09:12 bysph

We can use fake instances to simulate the interfaces in the scheduler cache and leave the whole scheduling logic the same as in the Volcano scheduler. To simulate the scheduler, these steps are needed:

  1. Make sure all clients are used through an interface: please help merge https://github.com/volcano-sh/volcano/pull/2866 (fix: events miss when there is a pod bind failed) first. @william-wang
  2. Then define a MockCache similar to Cache in volcano-scheduler.
  3. Develop a client (CLI, HTTP API, or SDK) that supports CRUD operations to manage fake resources such as pod/job/podgroup/node, etc. This step may take some time.
  4. Create a large number of those resources, wait for the mock scheduler to schedule the pods, and collect the metrics.
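
Steps 2 to 4 could be sketched roughly as follows in Go. Every type and function here (`MockCache`, `FakeNode`, `FakePod`, `ScheduleAll`, `Throughput`) is a hypothetical illustration, not Volcano's actual Cache; in particular, `ScheduleAll` is a trivial first-fit placeholder standing in for the real scheduling logic, so the number it prints says nothing about Volcano itself.

```go
package main

import (
	"fmt"
	"time"
)

// FakeNode and FakePod are hypothetical in-memory resources.
type FakeNode struct {
	Name     string
	FreePods int // remaining pod slots on this node
}

type FakePod struct {
	Name string
	Node string // empty until the pod is scheduled
}

// MockCache is a hypothetical stand-in for the scheduler cache.
type MockCache struct {
	Nodes []*FakeNode
	Pods  []*FakePod
}

// AddNode and AddPod play the role of the "create" side of the CRUD client.
func (c *MockCache) AddNode(name string, capacity int) {
	c.Nodes = append(c.Nodes, &FakeNode{Name: name, FreePods: capacity})
}

func (c *MockCache) AddPod(name string) {
	c.Pods = append(c.Pods, &FakePod{Name: name})
}

// ScheduleAll is a placeholder first-fit loop. A real test would invoke the
// unchanged scheduling logic here. It returns the number of pods placed.
func (c *MockCache) ScheduleAll() int {
	placed, next := 0, 0
	for _, p := range c.Pods {
		if p.Node != "" {
			continue // already scheduled
		}
		for next < len(c.Nodes) && c.Nodes[next].FreePods == 0 {
			next++ // skip nodes that are full
		}
		if next == len(c.Nodes) {
			break // no capacity left anywhere
		}
		c.Nodes[next].FreePods--
		p.Node = c.Nodes[next].Name
		placed++
	}
	return placed
}

// Throughput converts a placed-pod count and elapsed time into pods per second.
func Throughput(placed int, elapsed time.Duration) float64 {
	if elapsed <= 0 {
		return 0
	}
	return float64(placed) / elapsed.Seconds()
}

func main() {
	cache := &MockCache{}
	for i := 0; i < 10000; i++ {
		cache.AddNode(fmt.Sprintf("node-%d", i), 10)
	}
	for i := 0; i < 50000; i++ {
		cache.AddPod(fmt.Sprintf("pod-%d", i))
	}

	start := time.Now()
	placed := cache.ScheduleAll()
	elapsed := time.Since(start)
	fmt.Printf("placed %d pods in %v (%.0f pods/s)\n", placed, elapsed, Throughput(placed, elapsed))
}
```

The measurement itself is just placed pods divided by wall-clock seconds; the interesting part of a real test is what replaces `ScheduleAll`.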

There is also another, simpler way: use https://github.com/kubernetes-sigs/kwok in place of real k8s nodes. Create a large number of fake nodes, submit a large number of pods to them, and wait for Volcano to schedule the pods.
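
For the KWOK route, a small generator for fake Node manifests might look like the Go sketch below. It assumes, based on the KWOK documentation as I recall it, that KWOK manages nodes carrying the `kwok.x-k8s.io/node: fake` annotation; the `type: kwok` label, the node names, and the capacity values are arbitrary choices for this example. Please verify the annotation and the required fields against the KWOK docs for the version in use.

```go
package main

import (
	"encoding/json"
	"fmt"
	"os"
)

// FakeNode builds one Node manifest as a generic map.
// The annotation key is an assumption taken from the KWOK docs;
// the label and resource values are arbitrary example values.
func FakeNode(name string) map[string]interface{} {
	resources := map[string]string{"cpu": "32", "memory": "256Gi", "pods": "110"}
	return map[string]interface{}{
		"apiVersion": "v1",
		"kind":       "Node",
		"metadata": map[string]interface{}{
			"name":        name,
			"annotations": map[string]string{"kwok.x-k8s.io/node": "fake"},
			"labels":      map[string]string{"type": "kwok"},
		},
		"status": map[string]interface{}{
			"allocatable": resources,
			"capacity":    resources,
		},
	}
}

// NodeList wraps n fake nodes in a v1 List so they can be applied together.
func NodeList(n int) map[string]interface{} {
	items := make([]interface{}, 0, n)
	for i := 0; i < n; i++ {
		items = append(items, FakeNode(fmt.Sprintf("kwok-node-%d", i)))
	}
	return map[string]interface{}{
		"apiVersion": "v1",
		"kind":       "List",
		"items":      items,
	}
}

func main() {
	// Print the list as indented JSON on stdout.
	enc := json.NewEncoder(os.Stdout)
	enc.SetIndent("", "  ")
	if err := enc.Encode(NodeList(3)); err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
}
```

The JSON output could then be piped to `kubectl apply -f -` against a cluster where KWOK is running. The test pods would additionally need to select Volcano as their scheduler so that it, rather than the default scheduler, places them on the fake nodes.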

lowang-bh avatar Dec 03 '23 04:12 lowang-bh

OK, I am currently working on it.

bysph avatar Dec 04 '23 11:12 bysph

Volcano claims that with 10,000 nodes it can schedule 2,000 pods/s. How was this figure obtained? If I want to verify it now, what do I need to do? Is there a verification guide?

Is there any background on that requirement? Reaching 2k pods/s is not only about scheduling but also about kube-apiserver enhancements, which are not open source right now.

k82cn avatar Dec 28 '23 02:12 k82cn

Is this topic still active? I would also like to discuss whether any information is available on this. I'm wondering if we can use the KWOK approach here to simulate and validate scheduling performance.

googs1025 avatar Apr 17 '24 04:04 googs1025

Keeping an eye on this issue.

zhifanggao avatar Apr 23 '24 01:04 zhifanggao