CI improvement for distributed & integration tests
There are many limitations in our current CI:
- The CI environment is very different from our laptops, so it is hard to reproduce CI errors on our own;
- This is especially true for integration tests in distributed settings, which involve k8s (or Hadoop/Spark in the near future);
- Debugging is painful: verifying a bugfix in a distributed setting requires packaging a new image, uploading it to the registry, and then asking our "experts", i.e., Siyuan or Dongze, for verification.
Hence the CI acts as an obstacle to our development, when it should be a productivity tool.
Let's take a look at ClickHouse's integration tests; I think they are a good practice! It is super easy to launch a test environment with a 4-node cluster, each node mounting a mocked S3 disk, plus a 3-node ZooKeeper, using a single-line command like

```
runner integration_test_s3
```

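For GraphScope, a comparable entry point could be a thin wrapper that brings up a compose-described environment and runs the suite against it. The sketch below is purely hypothetical: the compose file path, the `tests/` layout, and the launcher itself are assumptions, not existing GraphScope tooling.

```python
#!/usr/bin/env python3
"""Hypothetical one-command launcher for GraphScope distributed tests.

Assumes a compose file (tests/compose/distributed.yaml) describing the
coordinator, engine workers, and any mocked storage services; none of
these paths exist today -- this only sketches the ClickHouse-style UX.
"""
import subprocess
import sys

COMPOSE_FILE = "tests/compose/distributed.yaml"  # assumed path


def run(cmd):
    print("+", " ".join(cmd))
    subprocess.run(cmd, check=True)


def main():
    suite = sys.argv[1] if len(sys.argv) > 1 else "integration"
    try:
        # Bring the whole multi-node environment up with one command.
        run(["docker", "compose", "-f", COMPOSE_FILE, "up", "-d"])
        # Run pytest against that environment from the host.
        run(["python3", "-m", "pytest", f"tests/{suite}", "-v"])
    finally:
        # Dump all service logs for local debugging, then tear down.
        with open("integration_test.log", "wb") as f:
            f.write(subprocess.run(
                ["docker", "compose", "-f", COMPOSE_FILE, "logs", "--no-color"],
                capture_output=True).stdout)
        run(["docker", "compose", "-f", COMPOSE_FILE, "down", "-v"])


if __name__ == "__main__":
    main()
```
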
The ClickHouse integration CI workflow is shown in fig-1:
- Developers build the binary on their local machine.
- By pointing `CLICKHOUSE_TESTS_SERVER_BIN_PATH` (or a command-line argument) at the binary built in step 1, the `runner` code will launch a Docker container.
- In the container, `pytest` will begin to run. Pay attention to its helper functions, e.g., `cluster.py`, which launches a cluster of containers within this container, making distributed tests much easier (see the sketch after this list).
- The logs of each instance are exported to the local machine for further debugging and investigation, as shown in fig-2.
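
A GraphScope helper in the spirit of ClickHouse's `cluster.py` might look roughly like the following. Everything here is a hypothetical sketch of the pattern (a pytest fixture that launches a small cluster of containers and exports their logs on teardown); the image name, the placeholder entrypoint, and the `Cluster` class are assumptions, not existing code.

```python
# Hypothetical pytest helper, loosely in the spirit of ClickHouse's cluster.py.
# The image name, entrypoint, and class below are assumptions used only to
# illustrate the pattern: start containers per test module, export logs on exit.
import pathlib
import subprocess
import uuid

import pytest


class Cluster:
    def __init__(self, image="graphscope/graphscope:latest", workers=3):
        self.image = image      # assumed image tag
        self.workers = workers
        self.run_id = uuid.uuid4().hex[:8]
        self.containers = []

    def start(self):
        for i in range(self.workers):
            name = f"gs-test-{self.run_id}-worker{i}"
            # Placeholder entrypoint; a real helper would start the engine here.
            subprocess.run(
                ["docker", "run", "-d", "--name", name, self.image,
                 "sleep", "infinity"],
                check=True)
            self.containers.append(name)

    def shutdown(self, log_dir="ci_logs"):
        pathlib.Path(log_dir).mkdir(exist_ok=True)
        for name in self.containers:
            # Export each instance's logs for local debugging, then remove it.
            with open(pathlib.Path(log_dir) / f"{name}.log", "wb") as f:
                subprocess.run(["docker", "logs", name],
                               stdout=f, stderr=subprocess.STDOUT)
            subprocess.run(["docker", "rm", "-f", name], check=False)


@pytest.fixture(scope="module")
def cluster():
    c = Cluster(workers=3)
    c.start()
    yield c
    c.shutdown()
```

A test would then just take `cluster` as a fixture argument, and the files left under `ci_logs/` would give the same post-mortem experience as fig-2.
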
GraphScope may borrow these ideas to improve our CI. Since ClickHouse builds its binary with static dependencies, a binary built on machine A can run on machine B. We need to decide whether we:
- need to link our dependencies as static libraries;
- or can make some adaptations instead (preferred); a possible direction is sketched below.

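If we go with the adaptation route rather than fully static linking, one possible direction (again, only a sketch; the paths and the `graphscope/test-base` image are assumptions) is to keep a fixed base image containing all heavy dependencies and bind-mount the locally built artifacts into it, so that a bugfix no longer requires rebuilding and pushing a new image:

```python
# Sketch of the "adaptation" option: reuse a fixed dependency image and
# bind-mount freshly built artifacts plus the test sources into it, instead
# of baking and pushing a new image for every fix. All paths and the image
# tag are assumptions.
import os
import subprocess

HOST_BUILD_DIR = os.path.abspath("./build")      # assumed local build output
HOST_TESTS_DIR = os.path.abspath("./tests")      # assumed test sources
BASE_IMAGE = "graphscope/test-base:latest"       # assumed prebuilt-deps image

subprocess.run(
    ["docker", "run", "--rm",
     "-v", f"{HOST_BUILD_DIR}:/opt/graphscope:ro",   # locally built binaries
     "-v", f"{HOST_TESTS_DIR}:/workspace/tests:ro",  # tests from the repo
     BASE_IMAGE,
     "python3", "-m", "pytest", "/workspace/tests", "-x"],
    check=True)
```
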
What do you think? Any comments are welcome! I strongly recommend running a test by following the README, to see how complex the test setup is while the user experience stays clean and easy.
This is part of #1499.