scylla-operator icon indicating copy to clipboard operation
scylla-operator copied to clipboard

Flake: ScyllaCluster should allow to build connection pool using shard aware ports

Open tnozicka opened this issue 3 years ago • 42 comments

https://github.com/scylladb/scylla-operator/actions/runs/3056629503/jobs/4931011388#step:12:664

↺ [FLAKEY TEST - TOOK 2 ATTEMPTS TO PASS] [SLOW TEST] [716.582 seconds]
ScyllaCluster
github.com/scylladb/scylla-operator/test/e2e/set/scyllacluster/scyllacluster_shardawareness.go:25
  should allow to build connection pool using shard aware ports
  github.com/scylladb/scylla-operator/test/e2e/set/scyllacluster/scyllacluster_shardawareness.go:30

  Begin Captured StdOut/StdErr Output >>
    I0914 23:21:03.448313      47 framework/framework.go:249] "Namespace removed." Namespace="e2e-test-scyllacluster-jmx8m-nqlnz"
    I0914 23:26:03.711932      47 framework/framework.go:249] "Namespace removed." Namespace="e2e-test-scyllacluster-jmx8m-psjsw"
  << End Captured StdOut/StdErr Output

  Begin Captured GinkgoWriter Output >>
    [BeforeEach] ScyllaCluster
      github.com/scylladb/scylla-operator/test/e2e/framework/framework.go:57
    STEP: Creating a new namespace 09/14/22 23:14:07.13
    Sep 14 23:14:07.185: INFO: Created namespace "e2e-test-scyllacluster-jmx8m-nqlnz".
    STEP: Waiting for ServiceAccount "e2e-user" in namespace "e2e-test-scyllacluster-jmx8m-nqlnz". 09/14/22 23:14:07.215
    STEP: Waiting for default ServiceAccount in namespace "e2e-test-scyllacluster-jmx8m-nqlnz". 09/14/22 23:14:08.878
    [It] should allow to build connection pool using shard aware ports
      github.com/scylladb/scylla-operator/test/e2e/set/scyllacluster/scyllacluster_shardawareness.go:30
    STEP: Creating a ScyllaCluster 09/14/22 23:14:08.883
    STEP: Waiting for the ScyllaCluster to deploy 09/14/22 23:14:08.9
    Sep 14 23:17:38.950: INFO: ScyllaCluster e2e-test-scyllacluster-jmx8m-nqlnz/basic-x5qk6 (RV=4358) is rolled out
    STEP: Waiting for the driver to establish connection to shards 09/14/22 23:17:43.979
    Sep 14 23:17:43.981: INFO: Connecting to 10.103.26.49:9042 using 0 source port
    Sep 14 23:17:44.019: INFO: Connecting to 10.103.26.49:9042 using 0 source port
    Sep 14 23:17:44.063: INFO: Connecting to 10.103.26.49:9042 using 0 source port
    Sep 14 23:17:44.072: INFO: Connecting to 10.103.26.49:19042 using 32769 source port
    Sep 14 23:17:44.072: INFO: Connecting to 10.103.26.49:19042 using 32771 source port
    [AfterEach] ScyllaCluster
      github.com/scylladb/scylla-operator/test/e2e/framework/framework.go:58
    STEP: Collecting events from namespace "e2e-test-scyllacluster-jmx8m-nqlnz". 09/14/22 23:17:49.086
    STEP: Found 32 events. 09/14/22 23:17:49.189
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:27 +0000 UTC - event for basic-x5qk6-auth-token: {scyllacluster-controller } SecretCreated: Secret e2e-test-scyllacluster-jmx8m-nqlnz/basic-x5qk6-auth-token created
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:27 +0000 UTC - event for basic-x5qk6-member: {scyllacluster-controller } ServiceAccountCreated: ServiceAccount e2e-test-scyllacluster-jmx8m-nqlnz/basic-x5qk6-member created
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:27 +0000 UTC - event for basic-x5qk6-member: {scyllacluster-controller } RoleBindingCreated: RoleBinding e2e-test-scyllacluster-jmx8m-nqlnz/basic-x5qk6-member created
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:27 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a: {statefulset-controller } SuccessfulCreate: create Pod basic-x5qk6-us-east-1-us-east-1a-0 in StatefulSet basic-x5qk6-us-east-1-us-east-1a successful
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:27 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a: {scyllacluster-controller } StatefulSetCreated: StatefulSet e2e-test-scyllacluster-jmx8m-nqlnz/basic-x5qk6-us-east-1-us-east-1a created
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:27 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a: {statefulset-controller } SuccessfulCreate: create Claim data-basic-x5qk6-us-east-1-us-east-1a-0 Pod basic-x5qk6-us-east-1-us-east-1a-0 in StatefulSet basic-x5qk6-us-east-1-us-east-1a success
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:27 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {default-scheduler } FailedScheduling: 0/1 nodes are available: 1 pod has unbound immediate PersistentVolumeClaims.
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:27 +0000 UTC - event for data-basic-x5qk6-us-east-1-us-east-1a-0: {k8s.io/minikube-hostpath_fv-az626-979_558fc76d-00a8-498b-80bc-2fa6555824a2 } ProvisioningSucceeded: Successfully provisioned volume pvc-ccd7b6da-acdc-4e51-9754-cb823e77e831
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:27 +0000 UTC - event for data-basic-x5qk6-us-east-1-us-east-1a-0: {k8s.io/minikube-hostpath_fv-az626-979_558fc76d-00a8-498b-80bc-2fa6555824a2 } Provisioning: External provisioner is provisioning volume for claim "e2e-test-scyllacluster-jmx8m-nqlnz/data-basic-x5qk6-us-east-1-us-east-1a-0"
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:27 +0000 UTC - event for data-basic-x5qk6-us-east-1-us-east-1a-0: {persistentvolume-controller } ExternalProvisioning: waiting for a volume to be created, either by external provisioner "k8s.io/minikube-hostpath" or manually created by system administrator
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:28 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {default-scheduler } Scheduled: Successfully assigned e2e-test-scyllacluster-jmx8m-nqlnz/basic-x5qk6-us-east-1-us-east-1a-0 to fv-az626-979
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:37 +0000 UTC - event for basic-x5qk6-client: {scyllacluster-controller } ServiceCreated: Service e2e-test-scyllacluster-jmx8m-nqlnz/basic-x5qk6-client created
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:38 +0000 UTC - event for basic-x5qk6: {scyllacluster-controller } PodDisruptionBudgetCreated: PodDisruptionBudget e2e-test-scyllacluster-jmx8m-nqlnz/basic-x5qk6 created
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:38 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {scyllacluster-controller } ServiceCreated: Service e2e-test-scyllacluster-jmx8m-nqlnz/basic-x5qk6-us-east-1-us-east-1a-0 created
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:43 +0000 UTC - event for nodeconfig-podinfo-5f0cab8e-19f4-4e22-8749-d77b7b7d48ce: {NodeConfigCM-controller } ConfigMapCreated: ConfigMap e2e-test-scyllacluster-jmx8m-nqlnz/nodeconfig-podinfo-5f0cab8e-19f4-4e22-8749-d77b7b7d48ce created
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:47 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} FailedMount: MountVolume.SetUp failed for volume "scylla-agent-config-volume" : failed to sync secret cache: timed out waiting for the condition
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:47 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} FailedMount: MountVolume.SetUp failed for volume "scylla-agent-auth-token-volume" : failed to sync secret cache: timed out waiting for the condition
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:48 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} FailedMount: MountVolume.SetUp failed for volume "kube-api-access-2h4nx" : failed to sync configmap cache: timed out waiting for the condition
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:49 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Pulled: Container image "docker.io/scylladb/scylla-operator:ci" already present on machine
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:50 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Started: Started container sidecar-injection
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:50 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Created: Created container sidecar-injection
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:59 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Created: Created container scylla
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:59 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Started: Started container scylla
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:59 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Pulling: Pulling image "docker.io/scylladb/scylla-manager-agent:2.6.3"
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:14:59 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Pulled: Container image "docker.io/scylladb/scylla:4.6.3" already present on machine
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:16:25 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Started: Started container scylla-manager-agent
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:16:25 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Created: Created container scylla-manager-agent
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:16:25 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Pulled: Successfully pulled image "docker.io/scylladb/scylla-manager-agent:2.6.3" in 1m25.824883618s
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:16:25 +0000 UTC - event for nodeconfig-podinfo-5f0cab8e-19f4-4e22-8749-d77b7b7d48ce: {NodeConfigCM-controller } ConfigMapUpdated: ConfigMap e2e-test-scyllacluster-jmx8m-nqlnz/nodeconfig-podinfo-5f0cab8e-19f4-4e22-8749-d77b7b7d48ce updated
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:16:28 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Unhealthy: Startup probe failed: Get "http://172.17.0.28:8080/healthz": dial tcp 172.17.0.28:8080: connect: connection refused
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:16:38 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Unhealthy: Readiness probe failed: HTTP probe failed with statuscode: 500
    Sep 14 23:17:49.189: INFO: At 2022-09-14 23:16:49 +0000 UTC - event for basic-x5qk6-us-east-1-us-east-1a-0: {kubelet fv-az626-979} Unhealthy: Readiness probe failed: HTTP probe failed with statuscode: 503
    STEP: Collecting dumps from namespace "e2e-test-scyllacluster-jmx8m-nqlnz". 09/14/22 23:17:49.189
    STEP: Destroying namespace "e2e-test-scyllacluster-jmx8m-nqlnz". 09/14/22 23:17:49.747
    STEP: Waiting for namespace "e2e-test-scyllacluster-jmx8m-nqlnz" to be removed. 09/14/22 23:17:49.769

    Ginkgo: Attempt #1 Failed.  Retrying...
    [BeforeEach] ScyllaCluster
      github.com/scylladb/scylla-operator/test/e2e/framework/framework.go:57
    STEP: Creating a new namespace 09/14/22 23:21:03.449
    Sep 14 23:21:03.482: INFO: Created namespace "e2e-test-scyllacluster-jmx8m-psjsw".
    STEP: Waiting for ServiceAccount "e2e-user" in namespace "e2e-test-scyllacluster-jmx8m-psjsw". 09/14/22 23:21:03.54
    STEP: Waiting for default ServiceAccount in namespace "e2e-test-scyllacluster-jmx8m-psjsw". 09/14/22 23:21:03.58
    [It] should allow to build connection pool using shard aware ports
      github.com/scylladb/scylla-operator/test/e2e/set/scyllacluster/scyllacluster_shardawareness.go:30
    STEP: Creating a ScyllaCluster 09/14/22 23:21:03.595
    STEP: Waiting for the ScyllaCluster to deploy 09/14/22 23:21:03.617
    Sep 14 23:22:05.660: INFO: ScyllaCluster e2e-test-scyllacluster-jmx8m-psjsw/basic-lxr8t (RV=6963) is rolled out
    STEP: Waiting for the driver to establish connection to shards 09/14/22 23:22:10.[669](https://github.com/scylladb/scylla-operator/actions/runs/3056629503/jobs/4931011388#step:12:670)
    Sep 14 23:22:10.669: INFO: Connecting to 10.101.178.97:9042 using 0 source port
    Sep 14 23:22:10.[671](https://github.com/scylladb/scylla-operator/actions/runs/3056629503/jobs/4931011388#step:12:672): INFO: Connecting to 10.101.178.97:9042 using 0 source port
    Sep 14 23:22:10.[674](https://github.com/scylladb/scylla-operator/actions/runs/3056629503/jobs/4931011388#step:12:675): INFO: Connecting to 10.101.178.97:9042 using 0 source port
    Sep 14 23:22:10.[676](https://github.com/scylladb/scylla-operator/actions/runs/3056629503/jobs/4931011388#step:12:677): INFO: Connecting to 10.101.178.97:19042 using 32768 source port
    [AfterEach] ScyllaCluster
      github.com/scylladb/scylla-operator/test/e2e/framework/framework.go:58
    STEP: Collecting dumps from namespace "e2e-test-scyllacluster-jmx8m-psjsw". 09/14/22 23:22:10.777
    STEP: Destroying namespace "e2e-test-scyllacluster-jmx8m-psjsw". 09/14/22 23:22:11.016
    STEP: Waiting for namespace "e2e-test-scyllacluster-jmx8m-psjsw" to be removed. 09/14/22 23:22:11.026
  << End Captured GinkgoWriter Output
------------------------------

tnozicka avatar Sep 15 '22 07:09 tnozicka

https://github.com/scylladb/scylla-operator/actions/runs/6547133720/job/17790765111#step:3:3523

rzetelskik avatar Oct 18 '23 07:10 rzetelskik

https://github.com/scylladb/scylla-operator/actions/runs/6547133720/job/17790764781#step:3:3535

rzetelskik avatar Oct 18 '23 07:10 rzetelskik

https://github.com/scylladb/scylla-operator/actions/runs/6547133720/job/17810272565#step:3:3844

rzetelskik avatar Oct 18 '23 10:10 rzetelskik

https://github.com/scylladb/scylla-operator/actions/runs/6547133720/job/17810272346#step:3:3479

rzetelskik avatar Oct 18 '23 10:10 rzetelskik

4 shards on 2 core machine might not be good idea, maybe it's better to skip it on not-big-enough nodes

zimnx avatar Oct 18 '23 12:10 zimnx

skips usually mean bugs slipping :(

tnozicka avatar Oct 19 '23 06:10 tnozicka

Prow CI is big enough, it fails only on GH Actions where we have limited resources.

zimnx avatar Oct 19 '23 11:10 zimnx

ok, then I'd defer this one to when we finish the migration

tnozicka avatar Oct 19 '23 11:10 tnozicka

Happened on prow too https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1612/pull-scylla-operator-e2e-gke-parallel/1730532191244390400

zimnx avatar Dec 04 '23 15:12 zimnx

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1619/pull-scylla-operator-e2e-gke-parallel/1732313591878520832#1:test-build-log.txt%3A1087

tnozicka avatar Dec 06 '23 11:12 tnozicka

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1677/pull-scylla-operator-e2e-gke-parallel-clusterip/1744618657280954368#1:test-build-log.txt%3A905

rzetelskik avatar Jan 09 '24 08:01 rzetelskik

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1703/pull-scylla-operator-master-e2e-gke-parallel/1750468988346830848

zimnx avatar Jan 26 '24 11:01 zimnx

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1707/pull-scylla-operator-master-e2e-gke-parallel-clusterip/1752375983517732864

zimnx avatar Jan 31 '24 09:01 zimnx

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1724/pull-scylla-operator-master-e2e-gke-parallel/1757444225856704512

zimnx avatar Feb 13 '24 17:02 zimnx

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1734/pull-scylla-operator-master-e2e-gke-parallel/1758489560972529664

zimnx avatar Feb 16 '24 14:02 zimnx

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1781/pull-scylla-operator-v1.12-e2e-gke-parallel/1763206931012390912

zimnx avatar Feb 29 '24 15:02 zimnx

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1792/pull-scylla-operator-v1.12-e2e-gke-parallel/1764939098264440832#1:test-build-log.txt%3A1196

tnozicka avatar Mar 05 '24 11:03 tnozicka

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1836/pull-scylla-operator-master-e2e-gke-parallel-clusterip/1768622200081879040#1:test-build-log.txt%3A787

tnozicka avatar Mar 15 '24 14:03 tnozicka

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1840/pull-scylla-operator-master-e2e-gke-parallel-clusterip/1769700122335121408#1:test-build-log.txt%3A762 (although i suspect this one might be connected to persistent drives)

tnozicka avatar Mar 18 '24 14:03 tnozicka

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1871/pull-scylla-operator-master-e2e-gke-parallel/1775163020910006272#1:test-build-log.txt%3A1286

tnozicka avatar Apr 03 '24 08:04 tnozicka

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1880/pull-scylla-operator-master-e2e-gke-parallel/1775827306929983488#1:test-build-log.txt%3A805

rzetelskik avatar Apr 04 '24 12:04 rzetelskik

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1754/pull-scylla-operator-master-e2e-gke-parallel-clusterip/1780249855877189632#1:test-build-log.txt%3A725

rzetelskik avatar Apr 17 '24 07:04 rzetelskik

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1850/pull-scylla-operator-master-e2e-gke-parallel-clusterip/1782787763259576320#1:test-build-log.txt%3A896

rzetelskik avatar Apr 23 '24 16:04 rzetelskik

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1804/pull-scylla-operator-master-e2e-gke-parallel-clusterip/1782773293518950400

zimnx avatar Apr 23 '24 16:04 zimnx

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1824/pull-scylla-operator-master-e2e-gke-parallel-clusterip/1790745656973856768

zimnx avatar May 15 '24 15:05 zimnx

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1742/pull-scylla-operator-master-e2e-gke-parallel-clusterip/1791110211692400640#1:test-build-log.txt%3A833

rzetelskik avatar May 16 '24 15:05 rzetelskik

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1947/pull-scylla-operator-master-e2e-gke-parallel/1795849800504053760#1:test-build-log.txt%3A1392

tnozicka avatar May 30 '24 08:05 tnozicka

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1958/pull-scylla-operator-master-e2e-gke-parallel/1800123248852078592

zimnx avatar Jun 10 '24 12:06 zimnx

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/1977/pull-scylla-operator-master-e2e-gke-parallel-clusterip/1802651485377400832#1:test-build-log.txt%3A1039

rzetelskik avatar Jun 17 '24 12:06 rzetelskik

https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/logs/ci-scylla-operator-latest-e2e-gke-arm64-parallel/1805118759006900224#1:test-build-log.txt%3A648

rzetelskik avatar Jun 24 '24 13:06 rzetelskik