management-api-for-apache-cassandra icon indicating copy to clipboard operation
management-api-for-apache-cassandra copied to clipboard

K8SSAND-1486 ⁃ Getting error `Cannot create Driver CQLSession as the driver socket has not been created` even though Cassandra has started

Open bennythejudge opened this issue 2 years ago • 2 comments

Hi, thanks for this API, I am trying to test it , I have a kind cluster with a single node Cassandra , installed using the Bitnami Helm chart. I have your API configured as a sidecar. The container for the API is running but it's printing this error in its log:

Cannot create Driver CQLSession as the driver socket has not been created. This should resolve once Cassandra has started and created the socket at /tmp/cassandra.sock

Cassandra is up and running, this is a nodetool status:

Datacenter: datacenter1
=======================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
--  Address      Load        Tokens  Owns (effective)  Host ID                               Rack
UN  xx.xx.xx.xx  556.09 KiB  256     100.0%            1937b32f-ea42-4d8d-82b1-3f76477c76dc  rack1

I have searched I couldn't find anything useful.

Any suggestion of what I may be doing wrong? Thanks

┆Issue is synchronized with this Jira Task by Unito ┆friendlyId: K8SSAND-1486 ┆priority: Medium

bennythejudge avatar May 01 '22 22:05 bennythejudge

@bennythejudge I'll take a look at this and get back to you soon.

emerkle826 avatar May 05 '22 13:05 emerkle826

k8ssandra/k8ssandra#1447

mikekuzak avatar Jul 15 '22 12:07 mikekuzak

We are facing similar issue. with k8ssandra cluster. Following are the configs. cassandra: enabled: true resources: requests: cpu: 2 memory: 2Gi limits: cpu: 6 memory: 6Gi tolerations: [ ] cassandraLibDirVolume: storageClass: gp3 size: 5Gi heap: size: 2Gi datacenters: - name: us-east-1 size: 1 racks: - name: us-east-1a affinityLabels: topology.kubernetes.io/zone: us-east-1a

Error Log: INFO [nioEventLoopGroup-3-14] 2022-11-04 14:46:07,478 UnixSocketCQLAccess.java:88 - Cannot create Driver CQLSession as the driver socket has not been created. This should resolve once Cassandra has started and created the socket at /tmp/cassandra.sock │

INFO [nioEventLoopGroup-2-1] 2022-11-04 14:46:07,478 Cli.java:617 - address=/ url=/api/v0/probes/readiness status=500 Internal Server Error

kumarakannan avatar Nov 04 '22 15:11 kumarakannan

I too facing same issue when cluster gets restarted or we do scale down and scaleup for any upgrades/downtime Is there any workaround to fix this ?

sbbagal13 avatar Nov 18 '22 17:11 sbbagal13

I have the same problem

mmarat248 avatar Nov 23 '22 17:11 mmarat248

We have the same problem as well.

rakhya avatar Dec 02 '22 21:12 rakhya

What version are you on?

On Fri, Dec 2, 2022 at 4:20 PM rakhya @.***> wrote:

We have the same problem as well.

— Reply to this email directly, view it on GitHub https://github.com/k8ssandra/management-api-for-apache-cassandra/issues/189#issuecomment-1335854732, or unsubscribe https://github.com/notifications/unsubscribe-auth/AABJBOOQYT5HWBLZZV6MZ3TWLJRZJANCNFSM5U2PWOQQ . You are receiving this because you are subscribed to this thread.Message ID: <k8ssandra/management-api-for-apache-cassandra/issues/189/1335854732@ github.com>

--

  • John

jsanda avatar Dec 02 '22 22:12 jsanda

We are using k8ssandra/cass-operator version v1.13.1 which is latest And cassandra version 4.0.1 Management Api Image : k8ssandra/cass-management-api:4.0.1

sbbagal13 avatar Dec 02 '22 23:12 sbbagal13

We are still facing the same issue when we increase the resource or make any changes to the cluster or when there is node flip.We are using below operator versions.

  • Cass-operator - 1.10.4
  • K8ssandra operator - 1.0.1
  • K8ssandra - 1.5.1

On Fri, Dec 2, 2022 at 6:09 PM sbbagal13 @.***> wrote:

We are using k8ssandra/cass-operator version v1.13.1 which is latest And cassandra version 4.0.1 Management Api Image : k8ssandra/cass-management-api:4.0.1

— Reply to this email directly, view it on GitHub https://github.com/k8ssandra/management-api-for-apache-cassandra/issues/189#issuecomment-1335947904, or unsubscribe https://github.com/notifications/unsubscribe-auth/AVVNIUUOPKXTDVSR4RKGLQ3WLJ6TNANCNFSM5U2PWOQQ . You are receiving this because you commented.Message ID: <k8ssandra/management-api-for-apache-cassandra/issues/189/1335947904@ github.com>

kumarakannan avatar Dec 16 '22 22:12 kumarakannan

We have the same problem as well.

nuvme-devops avatar Dec 20 '22 18:12 nuvme-devops

Same issue.

okgolove avatar Jan 10 '23 11:01 okgolove

facing same issue.

Ankitsachdeva avatar Jan 12 '23 14:01 Ankitsachdeva

Note that the message itself is not an error or indicative of any error. It's just an informal message telling that the Cassandra hasn't started yet, but the error why it hasn't started is not this.

burmanm avatar Jan 12 '23 14:01 burmanm

Can someone suggest version which does not have this issue please? We are facing the issue as well however we are in early days of development so just need the cluster up.

Thank you

pankajgft avatar Jan 18 '23 18:01 pankajgft

As @burmanm said, this message indicates that the Cassandra process hasn't completed startup. However, it is not indicative of any specific issue. It is normal to see this message a few times when Cassandra is first starting up, but it should clear once the Management API agent has been created towards the very end of the Cassandra startup process.

If this message does not clear, it means that something has prevented Cassandra from fully starting. This could be for a number of reasons, and the only way to tell is to look at the Cassandra startup logs. If you are running this inside of kubernetes k8ssandra or k8ssandra-operator, you may be able to get some of the startup logs from the system-server-logger container in the pod. You can also try to exec/bash into the pod while it's starting and try to inspect the logs at /var/log/cassandra. In that directory, there should be a stderr.log file that might have more clues as to why Cassandra isn't successfully starting. There might also be something in debug.log as well.

emerkle826 avatar Jan 18 '23 20:01 emerkle826

@emerkle826 - thank you. It seems looking into system-server-logger logs Cassandra is not starting. "textPayload: "tail: cannot open '/var/log/cassandra/system.log' for reading: No such file or directory". Any option enabling verbose logging much appreciated.

pankajgft avatar Jan 19 '23 11:01 pankajgft

I've seen this behaviour many times before. As far as I understand system-manager should receive a request from k8ssandra-operator to start cassandra within a container and it does not happen for some reason. Is there a way to understand the reason?

okgolove avatar Jan 19 '23 18:01 okgolove

I also have this issue.

caniko avatar Jan 23 '23 19:01 caniko

As far as I understand system-manager should receive a request from k8ssandra-operator to start cassandra within a container and it does not happen for some reason. Is there a way to understand the reason?

The request can be seen in the cassandra container (search for /start REST call). If you don't see it happening, then the cass-operator logs often tell why it's looping over something else and not starting the resources.

However, if /start fails due to Kubernetes, then you need to search for the reason with kubectl describe / events / kubelet logs etc.

burmanm avatar Jan 23 '23 21:01 burmanm

@burmanm can you provide the list of permissions I need toward the namespace where k8ssandra operator lives? My cluster is in a separate namespace.

These details and minimum CPU and memory should be well documented, if not pre-defined in the manifests. Shouldn't the logs on the K8ssandraCluster also provide enough information, or at least point us in a plausible direction; creating GitHub issues should be the last resort where we make changes for future users to avoid the same issue.

caniko avatar Jan 24 '23 07:01 caniko

Hi, Pretty new at this, but got the same issue. but looks like a naming issue, don't know what oss-mgmt.lock is used for...

INFO  [main] 2023-02-08 10:47:16,288 IPCController.java:139 - Started Server
Started service on file:///tmp/oss-mgmt.sock
INFO  [nioEventLoopGroup-3-1] 2023-02-08 10:47:23,144 UnixSocketCQLAccess.java:88 - Cannot create Driver CQLSession as  he driver socket has not been created. This should resolve once Cassandra has started and created the socket at /tmp/cassandra.sock  

I'm using k8ssandra operator version 0.39.3, with spec using Cassandra version 4.0.5

benwaxway avatar Feb 08 '23 11:02 benwaxway

@emerkle826 - thank you. It seems looking into system-server-logger logs Cassandra is not starting. "textPayload: "tail: cannot open '/var/log/cassandra/system.log' for reading: No such file or directory". Any option enabling verbose logging much appreciated.

Hi,

I got my issue resolved. Nothing to do with k8ssandra. We are using on Google GKE. We enabled ecrypting on the Persistent Disk and need to set it "volumeBindingMode: "WaitForConsumer". It has been set to "immediate" Hope it helps

pankajgft avatar Feb 08 '23 11:02 pankajgft