confluent-kafka-dotnet icon indicating copy to clipboard operation
confluent-kafka-dotnet copied to clipboard

Debug in Linux Kills the Application without Logging Error and Causes a Core Dump

Open GrantByrn3 opened this issue 1 year ago • 4 comments

Description

Currently running into an issue where if I run the app, it runs fine, but if I run it in debug mode, it crashes the application with no error messages.

I do however, see a errors in journalctl.

Nov 19 12:03:40 fedora jetbrains-rider-278d93a4-289b-4de0-b7b8-bf1be30e7c64.desktop[7603]: 2024-11-19 12:03:40,543 [1136163]   WARN - #c.i.u.Alarm - Do not create alarm without coroutineScope: com.intellij.openapi.application.ClipboardAnalyzeListener.applicationActivated(ClipboardAnalyzeListener.java:31)
Nov 19 12:03:40 fedora jetbrains-rider-278d93a4-289b-4de0-b7b8-bf1be30e7c64.desktop[7603]: 2024-11-19 12:03:40,543 [1136163]   WARN - #c.i.u.Alarm - Do not create alarm without coroutineScope: com.intellij.openapi.application.ClipboardAnalyzeListener.applicationActivated(ClipboardAnalyzeListener.java:31)
Nov 19 12:03:40 fedora jetbrains-rider-278d93a4-289b-4de0-b7b8-bf1be30e7c64.desktop[7603]: 2024-11-19 12:03:40,959 [1136579]   WARN - #c.i.u.Alarm - Do not create alarm without coroutineScope: com.jetbrains.rider.debugger.RiderDebugRunner.K(RiderDebugRunner.kt:194)
Nov 19 12:03:46 fedora org.mozilla.firefox.desktop[17523]: Error: Failed to open Wayland display, fallback to X11. WAYLAND_DISPLAY='wayland-0' DISPLAY=':0'
Nov 19 12:03:47 fedora audit[17456]: ANOM_ABEND auid=1000 uid=1000 gid=1000 ses=3 subj=kernel pid=17456 comm="rdk:broker-1" exe="/home/grant/Code/Notifications/TSWCloud.NotificationService/bin/Debug/net9.0/TSWCloud.NotificationService" sig=5 res=1
Nov 19 12:03:47 fedora systemd-coredump[17555]: Process 17456 (TSWCloud.Notifi) of user 1000 terminated abnormally with signal 5/TRAP, processing...
Nov 19 12:03:47 fedora audit: BPF prog-id=236 op=LOAD
Nov 19 12:03:47 fedora audit: BPF prog-id=237 op=LOAD
Nov 19 12:03:47 fedora audit: BPF prog-id=238 op=LOAD
Nov 19 12:03:47 fedora systemd[1]: Started [email protected] - Process Core Dump (PID 17555/UID 0).
Nov 19 12:03:47 fedora audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=kernel msg='unit=systemd-coredump@9-17555-0 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Nov 19 12:03:47 fedora systemd-coredump[17556]: Removed old coredump core.slack.1000.2f94f52d8bfd4d728224cf34d531a8a0.6445.1731440683000000.zst.
Nov 19 12:03:47 fedora systemd-coredump[17556]: Removed old coredump core.TSWCloud\x2eNotifi.1000.2f94f52d8bfd4d728224cf34d531a8a0.866153.1731556359000000.zst.
Nov 19 12:03:47 fedora abrt-dump-journal-core[1403]: Failed to obtain all required information from journald
Nov 19 12:03:47 fedora abrt-dump-journal-core[1403]: Failed to obtain all required information from journald
Nov 19 12:03:47 fedora abrt-dump-journal-core[1403]: Failed to obtain all required information from journald
Nov 19 12:03:48 fedora systemd-coredump[17556]: [🡕] Process 17456 (TSWCloud.Notifi) of user 1000 dumped core.
                                                
                                                Module libplain.so from rpm cyrus-sasl-2.1.28-27.fc41.x86_64
                                                Module liblogin.so from rpm cyrus-sasl-2.1.28-27.fc41.x86_64
                                                Module libpcre2-8.so.0 from rpm pcre2-10.44-1.fc41.1.x86_64
                                                Module libselinux.so.1 from rpm libselinux-3.7-5.fc41.x86_64
                                                Module libkeyutils.so.1 from rpm keyutils-1.6.3-4.fc41.x86_64
                                                Module libkrb5support.so.0 from rpm krb5-1.21.3-3.fc41.x86_64
                                                Module libcom_err.so.2 from rpm e2fsprogs-1.47.1-6.fc41.x86_64
                                                Module libk5crypto.so.3 from rpm krb5-1.21.3-3.fc41.x86_64
                                                Module libkrb5.so.3 from rpm krb5-1.21.3-3.fc41.x86_64
                                                Module libgssapi_krb5.so.2 from rpm krb5-1.21.3-3.fc41.x86_64
                                                Module libgssapiv2.so from rpm cyrus-sasl-2.1.28-27.fc41.x86_64
                                                Module libsasldb.so from rpm cyrus-sasl-2.1.28-27.fc41.x86_64
                                                Module libanonymous.so from rpm cyrus-sasl-2.1.28-27.fc41.x86_64
                                                Module libcrypt.so.2 from rpm libxcrypt-4.4.36-10.fc41.x86_64
                                                Module libsasl2.so.3 from rpm cyrus-sasl-2.1.28-27.fc41.x86_64
                                                Module pkcs11.so from rpm pkcs11-provider-0.5-4.fc41.x86_64
                                                Module libz.so.1 from rpm zlib-ng-2.1.7-3.fc41.x86_64
                                                Module libcrypto.so.3 from rpm openssl-3.2.2-9.fc41.x86_64
                                                Module libssl.so.3 from rpm openssl-3.2.2-9.fc41.x86_64
                                                Module libnss_resolve.so.2 from rpm systemd-256.8-1.fc41.x86_64
                                                Module libnss_mdns4_minimal.so.2 from rpm nss-mdns-0.15.1-12.fc41.x86_64
                                                Module libcap.so.2 from rpm libcap-2.70-4.fc41.x86_64
                                                Module libnss_myhostname.so.2 from rpm systemd-256.8-1.fc41.x86_64
                                                Module libicui18n.so.74 from rpm icu-74.2-2.fc41.x86_64
                                                Module libicudata.so.74 from rpm icu-74.2-2.fc41.x86_64
                                                Module libicuuc.so.74 from rpm icu-74.2-2.fc41.x86_64
                                                Stack trace of thread 17554:
                                                #0  0x00007fb74dce14c1 n/a (n/a + 0x0)
                                                #1  0x00007f76738910c9 n/a (/home/grant/Code/Notifications/TSWCloud.NotificationService/bin/Debug/net9.0/runtimes/linux-x64/native/librdkafka.so + 0x910c9)
                                                ELF object binary architecture: AMD x86-64
Nov 19 12:03:48 fedora systemd[1]: [email protected]: Deactivated successfully.
Nov 19 12:03:48 fedora audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=kernel msg='unit=systemd-coredump@9-17555-0 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Nov 19 12:03:48 fedora audit: BPF prog-id=238 op=UNLOAD
Nov 19 12:03:48 fedora audit: BPF prog-id=237 op=UNLOAD
Nov 19 12:03:48 fedora audit: BPF prog-id=236 op=UNLOAD
Nov 19 12:03:48 fedora systemd[1]: [email protected]: Consumed 1.324s CPU time, 620.6M memory peak.
Nov 19 12:03:49 fedora abrt-server[17573]: Executable '/home/grant/Code/Notifications/TSWCloud.NotificationService/bin/Debug/net9.0/TSWCloud.NotificationService' doesn't belong to any package and ProcessUnpackaged is set to 'no'
Nov 19 12:03:49 fedora abrt-server[17573]: 'post-create' on '/var/spool/abrt/ccpp-2024-11-19-12:03:48.912282-17456' exited with 1
Nov 19 12:03:49 fedora abrt-server[17573]: Deleting problem directory '/var/spool/abrt/ccpp-2024-11-19-12:03:48.912282-17456'

Checklist

  • Confluent.Kafka nuget version. -- Experienced this issue on both 2.6.0 and 2.6.1
  • Apache Kafka version. -- This is running locally. Don't currently have a kafka instance spun up
  • Operating system. -- Fedora workstation 41

GrantByrn3 avatar Nov 19 '24 18:11 GrantByrn3

One thing to add to this is that it only happens when running in debug. Just doing a dotnet run doesn't have the same issue.

GrantByrn3 avatar Nov 21 '24 21:11 GrantByrn3

Did you manage to fix this in any way? I think we are experiencing something similar

adrian-gheorghe avatar Sep 04 '25 11:09 adrian-gheorghe

Also happens, tried Ubuntu and Arch.. It doesn't crash if I attach gdb to it though.

Process 3783821 (...) of user 1000 dumped core.
                                                     
Stack trace of thread 3783913:
#0  0x00007f067a98a6f1 n/a (n/a + 0x0)
#1  0x00007ec50e892279 rd_kafka_log0 (librdkafka.so + 0x92279)
#2  0x00007ec50e8b28b6 rd_kafka_broker_thread_main (librdkafka.so + 0xb28b6)
#3  0x00007f06f269685b n/a (libc.so.6 + 0x9685b)
#4  0x00007f06f271aa0c n/a (libc.so.6 + 0x11aa0c)
                                                     
Stack trace of thread 3783823:
#0  0x00007f06f271876d syscall (libc.so.6 + 0x11876d)
#1  0x00007f06f259a2b9 n/a (liblttng-ust.so.0 + 0x172b9)
#2  0x00007f06f26969cb n/a (libc.so.6 + 0x969cb)
#3  0x00007f06f271aa0c n/a (libc.so.6 + 0x11aa0c)
{the rest here are the same as 3783823 or don't have symbols}
ELF object binary architecture: AMD x86-64

nepolak avatar Oct 07 '25 12:10 nepolak

Did you manage to fix this in any way? I think we are experiencing something similar

I've managed to "fix" it by compiling librdkafka with rd_kafka_log0(...) commented out and substituting it in nupkg cache.

nepolak avatar Oct 07 '25 12:10 nepolak