zfs icon indicating copy to clipboard operation
zfs copied to clipboard

[2.3.5] kernel driver bug stacktrace

Open pasztorl opened this issue 4 weeks ago • 2 comments

System information

Type Version/Name
Distribution Name Ubuntu
Distribution Version noble
Kernel Version 6.14.0-36-generic
Architecture amd64
OpenZFS Version 2.3.5

Describe the problem you're observing

kernel oops in dmesg:

Dec 02 21:16:40 n2c1r1 kernel: ------------[ cut here ]------------
Dec 02 21:16:40 n2c1r1 kernel: WARNING: CPU: 12 PID: 3253673 at mm/page-writeback.c:2541 writeback_iter+0x124/0x170
Dec 02 21:16:40 n2c1r1 kernel: Modules linked in: dm_mirror dm_region_hash dm_log zfs(POE) spl(OE) algif_hash af_alg xt_MASQUERADE xt_set xt_addrtype tcp_diag inet_diag ceph netfs xfrm_user xfrm_algo vxlan ip6_udp_tunnel udp_tunnel xt_TPROXY nf_tproxy_ipv6 nf_tproxy_ipv4 nft_chain_nat xt_CT xt_mark veth xt_socket nf_socket_ipv4 nf_socket_ipv6 ip6table_filter ip6table_raw ip6table_mangle ip6_tables iptable_filter iptable_raw iptable_mangle iptable_nat nf_nat ip_set sch_fq xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xt_comment nft_compat nf_tables 8021q garp mrp qrtr zram 842_decompress 842_compress lz4hc_compress overlay lz4_compress amdgpu ipmi_ssif binfmt_misc amd_atl intel_rapl_msr intel_rapl_common amd64_edac edac_mce_amd amdxcp gpu_sched drm_panel_backlight_quirks drm_buddy drm_ttm_helper nls_iso8859_1 ttm kvm_amd drm_exec drm_suballoc_helper drm_display_helper cec bonding kvm acpi_ipmi ipmi_si irqbypass ast rc_core ipmi_devintf i2c_piix4 rapl ccp k10temp i2c_algo_bit i2c_smbus ipmi_msghandler input_leds joydev mac_hid
Dec 02 21:16:40 n2c1r1 kernel:  cfg80211 sch_fq_codel rbd libceph nbd dm_multipath dm_thin_pool dm_persistent_data dm_bio_prison dm_snapshot dm_bufio br_netfilter bridge stp llc nvme_fabrics nvme_keyring efi_pstore nfnetlink dmi_sysfs ip_tables x_tables autofs4 btrfs blake2b_generic raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 linear mlx5_ib ib_uverbs macsec ib_core cdc_ether usbnet mii hid_generic usbhid hid mlx5_core polyval_clmulni polyval_generic ghash_clmulni_intel nvme sha256_ssse3 mlxfw sha1_ssse3 psample nvme_core tls nvme_auth pci_hyperv_intf video wmi aesni_intel crypto_simd cryptd [last unloaded: spl(OE)]
Dec 02 21:16:40 n2c1r1 kernel: CPU: 12 UID: 0 PID: 3253673 Comm: kworker/u130:2 Tainted: P           OE      6.14.0-36-generic #36~24.04.1-Ubuntu
Dec 02 21:16:40 n2c1r1 kernel: Tainted: [P]=PROPRIETARY_MODULE, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
Dec 02 21:16:40 n2c1r1 kernel: Hardware name: Supermicro AS -3015MR-H8TNR/H13SRD-F, BIOS 1.3 09/18/2024
Dec 02 21:16:40 n2c1r1 kernel: Workqueue: writeback wb_workfn (flush-zfs-9)
Dec 02 21:16:40 n2c1r1 kernel: RIP: 0010:writeback_iter+0x124/0x170
Dec 02 21:16:40 n2c1r1 kernel: Code: 83 7b 20 01 48 89 b3 38 01 00 00 74 08 a8 04 0f 84 4a ff ff ff 48 8b 53 18 48 c1 fa 0c 4c 89 e7 e8 31 d8 ff ff e9 35 ff ff ff <0f> 0b 83 fe 01 0f 84 51 ff ff ff f6 43 24 10 74 1c 48 8b 32 48 8b
Dec 02 21:16:40 n2c1r1 kernel: RSP: 0018:ffffccd03dcef8e0 EFLAGS: 00010202
Dec 02 21:16:40 n2c1r1 kernel: RAX: 0000000000002bff RBX: ffffccd03dcefae0 RCX: 000000000000007a
Dec 02 21:16:40 n2c1r1 kernel: RDX: fffff42ecf3015c0 RSI: 0000000000000000 RDI: ffff8a6f5f50d220
Dec 02 21:16:40 n2c1r1 kernel: RBP: ffffccd03dcef8f8 R08: 0000000000000000 R09: 0000000000000000
Dec 02 21:16:40 n2c1r1 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffff8a6f5f50d220
Dec 02 21:16:40 n2c1r1 kernel: R13: ffffccd03dcef90c R14: ffffccd03dcef95c R15: ffffffffc3293a20
Dec 02 21:16:40 n2c1r1 kernel: FS:  0000000000000000(0000) GS:ffff8a7bfde00000(0000) knlGS:0000000000000000
Dec 02 21:16:40 n2c1r1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 02 21:16:40 n2c1r1 kernel: CR2: 000000c0005ff000 CR3: 000000012d1ea000 CR4: 0000000000f50ef0
Dec 02 21:16:40 n2c1r1 kernel: PKRU: 55555554
Dec 02 21:16:40 n2c1r1 kernel: Call Trace:
Dec 02 21:16:40 n2c1r1 kernel:  <TASK>
Dec 02 21:16:40 n2c1r1 kernel:  write_cache_pages+0x4c/0xb0
Dec 02 21:16:40 n2c1r1 kernel:  zpl_writepages+0xb0/0x1d0 [zfs]
Dec 02 21:16:40 n2c1r1 kernel:  do_writepages+0x83/0x290
Dec 02 21:16:40 n2c1r1 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Dec 02 21:16:40 n2c1r1 kernel:  ? psi_group_change+0x201/0x4e0
Dec 02 21:16:40 n2c1r1 kernel:  __writeback_single_inode+0x44/0x290
Dec 02 21:16:40 n2c1r1 kernel:  writeback_sb_inodes+0x255/0x550
Dec 02 21:16:40 n2c1r1 kernel:  __writeback_inodes_wb+0x54/0x100
Dec 02 21:16:40 n2c1r1 kernel:  ? queue_io+0x113/0x120
Dec 02 21:16:40 n2c1r1 kernel:  wb_writeback+0x1ac/0x330
Dec 02 21:16:40 n2c1r1 kernel:  ? get_nr_dirty_inodes+0x60/0xc0
Dec 02 21:16:40 n2c1r1 kernel:  wb_do_writeback+0x223/0x2a0
Dec 02 21:16:40 n2c1r1 kernel:  wb_workfn+0x4c/0x150
Dec 02 21:16:40 n2c1r1 kernel:  process_one_work+0x178/0x3d0
Dec 02 21:16:40 n2c1r1 kernel:  worker_thread+0x2de/0x410
Dec 02 21:16:40 n2c1r1 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Dec 02 21:16:40 n2c1r1 kernel:  ? _raw_spin_lock_irqsave+0xe/0x20
Dec 02 21:16:40 n2c1r1 kernel:  ? __pfx_worker_thread+0x10/0x10
Dec 02 21:16:40 n2c1r1 kernel:  kthread+0xfb/0x230
Dec 02 21:16:40 n2c1r1 kernel:  ? __pfx_kthread+0x10/0x10
Dec 02 21:16:40 n2c1r1 kernel:  ret_from_fork+0x44/0x70
Dec 02 21:16:40 n2c1r1 kernel:  ? __pfx_kthread+0x10/0x10
Dec 02 21:16:40 n2c1r1 kernel:  ret_from_fork_asm+0x1a/0x30
Dec 02 21:16:40 n2c1r1 kernel:  </TASK>
Dec 02 21:16:40 n2c1r1 kernel: ---[ end trace 0000000000000000 ]---

pasztorl avatar Dec 03 '25 08:12 pasztorl

Not an oops, just an internal warning inside the kernel. Was there any other effect? Did things just appear to keep going? Anything else happeing at the time; errors reported in zpool status, disk stuff in dmesg, etc?

I have an idea where it's coming from anyway; we might just not be cleaning up something properly in a callback. I'll look soon.

robn avatar Dec 03 '25 08:12 robn

thanks for the quick response. i runned zpool scrub, no errors reported. there was no effect, "things just appear to keep going" :)

pasztorl avatar Dec 03 '25 08:12 pasztorl