zfs
zfs copied to clipboard
Consistent NULL dereference during ZTS on Linux/sparc64
System information
| Type | Version/Name |
|---|---|
| Distribution Name | Debian |
| Distribution Version | sid |
| Linux Kernel | 4.15.0-2-sparc64 |
| Architecture | sparc64 |
| ZFS Version | 7457b024b* |
| (The * is "plus the trivial patch to fix #12008" ) |
Describe the problem you're observing
Ran a subset of ZTS twice, both times it spit this out.
The first time, it was hung in cli_root/zfs_receive/zfs_receive_raw.ksh; the second time, cli_root/zfs_receive/zfs_receive_raw_-d.ksh
Describe how to reproduce the problem
Run this runfile on sparc64.
Include any warning/errors/backtraces from the system logs
[35767.884793] ZFS: Loaded module v2.1.99-174_g7457b024b, ZFS pool version 5000, ZFS filesystem version 5
[36129.229830] INFO: task kworker/0:1:173284 blocked for more than 120 seconds.
[36129.322668] Tainted: P O 4.15.0-2-sparc64 #1 Debian 4.15.11-1
[36129.421134] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[36129.524213] kworker/0:1 D 0 173284 2 0x03000000
[36129.597516] Workqueue: events cgwb_release_workfn
[36129.659357] Call Trace:
[36129.691502] [00000000009f8d04] schedule+0x24/0xa0
[36129.754540] [00000000009f94ec] bit_wait+0xc/0x60
[36129.816400] [00000000009f9120] __wait_on_bit+0x60/0xe0
[36129.885121] [00000000009f9224] out_of_line_wait_on_bit+0x84/0xa0
[36129.965299] [000000000058c6c8] wb_shutdown+0x108/0x140
[36130.034040] [000000000058de2c] cgwb_release_workfn+0xc/0x120
[36130.109659] [000000000047e4b8] process_one_work+0x158/0x400
[36130.184090] [000000000047e8ac] worker_thread+0x14c/0x500
[36130.255119] [0000000000484b08] kthread+0x108/0x120
[36130.319273] [0000000000405f64] ret_from_fork+0x1c/0x2c
[36130.388076] [0000000000000000] (null)
[36252.074347] INFO: task kworker/0:1:173284 blocked for more than 120 seconds.
[36252.167183] Tainted: P O 4.15.0-2-sparc64 #1 Debian 4.15.11-1
[36252.265680] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[36252.368766] kworker/0:1 D 0 173284 2 0x03000000
[36252.442091] Workqueue: events cgwb_release_workfn
[36252.503928] Call Trace:
[36252.536082] [00000000009f8d04] schedule+0x24/0xa0
[36252.599092] [00000000009f94ec] bit_wait+0xc/0x60
[36252.660969] [00000000009f9120] __wait_on_bit+0x60/0xe0
[36252.729712] [00000000009f9224] out_of_line_wait_on_bit+0x84/0xa0
[36252.809875] [000000000058c6c8] wb_shutdown+0x108/0x140
[36252.878608] [000000000058de2c] cgwb_release_workfn+0xc/0x120
[36252.954220] [000000000047e4b8] process_one_work+0x158/0x400
[36253.028663] [000000000047e8ac] worker_thread+0x14c/0x500
[36253.099692] [0000000000484b08] kthread+0x108/0x120
[36253.163853] [0000000000405f64] ret_from_fork+0x1c/0x2c
[36253.232573] [0000000000000000] (null)
[36374.973680] INFO: task kworker/0:1:173284 blocked for more than 120 seconds.
[36375.066472] Tainted: P O 4.15.0-2-sparc64 #1 Debian 4.15.11-1
[36375.165019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[36375.268127] kworker/0:1 D 0 173284 2 0x03000000
[36375.341485] Workqueue: events cgwb_release_workfn
[36375.403316] Call Trace:
[36375.435471] [00000000009f8d04] schedule+0x24/0xa0
[36375.498479] [00000000009f94ec] bit_wait+0xc/0x60
[36375.560372] [00000000009f9120] __wait_on_bit+0x60/0xe0
[36375.629094] [00000000009f9224] out_of_line_wait_on_bit+0x84/0xa0
[36375.709267] [000000000058c6c8] wb_shutdown+0x108/0x140
[36375.777994] [000000000058de2c] cgwb_release_workfn+0xc/0x120
[36375.853607] [000000000047e4b8] process_one_work+0x158/0x400
[36375.928056] [000000000047e8ac] worker_thread+0x14c/0x500
[36375.999078] [0000000000484b08] kthread+0x108/0x120
[36376.063241] [0000000000405f64] ret_from_fork+0x1c/0x2c
[36376.131959] [0000000000000000] (null)
[36456.759064] Unable to handle kernel NULL pointer dereference
[36456.833550] tsk->{mm,active_mm}->context = 00000000000011b2
[36456.906873] tsk->{mm,active_mm}->pgd = fffff8000012c000
[36456.975574] \|/ ____ \|/
[36456.975574] "@'/ .. \`@"
[36456.975574] /_| \__/ |_\
[36456.975574] \__U_/
[36457.169011] zfs(189510): Oops [#1]
[36457.213699] CPU: 0 PID: 189510 Comm: zfs Tainted: P O 4.15.0-2-sparc64 #1 Debian 4.15.11-1
[36457.338509] TSTATE: 0000004411001606 TPC: 00000000108fe1cc TNPC: 00000000108fe1d0 Y: 00000c80 Tainted: P O
[36457.486036] TPC: <abd_verify+0xc/0xa0 [zfs]>
[36457.542182] g0: 0000000000000000 g1: 0000000000000000 g2: 0000000000000000 g3: fffff80035f7c000
[36457.656670] g4: fffff80005bf8000 g5: 0000000000b22c9c g6: fffff80035f7c000 g7: 0000000000000002
[36457.771163] o0: 0000000000000001 o1: fffff80034e32c4c o2: fffff8003c4bd000 o3: 0000000000000000
[36457.885693] o4: 000000000000260e o5: fffff8003c4bd038 sp: fffff80035f7e471 ret_pc: 000000000048cb84
[36458.004737] RPC: <finish_task_switch+0x44/0x1c0>
[36458.065437] l0: fffff80034b8f2b0 l1: fffff80034b8f2b0 l2: 0000000000000000 l3: 0000000000000001
[36458.179935] l4: 00000000000010fb l5: 0000000010a4dc50 l6: 0000000000000000 l7: 0000000000000008
[36458.294422] i0: 0000000000000000 i1: 0000000000bd9f10 i2: 0000000000000002 i3: fffff80005bf8000
[36458.408937] i4: fffff80034e32c00 i5: 0000000000bd9c00 i6: fffff80035f7e521 i7: 00000000108ff20c
[36458.523771] I7: <abd_borrow_buf+0xc/0x60 [zfs]>
[36458.583295] Call Trace:
[36458.615780] [00000000108ff20c] abd_borrow_buf+0xc/0x60 [zfs]
[36458.691705] [00000000108ff3b0] abd_borrow_buf_copy+0x10/0x40 [zfs]
[36458.774579] [0000000010a2b310] zio_crypt_copy_dnode_bonus+0x10/0x140 [zfs]
[36458.866511] [0000000010907aa8] arc_buf_fill+0x388/0x940 [zfs]
[36458.943564] [0000000010908080] arc_untransform+0x20/0xa0 [zfs]
[36459.021767] [00000000109169c4] dbuf_read_verify_dnode_crypt+0xc4/0x160 [zfs]
[36459.115984] [000000001091a780] dbuf_read_impl.constprop.20+0x240/0x5a0 [zfs]
[36459.210195] [000000001091ab78] dbuf_read+0x98/0x4c0 [zfs]
[36459.282770] [0000000010922f10] dmu_buf_hold+0x50/0x80 [zfs]
[36459.357594] [00000000109d67e0] zap_lockdir+0x20/0x80 [zfs]
[36459.431320] [00000000109d6c18] zap_lookup_norm+0x38/0xa0 [zfs]
[36459.509644] [00000000109d6cb0] zap_lookup+0x30/0x40 [zfs]
[36459.582222] [0000000010a1c72c] zfs_get_zplprop+0x8c/0x1c0 [zfs]
[36459.661659] [0000000010a1c888] zfsvfs_init+0x28/0x4e0 [zfs]
[36459.736549] [0000000010a1cf34] zfsvfs_create_impl+0x1f4/0x2a0 [zfs]
[36459.820589] [0000000010a1d054] zfsvfs_create+0x74/0xc0 [zfs]
[36459.896506] Caller[00000000108ff20c]: abd_borrow_buf+0xc/0x60 [zfs]
[36459.979277] Caller[00000000108ff3b0]: abd_borrow_buf_copy+0x10/0x40 [zfs]
[36460.068990] Caller[0000000010a2b310]: zio_crypt_copy_dnode_bonus+0x10/0x140 [zfs]
[36460.167813] Caller[0000000010907aa8]: arc_buf_fill+0x388/0x940 [zfs]
[36460.251725] Caller[0000000010908080]: arc_untransform+0x20/0xa0 [zfs]
[36460.336793] Caller[00000000109169c4]: dbuf_read_verify_dnode_crypt+0xc4/0x160 [zfs]
[36460.437875] Caller[000000001091a780]: dbuf_read_impl.constprop.20+0x240/0x5a0 [zfs]
[36460.539049] Caller[000000001091ab78]: dbuf_read+0x98/0x4c0 [zfs]
[36460.618393] Caller[0000000010922f10]: dmu_buf_hold+0x50/0x80 [zfs]
[36460.700079] Caller[00000000109d67e0]: zap_lockdir+0x20/0x80 [zfs]
[36460.780664] Caller[00000000109d6c18]: zap_lookup_norm+0x38/0xa0 [zfs]
[36460.865834] Caller[00000000109d6cb0]: zap_lookup+0x30/0x40 [zfs]
[36460.945280] Caller[0000000010a1c72c]: zfs_get_zplprop+0x8c/0x1c0 [zfs]
[36461.031599] Caller[0000000010a1c888]: zfsvfs_init+0x28/0x4e0 [zfs]
[36461.113333] Caller[0000000010a1cf34]: zfsvfs_create_impl+0x1f4/0x2a0 [zfs]
[36461.204219] Caller[0000000010a1d054]: zfsvfs_create+0x74/0xc0 [zfs]
[36461.287095] Caller[0000000010a1d0e0]: zfs_domount+0x40/0x4a0 [zfs]
[36461.368832] Caller[0000000010a2ff5c]: zpl_mount+0x11c/0x180 [zfs]
[36461.449000] Caller[00000000005f0984]: mount_fs+0x24/0x160
[36461.520023] Caller[000000000060fa58]: vfs_kern_mount.part.7+0x38/0x120
[36461.605907] Caller[0000000000612770]: do_mount+0x6b0/0xce0
[36461.678072] Caller[0000000000613144]: SyS_mount+0x44/0xc0
[36461.749101] Caller[0000000000406114]: linux_sparc_syscall+0x34/0x44
[36461.831554] Caller[fffff801001656ec]: 0xfffff801001656ec
[36461.901418] Instruction DUMP:
[36461.901427] 9de3bf50
[36461.940413] 01000000
[36461.971398] 01000000
[36462.002387] <c2060000>
[36462.033376] 80886001
[36462.064364] 12600019
[36462.095352] 80886040
[36462.126340] 02600019
[36462.157327] 01000000
Just noticed this looks the same as #12257. "Great", one more bug that'll get closed whenever that gets fixed.
This issue has been automatically marked as "stale" because it has not had any activity for a while. It will be closed in 90 days if no further activity occurs. Thank you for your contributions.