[fix] Bookie Info lost by notification race condition. #20642

Open gaozhangmin opened this issue 4 months ago • 7 comments

According to Pulsar's fix (https://github.com/apache/pulsar/pull/20642), the same race condition also exists in BookKeeper's ZKRegistrationClient.java.

gaozhangmin avatar Aug 07 '25 12:08 gaozhangmin

Can you provide some information about the process of locating the problem you encountered, or is this just a simple logic synchronization with Pulsar?

StevenLuMT avatar Aug 07 '25 21:08 StevenLuMT

@StevenLuMT
After a bookie was switched from read-only to writable while online, the broker continuously logged the error "Bookie Handle not available." The issue was resolved only after restarting the bookie. It turned out that there was a bug in the ZkRegisterClient that the Pulsar broker relies on, which caused an update to the broker's bookie status cache to be lost. Although the bookie was actually in a normal state, the broker could not resolve the bookie's hostname because the cache entry was missing, which produced the "Bookie Handle not available" error.
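
To illustrate the kind of lost update described above, here is a minimal sketch. It is not the actual ZKRegistrationClient code and not the fix from the linked PR. It assumes a hypothetical cache keyed by bookie ID and a hypothetical monotonically increasing version attached to each notification; the class name, method names, address, and version numbers are all invented for illustration. When the "removed from read-only" and "added to writable" notifications are handled out of order, an unguarded cache drops the bookie's entry, while a cache that ignores stale versions keeps it.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: NOT the real ZKRegistrationClient or its fix.
public class BookieInfoCacheSketch {
    private final Map<String, String> info = new HashMap<>();
    private final Map<String, Long> lastVersion = new HashMap<>();
    private final boolean guarded;

    BookieInfoCacheSketch(boolean guarded) {
        this.guarded = guarded;
    }

    // address == null means "bookie removed"; version is an assumed,
    // monotonically increasing sequence number for the notification.
    synchronized void onNotification(String bookieId, String address, long version) {
        if (guarded) {
            Long seen = lastVersion.get(bookieId);
            if (seen != null && seen >= version) {
                return; // stale notification: a newer one was already applied
            }
            lastVersion.put(bookieId, version);
        }
        if (address == null) {
            info.remove(bookieId);
        } else {
            info.put(bookieId, address);
        }
    }

    synchronized String lookup(String bookieId) {
        return info.get(bookieId);
    }

    public static void main(String[] args) {
        for (boolean guarded : new boolean[] {false, true}) {
            BookieInfoCacheSketch cache = new BookieInfoCacheSketch(guarded);
            // Bookie is initially registered as read-only.
            cache.onNotification("bookie-1", "10.0.0.1:3181", 1L);
            // Read-only -> writable transition: the removal (v2) and the
            // re-registration (v3) are handled in reverse order.
            cache.onNotification("bookie-1", "10.0.0.1:3181", 3L);
            cache.onNotification("bookie-1", null, 2L);
            System.out.println("guarded=" + guarded + " -> " + cache.lookup("bookie-1"));
        }
    }
}
```

In the unguarded run the late removal wipes the entry, so a lookup returns null even though the bookie is registered; this mirrors the symptom of a healthy bookie that the broker cannot resolve. Real ZooKeeper watch events on different paths do not carry a shared version like this, so an actual fix would need another way to order or serialize the updates.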

gaozhangmin avatar Aug 08 '25 02:08 gaozhangmin

rerun failure checks

gaozhangmin avatar Aug 11 '25 03:08 gaozhangmin

rerun failure checks

gaozhangmin avatar Aug 11 '25 07:08 gaozhangmin

@gaozhangmin Is it different from this fix #4481?

wenbingshen avatar Aug 12 '25 06:08 wenbingshen

@gaozhangmin Is it different from this fix #4481?

It's a different case.

gaozhangmin avatar Aug 13 '25 03:08 gaozhangmin

rerun failure checks

gaozhangmin avatar Aug 13 '25 08:08 gaozhangmin