incubator-uniffle
[Bug] The metric `grpc_open` is sometimes incorrect
We found that the value of `grpc_open` is sometimes very large (>1000) even when no application is running in our cluster.
Could you provide more detailed information? Could you add some logs to help us solve this problem?
No logs, we just observed this behavior. Maybe `org.apache.uniffle.common.rpc.MonitoringServerCall#close` is not called sometimes. I tried calling `decCounter` in `MonitoringServerCallListener#onComplete`/`onCancel` and it works, but I don't know the real reason.
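To illustrate the workaround, here is a minimal sketch with no gRPC dependency. The class names `OpenCallGauge` and `TrackedCall` are hypothetical and are not the actual Uniffle classes. It assumes the gauge is incremented once when a call starts, and it guards the decrement with a flag so that `close`, `onCancel`, and `onComplete` can all release the call without decrementing twice.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical stand-in for the grpc_open gauge; not Uniffle's metric class.
class OpenCallGauge {
    private final AtomicLong open = new AtomicLong();

    void inc() { open.incrementAndGet(); }
    void dec() { open.decrementAndGet(); }
    long get() { return open.get(); }
}

// Hypothetical per-call tracker: counted once on creation, released at most once.
class TrackedCall {
    private final OpenCallGauge gauge;
    private final AtomicBoolean released = new AtomicBoolean(false);

    TrackedCall(OpenCallGauge gauge) {
        this.gauge = gauge;
        gauge.inc();
    }

    // Only the first caller wins the compare-and-set, so the gauge is
    // decremented exactly once no matter how many paths fire.
    private void release() {
        if (released.compareAndSet(false, true)) {
            gauge.dec();
        }
    }

    void close() { release(); }      // normal path
    void onCancel() { release(); }   // client cancelled or connection dropped
    void onComplete() { release(); } // call finished
}

class OpenCallGaugeSketch {
    public static void main(String[] args) {
        OpenCallGauge gauge = new OpenCallGauge();

        TrackedCall normal = new TrackedCall(gauge);
        TrackedCall cancelled = new TrackedCall(gauge);

        normal.close();
        normal.onComplete();  // second release is a no-op
        cancelled.onCancel(); // close() never ran for this call

        System.out.println("open calls: " + gauge.get());
    }
}
```

With a guard like this, decrementing from the listener callbacks is safe even when `close` is also called for the same call.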
I guess that the `close` method might not be called in some cases if there is an exception.
cc @colinmjj, do you remember our flaky metric test? I guess it's caused by this issue.