rkubelog icon indicating copy to clipboard operation
rkubelog copied to clipboard

Intermittent logging issues in Papertrail

Open giovannif23 opened this issue 4 years ago • 11 comments
trafficstars

A couple weeks ago we noticed logs stopped flowing from our EKS nodeless clusters to Solarwinds. We restarted the rkubelog pod and it began reporting back to Solarwinds. 2 hours later reporting stopped again, restarting the pod again solved the problem in the short terms. Fast forward to this morning and the pods stopped reporting logs, 2 hours later they started reporting logs again without doing anything on our end. We continue to have intermittent logging issues. We saw both issues on two environments. Restarting has stopped the logging issue in one environment, however, we are still having issues in another. We have restarted several times over the past week and it will stop log flowing after a day or so.

giovannif23 avatar Mar 03 '21 16:03 giovannif23

Hello @giovannif23, thx for reporting this issue. Can you please tell me which version of rkubelog are you using?

girishranganathan avatar Mar 03 '21 16:03 girishranganathan

@girishranganathan we are using r17.

giovannif23 avatar Mar 03 '21 20:03 giovannif23

Thx @giovannif23. I will try to investigate this on my end. While I investigate this further: can you please give this image a try: quay.io/solarwinds/rkubelog:r19-b2 ?

girishranganathan avatar Mar 03 '21 20:03 girishranganathan

@girishranganathan I'll give it a try and I'll let you know. Thank you.

giovannif23 avatar Mar 03 '21 21:03 giovannif23

@giovannif23 can you please try this new image: quay.io/solarwinds/rkubelog:r19rc2? The earlier has the same issue you had reported. I have tried to fix the issue in this new version + i tested this new image on an mid-sized eks cluster for almost a week.

girishranganathan avatar Mar 10 '21 19:03 girishranganathan

@girishranganathan we're still seeing this issue with rkubelog:r19rc2. I've been unable to identify any pattern to when or why it happens, but a simple restart always fixes the problem. The environment in question is on kubernetes 1.19 if that's useful information.

cfroystad avatar May 06 '21 11:05 cfroystad

having the exact same issue with rkubelog:r19-b2. is there any fix available?

vovxox avatar Nov 09 '21 22:11 vovxox

also having same issue. I have to restart rkubelog pod every 3-4 days for it to reset and start logging again. Please investigate. When logging stops we no longer can get notified of issues so instability is causing concern.

dev-on2air avatar Nov 27 '21 17:11 dev-on2air

any chance this get fixed? such a pain...

dev-on2air avatar Jan 25 '22 21:01 dev-on2air

This is still a problem on the critical path.

I will have to move quickly. We do require to have logging working by the end of August. What can we do?

Please escalate within papertrail R&D!

hholst80 avatar Jul 03 '22 11:07 hholst80

We are still seeing this issue. Any fix to this issue yet?

ingin97 avatar May 24 '23 08:05 ingin97