Results 8 comments of Rahul Gupta

Thanks for your effort, @daipom. This fix/clarification is important imo, as many dashboard metrics depend on Prometheus and can cause noise if someone interprets this the wrong way.

Can you try killing the fluentd process? I can’t figure out the exact scenario to reproduce this issue. What I’ve noticed is that, in the normal scenario, we have two...

I understand that without a location you don’t know where to send it. But since retry_forever is true, fluentd keeps on retrying this chunk. What I’ve noticed is that instead...

Another approach is to find a way to keep this problem from appearing in the first place. I’ve seen this appear frequently. Around 3-5 unique /160 hosts are facing this...

I’ll definitely add secondary_file. One question: if I use retry_timeout / retry_max_times, how will my retries work in this case? 1. If 1 buffer has exhausted the retry parameter it...

Thank you for the secondary_file workaround. It will help to manually recover and send logs in case of failures. It would, however, be great if we could have retries or a solution that...

I’ve added graceful kill commands to kill the running process and around 10 seconds of sleep for restarts. However, we have a process monitor that checks if fluentd is running or...