grpc-go

Reasoning about retry throttling parameters

Open roanta opened this issue 1 year ago • 0 comments

Hi!

I would like to reason about retry throttling in terms of an upper bound on traffic amplification, relative to the steady state of my service. It looks like token_count is equivalent to successRate * tokenRatio, which is close to what I need. However, I don't know how to reasonably set maxTokens, which is also used to derive the threshold. Part of the problem is that maxTokens is defined in absolute terms, rather than relative to the traffic rate.

  • Am I missing an obvious way to set maxTokens?
  • Is it possible to "bend" this implementation to conform to the semantics I'm describing?
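
To make the question concrete, here is a minimal sketch of the mechanism as I understand it from the retry design (gRFC A6): token_count starts at maxTokens, each failed RPC subtracts 1, each successful RPC adds tokenRatio, and retries are only allowed while token_count is above maxTokens / 2. The `throttler` type and its method names are hypothetical and only model the rule; this is not the grpc-go implementation.

```go
package main

import "fmt"

// throttler is a hypothetical, simplified model of the retry throttling
// token bucket. It is NOT the grpc-go implementation.
type throttler struct {
	maxTokens  float64 // absolute bucket size
	tokenRatio float64 // tokens added per successful RPC
	tokens     float64 // current token_count, kept in [0, maxTokens]
}

// newThrottler starts with a full bucket.
func newThrottler(maxTokens, tokenRatio float64) *throttler {
	return &throttler{maxTokens: maxTokens, tokenRatio: tokenRatio, tokens: maxTokens}
}

// onFailure removes one token, never going below zero.
func (t *throttler) onFailure() {
	t.tokens--
	if t.tokens < 0 {
		t.tokens = 0
	}
}

// onSuccess adds tokenRatio tokens, capped at maxTokens.
func (t *throttler) onSuccess() {
	t.tokens += t.tokenRatio
	if t.tokens > t.maxTokens {
		t.tokens = t.maxTokens
	}
}

// retryAllowed reports whether token_count is above the threshold maxTokens/2.
func (t *throttler) retryAllowed() bool {
	return t.tokens > t.maxTokens/2
}

func main() {
	th := newThrottler(10, 0.1)
	// A burst of failures drains the bucket toward the threshold.
	for i := 1; i <= 6; i++ {
		th.onFailure()
		fmt.Printf("after %d failures: tokens=%.1f retryAllowed=%v\n",
			i, th.tokens, th.retryAllowed())
	}
}
```

In this model the relative part seems to be tokenRatio: the bucket only refills on average when failures stay below roughly tokenRatio times successes. maxTokens instead controls how many failures in a row (about maxTokens / 2 from a full bucket) are tolerated before retries stop, which is why I can't see how to express it relative to traffic.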

I'm new to gRPC, so please let me know if there are additional docs or resources I've missed which make this clearer.

Thanks in advance!

roanta, Aug 19 '24 21:08