gateway
gateway copied to clipboard
Adding requestExecTime to requestOptions in the context
This improves response time metrics for streaming and cache use cases, without retries.
We can add metrics like time to first token also here.
Closing this in favour of another PR with a larger scope