azure-functions-host icon indicating copy to clipboard operation
azure-functions-host copied to clipboard

Reliability improvements for worker-host communication

Open liliankasem opened this issue 3 years ago • 0 comments

We have seen many CRIs where out-of-proc language workers fail to start or call back to the host within the defined timeout period. We need to identify the root-cause and fix all the potential reasons, however currently this is very difficult as we are dealing with a long tail of various issues from different components and levels. To do this, we need to improve the reliability of host-worker communication and fix potential bugs.

This epic will keep track of issues that could help us improve worker-host reliability:

  • [x] #6704
  • [x] #4076
  • [ ] #7462
  • [ ] #7292

Protobuf Message Implementations

  • [ ] #8164
  • [ ] #2308
  • [x] #2152

Logging

  • [ ] #6879

Surface Customer Issues

  • [x] #8025
  • [x] #4296
  • [ ] https://github.com/Azure/azure-functions-dotnet-worker/issues/846

Investigation

  • [ ] #8165

liliankasem avatar Jan 19 '22 23:01 liliankasem