azure-functions-host
azure-functions-host copied to clipboard
Reliability improvements for worker-host communication
We have seen many CRIs where out-of-proc language workers fail to start or call back to the host within the defined timeout period. We need to identify the root-cause and fix all the potential reasons, however currently this is very difficult as we are dealing with a long tail of various issues from different components and levels. To do this, we need to improve the reliability of host-worker communication and fix potential bugs.
This epic will keep track of issues that could help us improve worker-host reliability:
- [x] #6704
- [x] #4076
- [ ] #7462
- [ ] #7292
Protobuf Message Implementations
- [ ] #8164
- [ ] #2308
- [x] #2152
Logging
- [ ] #6879
Surface Customer Issues
- [x] #8025
- [x] #4296
- [ ] https://github.com/Azure/azure-functions-dotnet-worker/issues/846
Investigation
- [ ] #8165