copilot
copilot copied to clipboard
Make socket io sessions scale
Currently, the socket connections work fine when there is a single LLM server instance. Once we have multiple instances, the socket connections get lost because they are not sticky, and the load balancer routes requests to a server where the session ID is not defined.
Hey, I want to work on this. Can you assign this to me?