Ning Ren
## Why are these changes needed?

The model worker (and `vllm_worker`) fails to load [Phi-3 models](https://huggingface.co/microsoft/Phi-3-mini-128k-instruct) such as `microsoft/Phi-3-mini-128k-instruct`.

## Checks

- [x] I've run `format.sh` to lint...
### What happened?

Gemini 3.0 requires a `thought_signature` for tool calls, but litellm doesn't preserve this signature in streaming mode, causing errors on subsequent turns.

### Relevant log output ```shell...