Lee Stott

Results: 176 comments by Lee Stott

Yes, you're correct. Each image is processed individually in the loop, and the results are handled separately for each image. To combine text and image tensors, you can create a...

@hinogi please retry; the requirements have been updated.

This is being looked into.

Hi @Junaid-Nazir-828, you need to set a system prompt for Phi-4-mm-instruct (see https://github.com/microsoft/PhiCookBook/issues/277). TypeScript Example: Using Tools with Phi-4-MM-Instruct. Here's a TypeScript example showing how to use function calling...

Here are a few suggestions to help you troubleshoot and potentially resolve this issue: Check PyTorch and Transformers Versions: Ensure that you're using compatible versions of PyTorch and the Hugging...

Content is now auto-translated.

Here are a few suggestions to potentially improve the inference speed. Batch Size: Process inputs in batches where possible; processing one input at a time might be slower. Mixed Precision: Using...