Shubham Dayma

Results 2 issues of Shubham Dayma

Is there any possibilities to get this running real-time? GPU and Memory requirements to run with it's best version ? Thinking of sending audio tensor as audio input.

### What happened? I am trying to run `ollama/dolphin-phi` model on ollama but /chat/{chat_id}/question throws `{"error":"model 'llama2' not found, try pulling it first"}` error. I don't want to load `llama2`...

bug
area: backend
type: dependencies