InfamyStudio
Bumping this, still experiencing it!
Did you make any headway on improving speeds, @chenle02? This is also what we want to achieve!
> You can get faster inference using GPU offloading. llama.cpp now supports mixing CPU and GPU execution. Even LangChain mentions it. That will significantly speed up inference. Are...
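
For anyone else landing here, this is roughly what that GPU offloading looks like through LangChain's `LlamaCpp` wrapper. A minimal sketch only: the model path and parameter values are placeholders, the import path varies by LangChain version, and llama-cpp-python must be built with GPU support (e.g. cuBLAS or Metal) for `n_gpu_layers` to have any effect:

```python
# Sketch of GPU offloading via LangChain's LlamaCpp wrapper.
# Requires llama-cpp-python compiled with GPU support.
from langchain_community.llms import LlamaCpp  # older versions: langchain.llms

llm = LlamaCpp(
    model_path="./models/your-model.gguf",  # placeholder path
    n_gpu_layers=32,  # layers to offload to the GPU; -1 offloads all of them
    n_batch=512,      # tokens processed per batch; tune to your VRAM
    n_ctx=2048,       # context window
    verbose=True,     # prints timings so you can compare against CPU-only runs
)

print(llm.invoke("Why is my CPU-only inference slow?"))
```

With `verbose=True` you can watch the per-token timings and confirm whether layers are actually landing on the GPU; if speeds don't change, the wheel was likely built CPU-only.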