Daniel Hiltgen
@Wanhack others have reported that running `nvidia-modprobe -u` on your host may resolve the issue (might require a reboot)
I've updated the description above to better describe how this works. There are two layers of concurrency introduced by this change. One layer leverages the parallelism support in llama.cpp...
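The two layers can be sketched roughly like this. This is a minimal illustration under my own assumptions, not the actual implementation — every name here (`runner`, `scheduler`, `handle`, etc.) is made up. The inner layer limits parallel request slots within one loaded model; the outer layer limits how many models are resident at once.

```go
package main

import "fmt"

// runner represents one loaded model. Its buffered channel acts as a
// counting semaphore limiting parallel requests to that model
// (the inner concurrency layer, backed by llama.cpp parallelism).
type runner struct {
	model string
	slots chan struct{}
}

func newRunner(model string, parallel int) *runner {
	return &runner{model: model, slots: make(chan struct{}, parallel)}
}

func (r *runner) handle(prompt string) string {
	r.slots <- struct{}{}        // take a slot (blocks when all are busy)
	defer func() { <-r.slots }() // free it when the request completes
	return fmt.Sprintf("completed %q on %s", prompt, r.model)
}

// scheduler caps how many models are loaded at once
// (the outer concurrency layer).
type scheduler struct {
	maxLoaded int
	loaded    map[string]*runner
}

func (s *scheduler) get(model string, parallel int) (*runner, bool) {
	if r, ok := s.loaded[model]; ok {
		return r, true
	}
	if len(s.loaded) >= s.maxLoaded {
		return nil, false // the real server would queue or evict here
	}
	r := newRunner(model, parallel)
	s.loaded[model] = r
	return r, true
}

func main() {
	s := &scheduler{maxLoaded: 1, loaded: map[string]*runner{}}
	r, _ := s.get("llama3", 2)
	fmt.Println(r.handle("hello"))
	_, ok := s.get("mistral", 2)
	fmt.Println("second model loaded:", ok) // false: at the loaded-model limit
}
```

In the real server the outer layer also has to account for available VRAM when deciding whether another model fits, which this toy ignores.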
@alexander-potemkin we don't currently have any limit on the number of client connections. I don't believe we've wired up any sort of expiration/timeout setting on the server side, although...
A slight correction to the above: I just updated the implementation for concurrent requests to a single model to use a semaphore package that services blocked requests in FIFO order, so that...
Note for people following along: I've adjusted the defaults so this PR now mimics the current behavior of a single request at a time, and a single model at a time,...
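For anyone testing this branch, opting back into concurrency might look something like the following. The variable names are my reading of the settings discussed in this PR — check the PR description for the authoritative names and defaults:

```shell
# Hypothetical opt-in: allow 4 parallel requests per loaded model and
# up to 2 models resident at once. The defaults preserve today's
# behavior: one request at a time, one model at a time.
export OLLAMA_NUM_PARALLEL=4
export OLLAMA_MAX_LOADED_MODELS=2
ollama serve
```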
@artem-zinnatullin thanks for giving it a try! We've been making minor fixes to the memory prediction on main, which I've been rebasing into this PR. I've got a Windows test...
I wasn't able to reproduce.

```
% system_profiler SPSoftwareDataType SPHardwareDataType
Software:

    System Software Overview:

      System Version: macOS 10.15.7 (19H2026)
      Kernel Version: Darwin 19.6.0
      Boot Volume: ssd
      Boot Mode: Normal

Computer...
```
If you're still having trouble, please let us know.
Unfortunately it looks like your server is crashing. Can you share your server.log so we can see why?
@liquorLiu that log doesn't seem to contain a crash or any error messages. Let's take a different approach to understand what's going wrong. Please Quit the tray app,...