Hendrik comments

Results: 15 comments by Hendrik

Unable to start any VM on macOS 14.5 with M1 Max, the VMs always end up with an `unknown` status

Same problem here after a restart. Creating new instances and starting them works, but previously created instances don't start. I'm also on macOS 14.5 with an M1 Max.

Retrieve attention score for all input tokens per generated token

Thanks for clarifying!

Retrieve attention score for all input tokens per generated token

While looking around the llama.cpp project, I found [this](https://github.com/ggerganov/llama.cpp/blob/master/llama.h#L227) callback. A sample usage can be found [here](https://github.com/ggerganov/llama.cpp/blob/81d41628f90c4a2e68e6253f20cd3c46235f41a5/examples/simple/simple.cpp#L9-L34) (but the `int node_index` must be removed) and a more extensive...

Retrieve attention score for all input tokens per generated token

Oh wow, this looks great! Thank you! I will see how far I can get with it 😄

Retrieve attention score for all input tokens per generated token

Hi @reuank, I switched from llama-cpp-python to llama.cpp for other reasons and started implementing a callback that collects attention scores for the server implementation. I'm not sure how / if this may...