Martin Evans

Results 262 comments of Martin Evans

Are you _absolutely certain_ that you correctly configured the dotnet version? I've had a look through the docs and I can't see how this error could happen for you on...

> About LLamaSharp, could you point me to how to count the tokens for a given string? is there some example? If you have a `LLamaContext`: ```csharp LLamaContext context; int[]...

We had some trouble with embeddings before, I think last time we investigated it we found we got the same values from llama.cpp and there was nothing to fix on...

Q2 is a pretty small quantisation, have you tested your Q2 model in llama.cpp directly to check this isn't just a bad response caused by the quantisation?

The mixtral model you mentioned is Q8, which is much more forgiving than Q2. The smaller than number the more the model has been compressed, and the more likely it...

Definitely a good idea, are you willing to work on the modifications to the Github actions pipeline to compile the necessary binaries? That's the first step to getting this supported.

Now that the CLBLAST binaries are included in the project (or at least, will be once #479 is merged) the next step is to create a [nuspec file](https://github.com/SciSharp/LLamaSharp/tree/master/LLama/runtimes/build) and make...

According to [this comment](https://github.com/SciSharp/LLamaSharp/pull/479#issuecomment-1925250945) clblast.dll may need to be included in the distributed package.

@soyfrien @moozoo64 In general there is zero compatibility between llama.cpp versions - i.e. you can't just swap in a new binary an expect it to work. You'll probably have better...