Gorkem Ercan
Gorkem Ercan
We should add direct .java invocation to this epic too. I have it already implemented on the devtools repo. [1]. The only reason I did not PR it to here...
@lstocchi can you check this. I suspect this is due to changes to mirror site
Here is some more clarification on the requests. `POST /completion` is an API specific to llama.cpp server. There is example code available for its usage which we will adjust for...
I did a pass left a few comments.
Here is what I had so far on this https://gist.github.com/gorkem/9b3a719de32d1d6b0614f3a00c4d6b6a
I suggest we turn this into a convention rather than implementation any `code` section with `path` `README.md` that points to the root of the ModelKit should be considered the README....
GitHub does not treat READMEs differently in means of how it stores them. The special meaning is just a convention. Let's separate how a README is represented on Kitfile and...
Needs a doc update :)
Yes it does. Here are the related options. They are just inherited from `llama.cpp` ``` --lora FNAME Apply LoRA adapter (implies --no-mmap) --lora-scaled FNAME S Apply LoRA adapter with user...