Jlama
Jlama copied to clipboard
Jlama is a modern LLM inference engine for Java
Hi, I was trying this model here: https://huggingface.co/MoMonir/llava-llama-3-8b-v1_1-GGUF It also comes with some instructions on how to use it for images. Is this also possible somehow with Jlama, e.g. via...
See comments of #18
Is there a way to run and expose an API streaming server compatible with OpenAI API specifications?
Is there any plan to support GGUF format directly apart from SafeTensor, that will allow to use this to load other GGUF's. If support already exists can we add it...
To support things like json output Try https://github.com/sylvainhalle/Bullwinkle
Hello jake, I started playing around with jlama for my pet project and i have been hitting with this exception , could you please take a look ? i am...
Hi I'd love to use one model of Meta based on DEBERTA-V2 model from Microsoft. When I run the model on jlama I got a `: java.lang.IllegalArgumentException: No enum constant...