Jake Luciani

Results 96 comments of Jake Luciani

Hi, I'm working on a GPU backend see #150 ATM its CPU/SIMD Only

Can you run this with debug logging? it should show more details as to why it failed to load.

You should use `jlama chat Qwen/Qwen2-0.5B-JQ4`

Hi @Jozurf I looked into this and this requires Jlama support for unigram tokenizers. (see https://huggingface.co/learn/nlp-course/en/chapter6/7) This can be done but not as trivial as I was initially hoping

thanks for the report. I will test this on windows

Looks like an issue in the native code. Can you tell me the exact model you are using (with link) and how you are loading it in the code?