fast-llama
Any example of using a Mistral GGUF model?
What tokenizer should I use for Mistral? stories110M works fine (although it generates a lot of nonsense text), but how do I use Mixtral GGUF models and the like?