taozhiyuai

Results: 117 comments by taozhiyuai

My Modelfile works fine.

```
# Modelfile generated by "ollama show"
# To build a new Modelfile based on this one, replace the FROM line with:
# FROM llama3:8b-instruct-fp16
FROM...
```
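For reference, a sketch of how a Modelfile like that can be dumped, tweaked, and rebuilt. The model name `my-llama3` and the appended `temperature` parameter are placeholders for illustration, not part of the original comment:

```shell
# Sketch only: dump the existing Modelfile, adjust it, and rebuild the model.
# Assumes ollama is installed and llama3:8b-instruct-fp16 has been pulled.
ollama show llama3:8b-instruct-fp16 --modelfile > Modelfile

# Example tweak (placeholder): append a sampling parameter.
cat >> Modelfile <<'EOF'
PARAMETER temperature 0.7
EOF

# "my-llama3" is a placeholder name for the rebuilt model.
ollama create my-llama3 -f Modelfile
ollama run my-llama3
```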

> create from gguf 7b q4, it's the same problem while `ollama run`
>
> ```shell
> $ ollama -v
> ollama version is 0.1.32
>
> $ ollama show...
> ```

I imported the GGUF from https://hf-mirror.com/second-state/Octopus-v2-GGUF
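A rough sketch of that import path, assuming the GGUF has already been downloaded locally; the file name and model name below are placeholders:

```shell
# Sketch only: import a locally downloaded GGUF into Ollama.
# "Octopus-v2-Q4_0.gguf" is a placeholder; use the actual file downloaded from
# https://hf-mirror.com/second-state/Octopus-v2-GGUF
cat > Modelfile <<'EOF'
FROM ./Octopus-v2-Q4_0.gguf
EOF

# "octopus-v2" is a placeholder model name.
ollama create octopus-v2 -f Modelfile
ollama run octopus-v2
```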

I think it takes too long to generate tokens.

The 14B model generates only 2 actions, so I think 7B is the best model size for my laptop.

> You can change template, system and params in Modelfile, then recreate model and repush model to ollama.
>
> Or use a base model, then create a new model...
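A minimal sketch of that recreate-and-repush workflow; `your-namespace/octopus-v2` is a placeholder for an actual namespaced model name on ollama.com:

```shell
# Sketch only: after editing TEMPLATE/SYSTEM/PARAMETER lines in the Modelfile,
# rebuild the model under a namespaced name and push it back to ollama.com.
# Pushing assumes the account's public key has already been added on ollama.com.
ollama create your-namespace/octopus-v2 -f Modelfile
ollama push your-namespace/octopus-v2
```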

> > > You can change template, system and params in Modelfile, then recreate model and repush model to ollama.
> > >
> > > Or use a base model, then create...

> > > You can change template, system and params in Modelfile, then recreate model and repush model to ollama.
> > >
> > > Or use a base model, then create...

Q6_K is weird. Q4 and Q5 work fine with the same Modelfile. So strange.