Josh Leverette
### What is the issue?
[This PR](https://github.com/ggerganov/llama.cpp/pull/6920), which just merged into llama.cpp, contains important improvements to how tokenization works for Llama 3 and other models. An example of the issue...
This is a little something I worked up (with some help :robot:) to make my life easier as a `fish` user, in `~/.config/fish/completions/ollama.fish`:

```fish
function __ollama_list
    set -l query (string join...
```
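For readers who want to adapt the idea, here is a minimal self-contained sketch of a fish completion along the same lines. It is hypothetical, not the snippet above: the helper name `__ollama_models` is invented, and it assumes `ollama list` prints a header row followed by one model per line with the model name in the first column.

```fish
# Hypothetical sketch: collect locally installed model names,
# assuming `ollama list` output is "NAME  ID  SIZE  MODIFIED" rows.
function __ollama_models
    # Skip the header row, then keep only the first column (the model name).
    ollama list 2>/dev/null | tail -n +2 | string replace -r '\s.*' ''
end

# Offer installed model names after subcommands that take a model argument,
# and disable file completion (-f) for those positions.
complete -c ollama -n '__fish_seen_subcommand_from run rm show cp' -f -a '(__ollama_models)'
```

Saved as `~/.config/fish/completions/ollama.fish`, fish picks this up automatically the next time completions for `ollama` are requested.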
### What is the issue?
I downloaded the wrong model, ran it, realized my mistake, then deleted it, and noticed it was still listed as being present in VRAM according...
According to [this Refact blog post](https://refact.ai/blog/2023/self-hosted-15b-code-model/):

> Check out the [docs on self-hosting](https://github.com/smallcloudai/refact-self-hosting) to get your AI code assistant up and running.
>
> To run StarCoder using 4-bit quantization, you’ll...
I have seen some other talk of memory leaks (#390), but I'm having a more sporadic, shorter-term issue. I've experienced this on both an RTX 4070 with 12GB VRAM...
Since Phi-3 mini did so well on the leaderboard, it would be interesting to see where the new small and medium models land. With Phi-3 vision, it also seems like...
### Before submitting your bug report
- [X] I believe this is a bug. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions
- [X] I'm not able to find...