rabidcopy
> I have suggested a change at [rabidcopy#2](https://github.com/rabidcopy/llama.cpp/pull/2) (please check it out @rabidcopy) that would return control to the user by injecting the anti-prompt instead, which should solve that problem....
Ohh, this merge conflicts with https://github.com/ggerganov/llama.cpp/pull/330. Or rather doesn't work as the antiprompt is no longer tokenized.
> > Ohh, this merge conflicts with #330. Or rather doesn't work as the antiprompt is no longer tokenized.
>
> Oops! What if we tokenize the (first) reverse prompt...
I think this works.
```
Assistant: Hi! I'm your personal assistant to answer any questions you may have!
You: Hi, sorry but I gotta go actually, unexpected plans. Goodbye!
Assistant:...
```
> @eiz It seems there is a problem with the alpaca 13B, after conversion, when loading it complains about the embedding size:
>
> ```
> main: seed = 1679320340...
> ```
I think I'm going to put together a personal hack that excludes characters like `#()\/[]{}` from sampling by piggybacking on the `--ignore-eos` code. I've been getting a lot more...
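A minimal sketch of that idea, assuming a flat logits vector indexed by token id (the function name and signature here are hypothetical, not llama.cpp's actual API): force the logits of unwanted tokens to negative infinity before sampling so they can never be picked, mirroring what the `--ignore-eos` path does for the end-of-sequence token.

```cpp
#include <vector>
#include <limits>

// Hypothetical helper: zero out the sampling probability of banned tokens
// by setting their logits to -infinity, the same trick --ignore-eos uses
// for the EOS token.
void ban_tokens(std::vector<float>& logits, const std::vector<int>& banned_ids) {
    for (int id : banned_ids) {
        logits[id] = -std::numeric_limits<float>::infinity();
    }
}
```

After this runs, softmax assigns those tokens probability zero, so any sampler (greedy, top-k, top-p) skips them.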
Haha. This is a bit overkill but it works.
```
You:Hi
Assistant: Do you need anything?
You: Can you type a backslash?
Assistant: Sure thing!
You: A forward slash?
Assistant:...
```
` -c N, --ctx_size N size of the prompt context (default: 512)` You'll want to set `-c 2048` (max recommended for LLaMA). Set `-n 2048` as well; effectively they...
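Putting those flags together, a typical interactive invocation might look like this (the model path and reverse prompt are illustrative, not from the original comment):

```shell
./main -m ./models/7B/ggml-model-q4_0.bin -c 2048 -n 2048 -i -r "You:"
```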
I think the conversion script should be included in the repository somewhere to be honest. It's worked for every old format model I've thrown at it and saved me a...
Vicuna is a pretty strict model in terms of following that ### Human/### Assistant format when compared to alpaca and gpt4all. Less flexible but fairly impressive in how it mimics...