anzz1

149 comments of anzz1

Since the alpaca.cpp project does not currently exhibit this issue, and based on when these reports started appearing, the problem can most likely be traced back to the tokenizer change and...

@Piezoid Thanks for pointing me here from the other discussion. I'll be checking your branch out and testing it. I'm also rooting for you to finish the trace tool at...

@sr-hm @botatooo The cause of this is that the point-alpaca model in question has an added "[PAD]" token, so the resulting model contains 32001 tokens, but the vocab size was...
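
The mismatch described above can be sketched as follows. This is a hypothetical illustration, not llama.cpp's actual loader code; the function name `check_vocab` and the token lists are made up for the example.

```python
# Hypothetical sketch of the vocab-size mismatch: a fine-tune adds a
# "[PAD]" token, so the tokenizer carries 32001 entries while the
# converted model's header still claims a vocab size of 32000.

def check_vocab(header_n_vocab: int, tokenizer_tokens: list) -> None:
    """Refuse to load when the header and tokenizer disagree."""
    if header_n_vocab != len(tokenizer_tokens):
        raise ValueError(
            f"vocab size mismatch: header says {header_n_vocab}, "
            f"tokenizer has {len(tokenizer_tokens)} tokens"
        )

base_vocab = [f"tok{i}" for i in range(32000)]
check_vocab(32000, base_vocab)             # consistent: loads fine

padded_vocab = base_vocab + ["[PAD]"]      # fine-tune added one token
try:
    check_vocab(32000, padded_vocab)       # now the counts disagree
except ValueError as e:
    print(e)
```

A loader that silently trusts the header instead of checking would read the extra token's embedding row as garbage, which is why an explicit check (or a header fix at conversion time) is needed.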

Reading the [data release](https://github.com/tatsu-lab/stanford_alpaca#data-release) closely:

> During inference (eg for the web demo), we use the user instruction with an empty input field (second option).

How it's currently implemented is...
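
For context, the data release distinguishes two prompt templates: one for examples with an input field and one for examples without it, the latter being what the web demo uses at inference. A small sketch of that selection logic, with the template wording paraphrased from the stanford_alpaca README (verify against the repo before relying on the exact strings):

```python
# Two Alpaca-style prompt templates; the empty-input form is the one
# the quote above says is used during inference.

PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)

PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)

def build_prompt(instruction: str, input_text: str = "") -> str:
    """Pick the template based on whether an input field is present."""
    if input_text.strip():
        return PROMPT_WITH_INPUT.format(instruction=instruction, input=input_text)
    return PROMPT_NO_INPUT.format(instruction=instruction)

print(build_prompt("Summarize the following text."))
```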

Yes, keep the core lean, portable, fast and free of dependencies while having the option of building things on top of it, as modules. This could be achieved with a...

> ...

> Again if the case is that more processes is what is wanted and an ability to share the state between them, a more general approach would be...

> You mean something like importing https://github.com/alitrack/mman-win32 ?

Nope, quite the opposite: steering clear of any non-portable code, imported libraries, or dependencies *inside* the main program, and having any functionality...

> @anzz1 This PR is not about multiprocessing or sharing memory but rather accelerating the loading of the model via a memory-mapped file (see #91 for more details).

Though...

The .pth files == ggml-f16, which both contain the full information. If you have a quantized .pt or ggml-q4_0 / q4_1, the full information is already lost, so you...
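
Why quantization is one-way can be shown with a toy round-trip: a 4-bit code has only 16 levels per block, so dequantizing cannot recover the original f16/f32 values. This is a simplified absmax scheme for illustration only, not ggml's exact q4_0 block layout.

```python
def quantize_q4(block):
    """Map floats to 4-bit integers in [-8, 7] with a per-block scale."""
    scale = max(abs(x) for x in block) / 7.0 or 1.0
    q = [max(-8, min(7, round(x / scale))) for x in block]
    return scale, q

def dequantize_q4(scale, q):
    """Best-effort reconstruction: only 16 distinct values survive."""
    return [scale * v for v in q]

orig = [0.11, -0.52, 0.98, 0.03]
scale, q = quantize_q4(orig)
restored = dequantize_q4(scale, q)

print(orig)
print(restored)  # close, but the fine detail is gone for good
```

This is why converting q4 back to f16 only yields a degraded copy, while f16 to q4 always works: the rounding step discards information that no inverse transform can restore.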

> @anzz1 Thank you for your comment. However, what if you want to study the effect of finetuning on quantized models? Or simply want to look at the distribution of...