llama2.c
Inference Llama 2 in one file of pure C
This fixes two bugs that cause unexpected behavior when the hidden dim isn't evenly divisible by the quantization group size, as in Stories42M, which has hidden dim 1376 and group...
runq.c requires hidden_dim to be evenly divisible by the quantization group size. This change enforces that condition during model export. This can also be fixed [by changing runq.c](https://github.com/karpathy/llama2.c/pull/532).
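A minimal sketch of what such an export-time check might look like, assuming a `hidden_dim` taken from the model config and a `group_size` export parameter (the fallback-by-halving behavior is an illustration, not necessarily what export.py actually does):

```python
def validate_group_size(hidden_dim: int, group_size: int) -> int:
    # Quantization operates on contiguous groups of weights, so every
    # quantized dimension must contain a whole number of groups.
    # Example: Stories42M has hidden_dim 1376; with a group size of 64,
    # 1376 % 64 == 32, so the loop below would kick in.
    while hidden_dim % group_size != 0:
        # Fall back to a smaller group size that divides evenly
        # (halving preserves power-of-two sizes, terminating at 1).
        group_size //= 2
        print(f"warning: reducing group size to {group_size} "
              f"to fit hidden_dim {hidden_dim}")
    return group_size

assert validate_group_size(1376, 64) == 32  # 1376 == 43 * 32
```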
I tried to run with this: `python3 -m train.py --compile=False --eval_iters=10 --batch_size=8` but got this error; I think it is related to my Mac, CUDA, and torch compiled mode? File...
These changes add support for training with tinyshakespeare (a change from llama2.py) and with simple blank-line-separated text.
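As a rough illustration of the blank-line-separated format, a loader might look like this (the function and its behavior are hypothetical, not taken from the PR):

```python
def load_documents(path: str) -> list[str]:
    # Treat each blank-line-separated block of text as one training document.
    with open(path, "r", encoding="utf-8") as f:
        text = f.read()
    return [doc.strip() for doc in text.split("\n\n") if doc.strip()]
```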
Thank you for this nice repo! We at Intel have created a SYCL version of it and would like to contribute it here. The SYCL code inside `/sycl` was tested...
I have tried to convert a Llama 2 model from .gguf to .bin:

```
~/llm_inferences/llama.cpp/models/meta$ ls
llama-2-7b.Q4_K_M.gguf
$ python3 export.py llama2_7b.bin --meta-llama /home/####/llm_inferences/llama.cpp/models
Traceback (most recent call last):
  File "/home/aadithya.bhat/llm_inferences/llama2.c/export.py", line 559,...
```
Is it possible to implement weight sharing between the input and output embeddings? It would save a lot of parameters for a small model!
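A common way to do this in PyTorch is to tie the two weight tensors so the output projection reuses the embedding matrix; a minimal sketch, with module names that are illustrative rather than taken from this repo's model.py:

```python
import torch
import torch.nn as nn

class TinyLM(nn.Module):
    def __init__(self, vocab_size: int, dim: int):
        super().__init__()
        self.tok_embeddings = nn.Embedding(vocab_size, dim)
        self.output = nn.Linear(dim, vocab_size, bias=False)
        # Weight tying: both tensors have shape (vocab_size, dim), so the
        # output head can alias the embedding matrix, saving
        # vocab_size * dim parameters.
        self.output.weight = self.tok_embeddings.weight

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        h = self.tok_embeddings(tokens)  # a real model runs transformer blocks here
        return self.output(h)
```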
Also, add a `.clang-format` file to set a formatting specification that requires minimal changes to the existing code.
First pull request ever! Please be kind :) I propose an implementation of the LoRA finetuning algorithm. I'm a basic user of PyTorch and a total newbie about more advanced...
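For context, the core idea of LoRA is to freeze a pretrained weight W and learn only a low-rank update BA added on top of it. A minimal, self-contained sketch (the class name, rank, and alpha are illustrative, not the PR's actual code):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)  # freeze the pretrained weight
        # Low-rank factors: delta_W = B @ A. A starts small and B at zero,
        # so training begins exactly at the pretrained model's output.
        self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

# Usage: wrap an existing projection, then optimize only A and B.
layer = LoRALinear(nn.Linear(288, 288))
```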