llama2.c
Inference Llama 2 in one file of pure C
Clone of llama2.c but updated to work with Llama 3.2 1B/3B base and instruct
The weights are natively bfloat16. Rather than convert them into float, you could just keep them as bfloat16 and convert between float and bfloat16 on the fly using a union...
Why is the termination condition of the `generate` function `next = 1` (BOS) instead of `next = 2` (EOS)?
Hi, I believe the bias is not removed in the quantize() function. That would be necessary for a symmetric Q8_0 quantization of the activations. Is it not needed? ```...
Is export.py only intended for the model format used by run.c? I use it to export an HF model to a model.bin, but it doesn't work when I use it in train.py,...