llama2.c

Inference Llama 2 in one file of pure C
Maintainer @karpathy has been inactive for many weeks; he made [a single commit a few days ago](https://github.com/karpathy/llama2.c/commit/d0237abd32e553317a2bd80ecd5d4c621ddd307a), but it seems more like a "ping signal" than a real commit. We can understand...
As a newcomer to transformers, I have some questions about the matrix multiplication operator in run.c; could the author answer them? In the forward function, it can be seen...
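For context, the operator being asked about is run.c's matmul. At the time of writing it is (modulo an OpenMP pragma) a plain matrix-vector product, roughly:

```
// From run.c (approximately): xout (d,) = W (d,n) @ x (n,)
// w is stored row-major; each output element is the dot product
// of one row of w with x.
void matmul(float* xout, float* x, float* w, int n, int d) {
    for (int i = 0; i < d; i++) {
        float val = 0.0f;
        for (int j = 0; j < n; j++) {
            val += w[i * n + j] * x[j];
        }
        xout[i] = val;
    }
}
```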
I found that the dim parameter affects the training loss, while n_layers affects the training speed. It took 30 minutes. The larger layer only...
I have trained TinyStories down to the validation loss that Karpathy achieves. I want to fine-tune on top of this now. I imagine it is like "resume", but without resetting the...
How do I save checkpoints at each step? And what is MFU?
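On the second question: in karpathy's train.py, mfu stands for Model FLOPs Utilization, the metric from the PaLM paper: the fraction of the hardware's peak FLOP/s that a training step actually achieves. Below is a minimal sketch of the estimate using the common ~6N FLOPs-per-token approximation for a forward+backward pass (train.py also adds an attention term; all constants here are illustrative assumptions, not measurements):

```
#include <stdio.h>

// Hypothetical helper: MFU = achieved FLOP/s / peak FLOP/s.
// flops_per_iter ~= 6 * n_params * tokens_per_iter for fwd+bwd.
double estimate_mfu(double n_params, double tokens_per_iter,
                    double dt_seconds, double peak_flops) {
    double flops_per_iter = 6.0 * n_params * tokens_per_iter;
    return (flops_per_iter / dt_seconds) / peak_flops;
}

int main(void) {
    // Illustrative numbers: a 15M-param model, 1,024 tokens/iter,
    // 0.5 s/iter, on hardware with a 312 TFLOP/s bf16 peak.
    printf("MFU: %.4f\n", estimate_mfu(15e6, 1024.0, 0.5, 312e12));
    return 0;
}
```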
Overriding: compile = False
Overriding: eval_iters = 1
Overriding: batch_size = 1
tokens per iteration will be: 1,024
breaks down as: 4 grad accum steps * 1 processes * 1...
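(The breakdown presumably follows train.py's tokens_per_iter = gradient_accumulation_steps * ddp_world_size * batch_size * max_seq_len; the truncated line would be consistent with 4 * 1 * 1 * 256 = 1,024, i.e. a max_seq_len of 256, though the last factors are cut off above.)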
One potential optimization is to use a library such as OpenBLAS or Intel's MKL to perform the matrix multiplication in the matmul function. ``` #include <cblas.h> void matmul(float* xout, float* x,...
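For reference, a self-contained version of that idea might look like the sketch below. It assumes the CBLAS interface (link against OpenBLAS with -lopenblas, or use MKL's link line); it is not the code from the issue itself:

```
#include <cblas.h>

// Drop-in replacement for run.c's matmul using a BLAS
// matrix-vector multiply: xout (d,) = W (d,n) @ x (n,).
void matmul(float* xout, float* x, float* w, int n, int d) {
    cblas_sgemv(CblasRowMajor, CblasNoTrans, d, n,
                1.0f, w, n, x, 1, 0.0f, xout, 1);
}
```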
In https://github.com/karpathy/llama2.c/pull/395 support was added to export the model to work with HuggingFace transformers. However, this only works for the model, not the tokenizer, so it only works with the...
I want to convert this small 1.1B Llama 2-architecture model [PY007/TinyLlama-1.1B-intermediate-step-240k-503b](https://huggingface.co/PY007/TinyLlama-1.1B-intermediate-step-240k-503b) to the llama2.c format. (Layers: 22, Heads: 32, Query Groups: 4, Embedding Size: 2048, Intermediate Size (SwiGLU): 5632) Then I...
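For anyone attempting such a conversion: a llama2.c checkpoint begins with run.c's Config header, and the TinyLlama hyperparameters map onto it roughly as follows (field names are from run.c; reading "Query Groups" as the number of key/value heads in grouped-query attention is my assumption):

```
typedef struct {
    int dim;        // transformer dim: 2048 (Embedding Size)
    int hidden_dim; // FFN hidden dim: 5632 (Intermediate Size, SwiGLU)
    int n_layers;   // 22 (Layers)
    int n_heads;    // number of query heads: 32 (Heads)
    int n_kv_heads; // number of key/value heads: 4 (Query Groups)
    int vocab_size; // vocabulary size of the tokenizer
    int seq_len;    // max sequence length
} Config;
```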
Is it possible to increase or decrease the size of only some of the layers of the model structure?