stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
I have also tried switching to a smaller model, for example `gpt2-small-chinese-cluecorpussmall` from `https://huggingface.co/`, but I get the same error. ``` (lmflow) [root@a4113ca43b08 LMFlow-main]# ./scripts/run_finetune.sh [2023-04-15 13:15:13,800] [WARNING] [runner.py:186:fetch_hostfile] Unable to find hostfile, will proceed with training with local resources only. [2023-04-15 13:15:13,815] [INFO] [runner.py:550:main] cmd = /home/minicoda3/envs/lmflow/bin/python -u...
Hi, I am trying to train the `llama-7b-hf` model on a single GPU. I tried reducing some parameters, but I am not sure whether my choices are good ones. Components of my PC...
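In case it helps, here is a minimal sketch, assuming the standard Hugging Face `transformers.TrainingArguments` that the fine-tuning script builds on, of the flags people usually lower first to fit a 7B model on one GPU. The output path and all values are illustrative assumptions, not settings verified on this repository.

```python
from transformers import TrainingArguments

# Memory-saving knobs for a single-GPU run (illustrative values only).
training_args = TrainingArguments(
    output_dir="./output",            # hypothetical output path
    per_device_train_batch_size=1,    # smallest micro-batch to limit activation memory
    gradient_accumulation_steps=32,   # recover a larger effective batch size
    gradient_checkpointing=True,      # trade extra compute for lower activation memory
    fp16=True,                        # half precision on V100-class hardware
    num_train_epochs=3,
    learning_rate=2e-5,
)
```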
Hi, I have one question: I have seen `evaluation_strategy` in the [training script](https://github.com/tatsu-lab/stanford_alpaca#fine-tuning), but I do not know what that parameter means. Does that mean if I set...
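For what it's worth, `evaluation_strategy` comes from the standard Hugging Face `TrainingArguments` that the script forwards its flags to: it controls when the `Trainer` runs evaluation on the eval dataset. A small sketch, with assumed values:

```python
from transformers import TrainingArguments

# evaluation_strategy selects when evaluation runs:
#   "no"    - never evaluate during training
#   "steps" - evaluate every `eval_steps` optimizer steps
#   "epoch" - evaluate at the end of each epoch
training_args = TrainingArguments(
    output_dir="./output",        # hypothetical output path
    evaluation_strategy="steps",  # evaluate periodically during training
    eval_steps=200,               # run evaluation every 200 steps (illustrative)
)
```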
Hi, I have one question: how can I add a validation dataset, or record the validation loss to Weights & Biases, during the [fine-tuning process](https://github.com/tatsu-lab/stanford_alpaca#fine-tuning)? Thanks in advance.
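A hedged sketch, assuming the plain Hugging Face `Trainer` API underneath the script: pass a validation split as `eval_dataset` and set `report_to="wandb"` so the eval loss is logged to Weights & Biases alongside the training loss. Here `model`, `train_dataset`, and `val_dataset` are placeholders for objects built elsewhere in the fine-tuning script.

```python
from transformers import Trainer, TrainingArguments

training_args = TrainingArguments(
    output_dir="./output",            # hypothetical output path
    evaluation_strategy="steps",      # evaluate periodically during training
    eval_steps=200,
    report_to="wandb",                # send train/eval metrics to Weights & Biases
)

trainer = Trainer(
    model=model,                      # placeholder: the model being fine-tuned
    args=training_args,
    train_dataset=train_dataset,      # placeholder: the instruction-tuning data
    eval_dataset=val_dataset,         # placeholder: the held-out validation split
)
trainer.train()                       # eval loss is logged every eval_steps steps
```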
https://alpaca-ai.ngrok.io/
I use five V100-32G GPUs to fine-tune llama-7b and get an OOM error every time. Here is the error message: torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 388.00...
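One common workaround on 32 GB cards is DeepSpeed ZeRO-3 with CPU offload. Below is a hedged sketch of such a config, written out from Python; the values are illustrative assumptions, not settings verified on this repository.

```python
import json

# A ZeRO-3 + CPU-offload DeepSpeed config (illustrative values only).
ds_config = {
    "zero_optimization": {
        "stage": 3,                              # shard params, gradients, and optimizer state
        "offload_optimizer": {"device": "cpu"},  # move optimizer state off the GPU
        "offload_param": {"device": "cpu"},      # move parameters to CPU when not in use
    },
    "bf16": {"enabled": False},                  # V100s lack bf16 support
    "fp16": {"enabled": True},
    "train_micro_batch_size_per_gpu": 1,
    "gradient_accumulation_steps": 16,
}

with open("ds_zero3_offload.json", "w") as f:
    json.dump(ds_config, f, indent=2)            # pass this file to the trainer's --deepspeed flag
```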
Hi, I have one question about the fine-tuning. I have seen the fine-tuning hyperparameters for training Alpaca-7B and Alpaca-13B, but I am confused about the batch size and the number of epochs. From this...
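As I read the README's 7B command, the "batch size" in the hyperparameter table is the global (effective) batch size, i.e. the per-device batch size multiplied by the gradient accumulation steps and the number of GPUs. A small arithmetic sketch; the specific numbers are my reading of the 7B example and should be treated as an assumption:

```python
# Effective (global) batch size from the per-device settings.
num_gpus = 4
per_device_train_batch_size = 4
gradient_accumulation_steps = 8

effective_batch_size = num_gpus * per_device_train_batch_size * gradient_accumulation_steps
print(effective_batch_size)  # 128, matching the batch size listed in the hyperparameter table
```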
1. The LLaMA weight conversion script at https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/convert_llama_weights_to_hf.py#L101 seems to have something wrong. There is a function called permute, which is called on each q_proj/k_proj weight ```python def permute(w): return...
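For reference, here is a sketch of the permute helper approximately as it appears in that conversion script; `n_heads` and `dim` are set to llama-7b-style values here only so the snippet runs on its own.

```python
import torch

n_heads = 32   # assumed llama-7b head count
dim = 4096     # assumed llama-7b hidden size

def permute(w):
    # Reorder the rows of q_proj/k_proj so that the original LLaMA checkpoint's
    # interleaved rotary-embedding layout matches the half-split layout used by
    # the Hugging Face Llama implementation. Values are reordered, not changed.
    return w.view(n_heads, dim // n_heads // 2, 2, dim).transpose(1, 2).reshape(dim, dim)

w = torch.randn(dim, dim)
print(permute(w).shape)  # torch.Size([4096, 4096])
```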
Hi, do you mind sharing the code used to plot the figure? I found the original notebook but was unable to reproduce it. Thanks!