Tungsong

6 issues by Tungsong

**I trained an Alpaca-LoRA model with these params:**

- base_model: /usr/local/dbbd/model/llama-7b-hf
- data_path: alpaca_data.json
- output_dir: ./lora-alpaca
- batch_size: 128
- micro_batch_size: 4
- num_epochs: 2
- learning_rate: 0.0001
- cutoff_len: 512
- val_set_size: 2000
- lora_r: 8
- lora_alpha: 16
- lora_dropout: 0.05
- ...
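
For reference, a minimal sketch of how these hyperparameters map onto a Hugging Face `peft` LoRA setup; the `target_modules` list is an assumption (alpaca-lora typically targets the LLaMA attention projections), not a confirmed config:

```python
# Sketch only: wrap LLaMA with a LoRA adapter via peft.
# Assumes `pip install peft transformers`.
from peft import LoraConfig, get_peft_model
from transformers import LlamaForCausalLM

model = LlamaForCausalLM.from_pretrained("/usr/local/dbbd/model/llama-7b-hf")

lora_config = LoraConfig(
    r=8,                                  # lora_r
    lora_alpha=16,                        # scaling numerator for the update
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # assumption: attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```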

I have two T4 GPUs on my machine and want to improve training efficiency, since there is plenty of spare memory when I use the default params. ![企业微信截图_16814425311604](https://user-images.githubusercontent.com/35001280/231933309-606a555a-b870-4c9b-b864-c51320895cbb.png) I tried to update...
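
In case it helps others with the same setup: under data parallelism across the two T4s, the effective batch stays fixed if gradient accumulation is divided by the world size. A sketch, assuming an alpaca-lora-style `finetune.py` launched with `torchrun --nproc_per_node=2` (which sets `WORLD_SIZE` automatically):

```python
import os

# How alpaca-lora-style scripts derive gradient accumulation:
# effective batch = micro_batch_size * accumulation_steps * n_gpus.
batch_size = 128
micro_batch_size = 4
gradient_accumulation_steps = batch_size // micro_batch_size  # 32 on one GPU

# With DDP, each of the two T4s processes its own micro-batches, so
# accumulation shrinks by the world size to keep the effective batch at 128.
world_size = int(os.environ.get("WORLD_SIZE", 1))  # set by torchrun
if world_size > 1:
    gradient_accumulation_steps //= world_size  # 16 per GPU with two T4s
```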

After fine-tuning, the generated answer keeps repeating the same sentence, and it isn't actually an answer. The fine-tuning followed https://github.com/27182812/ChatGLM-LLaMA-chinese-insturct and ran for 3 epochs; the results look like this. Does anyone know why this happens? ![企业微信截图_16810901118715](https://user-images.githubusercontent.com/35001280/230808375-697f6a51-2b36-4daf-bfc3-69d5518e994c.png)
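
Not an answer to the root cause, but one common mitigation at inference time is penalizing repetition in `generate`. A sketch against the Hugging Face `transformers` API; the model path is a placeholder:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("path/to/finetuned-model")  # placeholder
model = AutoModelForCausalLM.from_pretrained("path/to/finetuned-model")

inputs = tokenizer("你好,请介绍一下你自己。", return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.7,
    repetition_penalty=1.2,  # discourage re-sampling recent tokens
    no_repeat_ngram_size=4,  # forbid verbatim 4-gram loops
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Repetition after instruction tuning is also often a prompt-template mismatch: it is worth checking that inference uses exactly the same instruction format the training data was built with.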

Over N loop iterations, does this get initialized N times or just once, and how is it initialized? I don't quite understand where this α comes from or what it does. ![企业微信截图_16904561135872](https://github.com/PKU-YuanGroup/ChatLaw/assets/35001280/c99f2fd9-9955-4738-987e-f198b4ba7b8d)
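
For context on the α question: in LoRA the adapter weights are initialized once, when the layer is constructed, not once per loop iteration, and `lora_alpha` is just a fixed scaling constant on the low-rank update, ΔW = (α/r)·B·A. A minimal sketch (the names are mine, not from the screenshot):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen base weight plus a low-rank update scaled by alpha / r."""

    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        self.base.weight.requires_grad_(False)  # base weights stay frozen
        # Initialized exactly once here, at construction time:
        self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)  # small random values
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))  # zeros, so ΔW = 0 at start
        self.scaling = alpha / r  # α only sets the magnitude of the update

    def forward(self, x):
        return self.base(x) + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling
```

Because `lora_B` starts at zero, the model's output is unchanged at step 0, and raising α relative to r simply amplifies how strongly the learned update perturbs the frozen weights.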

Multi-GPU inference followed the official ChatGLM-6B multi-GPU deployment guide: https://github.com/THUDM/ChatGLM-6B#%E5%A4%9A%E5%8D%A1%E9%83%A8%E7%BD%B2
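
For anyone following the same link: the approach there shards the transformer layers across GPUs. Per that README it looks roughly like this (a sketch; `load_model_on_gpus` comes from the `utils.py` shipped in the THUDM/ChatGLM-6B repo, so run it from that repo's root):

```python
from transformers import AutoTokenizer
from utils import load_model_on_gpus  # utils.py from THUDM/ChatGLM-6B

tokenizer = AutoTokenizer.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True)
# Splits the model's layers evenly across the available GPUs.
model = load_model_on_gpus("THUDM/chatglm-6b", num_gpus=2)
model = model.eval()

response, history = model.chat(tokenizer, "你好", history=[])
print(response)
```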

I don't understand what this line of code does, and it throws an error on import. ![image](https://github.com/FlagOpen/FlagEmbedding/assets/35001280/7c79bf83-5715-4387-8ca8-3ec70a403305)