Roya
I don't know if you've ever encountered this issue in your training: the loss increased but all the other metrics decreased. It's weird... I don't know whether it affects convergence.
Hello!!! I just can't replicate your result on SNIPS when using BERT + CRF. Is the baseline bert-base-uncased, or are there any tricks in the training implementation?
I want to ask why the loss randomly becomes 0 under the same experimental setting when running finetune_toy_low_resource.sh in the ltu_as code. Have you ever met this problem? Do you...
I'd like to ask whether there are stage-training sh scripts for the low-resource setting. It seems there are only low-resource training sh scripts for fine-tuning on the toy data.
I want to ask where I can download the Whisper model file ending with .pt.
I notice a lot of sh scripts in the train_scripts directory. I want to ask what the differences between them are.
There is no code related to training. Is this code incomplete, or are there no plans to open-source the training code?
DeepSpeed throws an error
Hi, I changed the model in the sh script to a LLaMA that hasn't been instruction-tuned, but I got an error when running torchrun. How can I resolve this?