An Xiao
An Xiao
> For the miss weights, I think it may caused by the transformer pkg version. I update it from 4.31.0 to 4.33.2 and solved. @ZizhenWang I faced the problem like...
@martinakaduc Thank you very much for your prompt reply! Below is my pretraining script. The --model_name_or_path is the model I downloaded from HF mistralai/Mixtral-8x7B-v0.1. Despite the warnings, running this script...
@martinakaduc Thank you! I will try it now and find out the problem!
@martinakaduc Hi, I'm using your pretrained MixSUraV model downloaded from HF to finetune on my own dataset. The script I use is like Figure 1, is it correct? If correct,...
> Hi, how do you know the training was effecitve? Did you use the default training setting? I LoRA with default parameters and basically no improvement. @fisher75 I have LoRA...
@fisher75 Sure, e-mail me your WeChat ID is ok.