FHL1998

Results 3 issues of FHL1998

Traceback (most recent call last): File "qlora.py", line 791, in train() File "qlora.py", line 636, in train tokenizer = AutoTokenizer.from_pretrained( File "/opt/conda/envs/env/lib/python3.8/site-packages/transformers/models/auto/tokenization_auto.py", line 635, in from_pretrained tokenizer_class = tokenizer_class_from_name(tokenizer_class_name) File...

A weird thing happens when I inspect the val_loss, it decreases and then increases in a strange way. Does anyone have an idea of solving it? ![1686276272335](https://github.com/artidoro/qlora/assets/59732331/6fb6a635-808e-4fcd-8f70-a082e24826bf)

I find a strange thing that when I set `train_on_source=Ture`, the train loss is smaller. When I set `train_on_source=False`, the evaluation will output something from the source, it seems IGNORE_INDEX...