I'm having trouble running the distill_llama.py.

Open foreverboy1989 opened this issue 1 year ago • 1 comments

I think one experiment config is missing from the experiment config folder: eval_Scrolls.yaml. In the eval phase, OurTrainer in finetune_seq2seq.py is called and uses the scrolls dataset, but nothing imports that dataset beforehand.

Traceback (most recent call last):
  File "/data1/lxw/code/lolcats/distill_llama.py", line 469, in <module>
    main()
  File "/data1/lxw/code/lolcats/distill_llama.py", line 393, in main
    final_metrics = finetune_trainer.evaluate(model, step=-1, max_batches=None, prefix='final')
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/data1/lxw/code/lolcats/src/trainer/finetune_seq2seq.py", line 68, in evaluate
    return self.eval_step(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/data1/lxw/code/lolcats/src/trainer/finetune_seq2seq.py", line 94, in eval_step
    raise ValueError("scrolls_metric is not set in the dataset")
ValueError: scrolls_metric is not set in the dataset

Nov 23 '24 20:11 foreverboy1989

hi there! sorry for the late response.

just to double-check, are you getting this error from calling the sample script like this?

python distill_llama.py --model_config distill_llama3_8b_lk_smd_wtk64_fd64_w01 \
--distill_config distill_alpaca_clean_xent0_mse1000_lr1e-2 \
--finetune_config finetune_lora_qkvo_alpaca_clean \
--eval_config eval_alpaca_clean \
...

and does the error only happen at the end of calling the script?

I'm a bit confused by the error message, but what I think is happening is that the metric is not being loaded here

to debug, are you able to:

1. Import load_metric from Hugging Face datasets? i.e.,

from datasets import load_metric

2. Run this function in dataloaders/utils/setup.py?
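For the first check, a small standalone script along these lines might help narrow things down. It is only a sketch: check_load_metric is a hypothetical helper name, not something in the repo, and it only reports whether the installed datasets package exposes load_metric. As far as I know, load_metric was deprecated in datasets 2.x and dropped in 3.0, so the installed version may matter here.

```python
import importlib


def check_load_metric():
    """Return (ok, message) describing whether datasets.load_metric is importable.

    Hypothetical helper for debugging only; it is not part of the repo.
    """
    try:
        datasets = importlib.import_module("datasets")
    except Exception as exc:  # datasets is missing or the install is broken
        return False, f"could not import datasets: {type(exc).__name__}: {exc}"

    version = getattr(datasets, "__version__", "unknown")
    if hasattr(datasets, "load_metric"):
        return True, f"datasets {version}: load_metric is available"
    return False, f"datasets {version}: load_metric is not available"


if __name__ == "__main__":
    ok, message = check_load_metric()
    print(message)
```

If this reports that load_metric is not available, it would be worth sharing the datasets version it prints, since that could explain why the metric never gets set.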

Nov 28 '24 06:11 mzio