lm-evaluation-harness icon indicating copy to clipboard operation
lm-evaluation-harness copied to clipboard

HellaSwag with UnicodeDecodeError

Open Hua-rookie opened this issue 10 months ago • 13 comments

When I was trying to evaluate HellaSwag using: lm_eval --model hf --model_args pretrained=HuggingFaceH4/zephyr-7b-beta,dtype="bfloat16" --tasks hellaswag --device cuda:0 --num_fewshot 10 --batch_size auto --trust_remote_code I met the error: File "/root/miniconda3/envs/lm_eval/lib/python3.10/site-packages/datasets/load.py", line 2587, in load_dataset builder_instance = load_dataset_builder( File "/root/miniconda3/envs/lm_eval/lib/python3.10/site-packages/datasets/load.py", line 2259, in load_dataset_builder dataset_module = dataset_module_factory( File "/root/miniconda3/envs/lm_eval/lib/python3.10/site-packages/datasets/load.py", line 1910, in dataset_module_factory raise e1 from None File "/root/miniconda3/envs/lm_eval/lib/python3.10/site-packages/datasets/load.py", line 1862, in dataset_module_factory can_load_config_from_parquet_export = "DEFAULT_CONFIG_NAME" not in f.read() File "/root/miniconda3/envs/lm_eval/lib/python3.10/codecs.py", line 322, in decode (result, consumed) = self._buffer_decode(data, self.errors, final) UnicodeDecodeError: 'utf-8' codec can't decode byte 0xb5 in position 1: invalid start byte

How can I solve this error?

Hua-rookie avatar Apr 27 '24 08:04 Hua-rookie

same problem

zjuruizhechen avatar Apr 27 '24 10:04 zjuruizhechen

encounter the same issue in local environment

PotatoBearP avatar Apr 29 '24 08:04 PotatoBearP

same issue

Shuizhimei avatar Apr 30 '24 10:04 Shuizhimei

same issue

huangwei021230 avatar May 02 '24 05:05 huangwei021230

same issue update: it is working now

cs32963 avatar May 05 '24 02:05 cs32963

Cannot initially seem to replicate on a fresh HF cache... perhaps did something wrong though? Is the connection to the HF Hub working for those facing this problem?

haileyschoelkopf avatar May 06 '24 14:05 haileyschoelkopf

Cannot initially seem to replicate on a fresh HF cache... perhaps did something wrong though? Is the connection to the HF Hub working for those facing this problem?

It seems not this problem, the connection is well on my machine.

Hua-rookie avatar May 06 '24 16:05 Hua-rookie

same problem update: it is working now

savannahfan avatar May 07 '24 12:05 savannahfan

same problem update: it is working now

So what changes did you make?

Hua-rookie avatar May 08 '24 06:05 Hua-rookie

same with drop

rangehow avatar Jun 05 '24 10:06 rangehow

Waiting for solution...

sci-m-wang avatar Jun 10 '24 07:06 sci-m-wang

same problem, someone solved it?

chen1yunan avatar Jun 26 '24 13:06 chen1yunan

download dataset from hf to local,then modify yaml files will success

chen1yunan avatar Jun 27 '24 09:06 chen1yunan