ChatGLM-Tuning

datasets.builder.DatasetGenerationError: An error occurred while generating the dataset

Open cristianohello opened this issue 2 years ago • 6 comments

Traceback (most recent call last):
  File "/root/chatglm/ChatGLM-Tuning-master/tokenize_dataset_rows.py", line 53, in <module>
    main()
  File "/root/chatglm/ChatGLM-Tuning-master/tokenize_dataset_rows.py", line 46, in main
    dataset = datasets.Dataset.from_generator(
  File "/root/miniconda3/envs/chatglm20230401/lib/python3.9/site-packages/datasets/arrow_dataset.py", line 986, in from_generator
    return GeneratorDatasetInputStream(
  File "/root/miniconda3/envs/chatglm20230401/lib/python3.9/site-packages/datasets/io/generator.py", line 42, in read
    self.builder.download_and_prepare(
  File "/root/miniconda3/envs/chatglm20230401/lib/python3.9/site-packages/datasets/builder.py", line 822, in download_and_prepare
    self._download_and_prepare(
  File "/root/miniconda3/envs/chatglm20230401/lib/python3.9/site-packages/datasets/builder.py", line 1555, in _download_and_prepare
    super()._download_and_prepare(
  File "/root/miniconda3/envs/chatglm20230401/lib/python3.9/site-packages/datasets/builder.py", line 913, in _download_and_prepare
    self._prepare_split(split_generator, **prepare_split_kwargs)
  File "/root/miniconda3/envs/chatglm20230401/lib/python3.9/site-packages/datasets/builder.py", line 1396, in _prepare_split
    for job_id, done, content in self._prepare_split_single(
  File "/root/miniconda3/envs/chatglm20230401/lib/python3.9/site-packages/datasets/builder.py", line 1550, in _prepare_split_single
    raise DatasetGenerationError("An error occurred while generating the dataset") from e
datasets.builder.DatasetGenerationError: An error occurred while generating the dataset
(chatglm20230401) root@autodl-container-e82d11963c-10ece0d7:~/chatglm/ChatGLM-Tuning-master# pip list
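
The last frame raises `DatasetGenerationError(...) from e`, so the exception shown here is only a wrapper; the original error raised inside the generator is not part of the output pasted above. One way to surface it might be to run the generator function on its own, outside `datasets.Dataset.from_generator`. The following is a minimal sketch, not code from the repository: `first_failure` and `sample_rows` are hypothetical names, and `sample_rows` only stands in for whatever generator `tokenize_dataset_rows.py` actually passes in.

```python
import traceback
from typing import Any, Callable, Iterator, Optional, Tuple


def first_failure(
    generator_fn: Callable[..., Iterator[Any]], **gen_kwargs: Any
) -> Optional[Tuple[int, BaseException]]:
    """Iterate a generator directly and report the first exception it raises.

    Returns (rows_yielded_before_error, exception), or None if the
    generator runs to completion without raising.
    """
    rows_yielded = 0
    try:
        for _ in generator_fn(**gen_kwargs):
            rows_yielded += 1
    except Exception as exc:
        # Print the full, unwrapped traceback of the underlying error.
        traceback.print_exc()
        return rows_yielded, exc
    return None


def sample_rows(limit: int) -> Iterator[dict]:
    """Hypothetical stand-in generator that fails on its third row."""
    for i in range(limit):
        if i == 2:
            raise ValueError(f"bad row {i}")
        yield {"input_ids": [i]}


if __name__ == "__main__":
    print(first_failure(sample_rows, limit=5))
```

Passing the real generator function and the same `gen_kwargs` used in the script would show which row fails and why, without the `DatasetGenerationError` wrapper in the way.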

cristianohello avatar Apr 13 '23 07:04 cristianohello

  1. Manually editing the redirect link to add the port number and change the protocol to https makes the page accessible.

56warmers avatar Mar 17 '23 06:03 56warmers

Thanks for the feedback. Are your front end and back end deployed separately?

BBchicken-9527 avatar Mar 27 '23 11:03 BBchicken-9527

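> Thanks for the feedback. Are your front end and back end deployed separately?

It is an offline deployment installed with the Docker script; MySQL and Kettle both run as bundled containers.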

56warmers avatar Mar 27 '23 11:03 56warmers

Thanks for the feedback. How is the proxy in between configured? That may be related to this problem.

BBchicken-9527 avatar Mar 30 '23 03:03 BBchicken-9527

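> Thanks for the feedback. How is the proxy in between configured? That may be related to this problem.

nginx forwarding on domain + port. Specifically, the domain resolves to a public IP, the switch maps public IP + port to internal IP + port, and nginx does the final forwarding.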
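
With a forwarding chain like this, a redirect can lose its port and scheme if the application builds the target URL from the internal request rather than from the address the browser used. A minimal Python sketch of one common remedy follows. It assumes nginx has been configured to pass `X-Forwarded-Proto` and the original `Host` header, which this thread does not confirm, and `externalize_redirect` is a hypothetical helper, not part of the application being discussed.

```python
from typing import Mapping
from urllib.parse import urlsplit, urlunsplit


def externalize_redirect(location: str, headers: Mapping[str, str]) -> str:
    """Rewrite the scheme and host:port of a redirect target from proxy headers.

    Assumption: the reverse proxy sets X-Forwarded-Proto and forwards the
    original Host (or X-Forwarded-Host). Missing headers leave that part
    of the URL unchanged.
    """
    parts = urlsplit(location)
    scheme = headers.get("X-Forwarded-Proto") or parts.scheme
    netloc = (
        headers.get("X-Forwarded-Host")
        or headers.get("Host")
        or parts.netloc
    )
    return urlunsplit((scheme, netloc, parts.path, parts.query, parts.fragment))


if __name__ == "__main__":
    # Hypothetical values: an internal redirect target and the headers
    # a proxy might forward for an external https request on port 8443.
    forwarded = {"X-Forwarded-Proto": "https", "Host": "example.com:8443"}
    print(externalize_redirect("http://10.0.0.5:8080/login?next=%2F", forwarded))
```

This mirrors the manual workaround described earlier in the thread (adding the port and switching to https), applied automatically to each redirect.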

56warmers avatar Mar 30 '23 03:03 56warmers

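> Thanks for the feedback. How is the proxy in between configured? That may be related to this problem.

> nginx forwarding on domain + port. Specifically, the domain resolves to a public IP, the switch maps public IP + port to internal IP + port, and nginx does the final forwarding.

+1, https gets changed to http.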

jsummer avatar Apr 23 '23 10:04 jsummer

You have not provided feedback for more than 30 days, so we will close this issue. If you still need help, you can reopen it or submit a new issue.

github-actions[bot] avatar May 24 '23 00:05 github-actions[bot]