WeClone icon indicating copy to clipboard operation
WeClone copied to clipboard

对于PyWxDump导出的数据问题

Open datakdata opened this issue 7 months ago • 3 comments

1、选择导出的格式是csv但是实际上是xls,我已经将乱码转化为正常文本了,但是仍然是xls格式,不知道有没有问题。 2、在运行数据预处理的命令时,显示 Traceback (most recent call last): File "/root/WeClone/.venv/bin/weclone-cli", line 10, in <module> sys.exit(cli()) File "/root/WeClone/.venv/lib/python3.10/site-packages/click/core.py", line 1442, in __call__ return self.main(*args, **kwargs) File "/root/WeClone/.venv/lib/python3.10/site-packages/click/core.py", line 1363, in main rv = self.invoke(ctx) File "/root/WeClone/.venv/lib/python3.10/site-packages/click/core.py", line 1830, in invoke return _process_result(sub_ctx.command.invoke(sub_ctx)) File "/root/WeClone/.venv/lib/python3.10/site-packages/click/core.py", line 1226, in invoke return ctx.invoke(self.callback, **ctx.params) File "/root/WeClone/.venv/lib/python3.10/site-packages/click/core.py", line 794, in invoke return callback(*args, **kwargs) File "/root/WeClone/weclone/cli.py", line 26, in wrapper return func(*args, **kwargs) File "/root/WeClone/weclone/cli.py", line 47, in qa_generator processor.main() File "/root/WeClone/weclone/data/qa_generator.py", line 89, in main message_list.extend(self.group_consecutive_messages(messages=chat_messages)) File "/root/WeClone/weclone/data/qa_generator.py", line 391, in group_consecutive_messages and self.single_combine_strategy.is_same_conversation([last_msg], current_msg) File "/root/WeClone/weclone/data/strategies.py", line 30, in is_same_conversation time_diff = abs( TypeError: bad operand type for abs(): 'NaTType' 说是因为时间啥的原因,不太懂这个是什么问题,是因为导出的数据中时间列的格式不对吗?

datakdata avatar May 15 '25 13:05 datakdata

第一个问题 乱码是excel的原因,不用转,你不转试试

xming521 avatar May 15 '25 14:05 xming521

第一个问题 乱码是excel的原因,不用转,你不转试试

好的多谢,数据没问题了,但是出现了新的问题。还是在数据预处理的步骤, ERROR 05-16 12:17:05 [core.py:343] EngineCore hit an exception: Traceback (most recent call last): ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/v1/engine/core.py", line 335, in run_engine_core ERROR 05-16 12:17:05 [core.py:343] engine_core = EngineCoreProc(*args, **kwargs) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/v1/engine/core.py", line 290, in __init__ ERROR 05-16 12:17:05 [core.py:343] super().__init__(vllm_config, executor_class, log_stats) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/v1/engine/core.py", line 60, in __init__ ERROR 05-16 12:17:05 [core.py:343] self.model_executor = executor_class(vllm_config) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/executor/executor_base.py", line 52, in __init__ ERROR 05-16 12:17:05 [core.py:343] self._init_executor() ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/executor/uniproc_executor.py", line 47, in _init_executor ERROR 05-16 12:17:05 [core.py:343] self.collective_rpc("load_model") ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/executor/uniproc_executor.py", line 56, in collective_rpc ERROR 05-16 12:17:05 [core.py:343] answer = run_method(self.driver_worker, method, args, kwargs) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/utils.py", line 2255, in run_method ERROR 05-16 12:17:05 [core.py:343] return func(*args, **kwargs) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/v1/worker/gpu_worker.py", line 136, in load_model ERROR 05-16 12:17:05 [core.py:343] self.model_runner.load_model() ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/v1/worker/gpu_model_runner.py", line 1177, in load_model ERROR 05-16 12:17:05 [core.py:343] self.model = get_model(vllm_config=self.vllm_config) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/model_loader/__init__.py", line 14, in get_model ERROR 05-16 12:17:05 [core.py:343] return loader.load_model(vllm_config=vllm_config) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/model_loader/loader.py", line 444, in load_model ERROR 05-16 12:17:05 [core.py:343] loaded_weights = model.load_weights( ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/models/qwen2.py", line 490, in load_weights ERROR 05-16 12:17:05 [core.py:343] return loader.load_weights(weights) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/models/utils.py", line 235, in load_weights ERROR 05-16 12:17:05 [core.py:343] autoloaded_weights = set(self._load_module("", self.module, weights)) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/models/utils.py", line 196, in _load_module ERROR 05-16 12:17:05 [core.py:343] yield from self._load_module(prefix, ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/models/utils.py", line 173, in _load_module ERROR 05-16 12:17:05 [core.py:343] loaded_params = module_load_weights(weights) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/models/qwen2.py", line 363, in load_weights ERROR 05-16 12:17:05 [core.py:343] for name, loaded_weight in weights: ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/models/utils.py", line 107, in <genexpr> ERROR 05-16 12:17:05 [core.py:343] (("" if len(parts) == 1 else parts[1], weights_data) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/models/utils.py", line 98, in <genexpr> ERROR 05-16 12:17:05 [core.py:343] weights_by_parts = ((weight_name.split(".", 1), weight_data) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/model_loader/loader.py", line 420, in _get_all_weights ERROR 05-16 12:17:05 [core.py:343] yield from self._get_weights_iterator(primary_weights) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/model_loader/loader.py", line 403, in <genexpr> ERROR 05-16 12:17:05 [core.py:343] return ((source.prefix + name, tensor) ERROR 05-16 12:17:05 [core.py:343] File "/root/autodl-tmp/WeClone/.venv/lib/python3.10/site-packages/vllm/model_executor/model_loader/weight_utils.py", line 441, in safetensors_weights_iterator ERROR 05-16 12:17:05 [core.py:343] with safe_open(st_file, framework="pt") as f: ERROR 05-16 12:17:05 [core.py:343] safetensors_rust.SafetensorError: Error while deserializing header: MetadataIncompleteBuffer ERROR 05-16 12:17:05 [core.py:343] CRITICAL 05-16 12:17:05 [core_client.py:269] Got fatal signal from worker processes, shutting down. See stack trace above for root cause issue. Killed 我先去问了一下ai,说可能的错误是 1、内存或者显存不足,我租用的服务器是24g显存,120g内存,有一张4090显卡 2、safetensors版本不兼容,但是我更新之后仍然没有解决问题 3、只剩下一个下载模型时文件损坏的情况我不知道如何尝试 请问有什么简单的解决办法吗?

datakdata avatar May 16 '25 04:05 datakdata

好像git lfs有指令能继续下,但是我不记得是哪个了

xming521 avatar May 16 '25 13:05 xming521

好像git lfs有指令能继续下,但是我不记得是哪个了 好的好的,感谢。我找到解决问题的方式了,我去对比了一下模型的文件大小,确实没下载完全,我在自己电脑上下载的只有14g左右,但是在服务器上下载的时候不知道为什么需要36g左右的空间。

datakdata avatar May 17 '25 03:05 datakdata