data-juicer icon indicating copy to clipboard operation
data-juicer copied to clipboard

配好环境之后尝试运行python tools/process_data.py --config demos/process_on_ray/configs/demo.yaml,遇到疑似卡住无任何日志输出的情况

Open butterbutterflies opened this issue 8 months ago • 6 comments

Before Asking 在提问之前

  • [x] I have read the README carefully. 我已经仔细阅读了 README 上的操作指引。

  • [x] I have pulled the latest code of main branch to run again and the problem still existed. 我已经拉取了主分支上最新的代码,重新运行之后,问题仍不能解决。

Search before asking 先搜索,再提问

  • [x] I have searched the Data-Juicer issues and found no similar questions. 我已经在 issue列表 中搜索但是没有发现类似的问题。

Question

从源码进行安装了data-juicer(python==3.10.6, Ray==2.40.0, grpcio==1.71.0),当前设备是4*T4, 24Core, 512G。

ray start --head ray status python tools/process_data.py --config demos/process_on_ray/configs/demo.yaml

命令行界面在输出如下内容之后卡住: 2025-03-18 07:47:26 | INFO | data_juicer.core.ray_executor:56 - Initing Ray ... 2025-03-18 07:47:26,492 INFO worker.py:1636 -- Connecting to existing Ray cluster at address: 10.233.65.253:6379... 2025-03-18 07:47:26,504 INFO worker.py:1812 -- Connected to Ray cluster. View the dashboard at 127.0.0.1:8265

log中也仅有如下内容: 2025-03-18 07:47:26.256 | INFO | data_juicer.config.config:config_backup:742 - Back up the input config file [/workspace/data-juicer/demos/process_on_ray/configs/demo.yaml] into the work_dir [/workspace/data-juicer/outputs/demo] 2025-03-18 07:47:26.277 | INFO | data_juicer.config.config:display_config:764 - Configuration table: 2025-03-18 07:47:26.477 | INFO | data_juicer.core.ray_executor:init:56 - Initing Ray ...

不知道是哪里的问题,十分困惑,还请解答。

Additional 额外信息

No response

butterbutterflies avatar Mar 18 '25 08:03 butterbutterflies