KAG icon indicating copy to clipboard operation
KAG copied to clipboard

[Bug] [Module Name] Vectorizer 向量化一直失败

Open wind7xl opened this issue 8 months ago • 4 comments

Search before asking

  • [x] I had searched in the issues and found no similar issues.

Operating system information

Windows

What happened

本地docker启动KAG,知识库切片后,做Vectorizer时一直失败,向量模型配置测试正常的

Image

How to reproduce

1.创建知识库 2.上传文档 3.解析切片后向量化失败

Are you willing to submit PR?

  • [ ] Yes I am willing to submit a PR!

wind7xl avatar Apr 30 '25 04:04 wind7xl

Search before asking

  • [x] I had searched in the issues and found no similar issues.

Operating system information

Windows

What happened

本地docker启动KAG,知识库切片后,做Vectorizer时一直失败,向量模型配置测试正常的

Image ### How to reproduce 1.创建知识库 2.上传文档 3.解析切片后向量化失败

Are you willing to submit PR?

  • [ ] Yes I am willing to submit a PR!

You can refer to KAG FAQ to reset your embedding model conf:

Image

caszkgui avatar Apr 30 '25 06:04 caszkgui

docker 模式下始终出错,在开发模式下,可以跑通;

Vectorizer(执行异常) 已调度次数 :8耗时 :3min 32s

2025-05-02 16:11:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:11:23(172.28.0.5): Lock released successfully! 2025-05-02 16:11:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)

2025-05-02 16:11:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:23(172.28.0.5): Lock preempted successfully!

2025-05-02 16:11:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:11:22(172.28.0.5): Lock released successfully! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:11:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:22(172.28.0.5): Lock preempted successfully!

2025-05-02 16:10:22(172.28.0.5): Task scheduling completed. cost:1 ms ! 2025-05-02 16:10:22(172.28.0.5): Lock released successfully! 2025-05-02 16:10:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:10:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:10:22(172.28.0.5): Lock preempted successfully!

2025-05-02 16:09:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:09:23(172.28.0.5): Lock released successfully! 2025-05-02 16:09:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)

2025-05-02 16:09:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:23(172.28.0.5): Lock preempted successfully!

2025-05-02 16:09:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:09:22(172.28.0.5): Lock released successfully! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:09:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:22(172.28.0.5): Lock preempted successfully!

2025-05-02 16:08:22(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:22(172.28.0.5): Lock released successfully! 2025-05-02 16:08:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:22(172.28.0.5): Lock preempted successfully!

2025-05-02 16:08:02(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:02(172.28.0.5): Lock released successfully! 2025-05-02 16:08:02(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)

2025-05-02 16:08:02(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:02(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:02(172.28.0.5): Lock preempted successfully!

2025-05-02 16:08:01(172.28.0.5): Task scheduling completed. cost:7 ms ! 2025-05-02 16:08:01(172.28.0.5): Lock released successfully! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task created successfully! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:01(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task has not been created yet! 2025-05-02 16:08:01(172.28.0.5): Lock preempted successfully!

Alignment(等待执行) 已调度次数 :0

Writer(等待执行) 已调度次数 :0

charlieliu9999 avatar May 02 '25 08:05 charlieliu9999

docker 模式下始终出错,在开发模式下,可以跑通;

Vectorizer(执行异常) 已调度次数 :8耗时 :3min 32s

2025-05-02 16:11:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:11:23(172.28.0.5): Lock released successfully! 2025-05-02 16:11:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)

2025-05-02 16:11:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:23(172.28.0.5): Lock preempted successfully!

2025-05-02 16:11:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:11:22(172.28.0.5): Lock released successfully! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:11:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:22(172.28.0.5): Lock preempted successfully!

2025-05-02 16:10:22(172.28.0.5): Task scheduling completed. cost:1 ms ! 2025-05-02 16:10:22(172.28.0.5): Lock released successfully! 2025-05-02 16:10:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:10:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:10:22(172.28.0.5): Lock preempted successfully!

2025-05-02 16:09:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:09:23(172.28.0.5): Lock released successfully! 2025-05-02 16:09:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)

2025-05-02 16:09:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:23(172.28.0.5): Lock preempted successfully!

2025-05-02 16:09:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:09:22(172.28.0.5): Lock released successfully! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:09:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:22(172.28.0.5): Lock preempted successfully!

2025-05-02 16:08:22(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:22(172.28.0.5): Lock released successfully! 2025-05-02 16:08:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:22(172.28.0.5): Lock preempted successfully!

2025-05-02 16:08:02(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:02(172.28.0.5): Lock released successfully! 2025-05-02 16:08:02(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)

2025-05-02 16:08:02(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:02(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:02(172.28.0.5): Lock preempted successfully!

2025-05-02 16:08:01(172.28.0.5): Task scheduling completed. cost:7 ms ! 2025-05-02 16:08:01(172.28.0.5): Lock released successfully! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task created successfully! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:01(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task has not been created yet! 2025-05-02 16:08:01(172.28.0.5): Lock preempted successfully!

Alignment(等待执行) 已调度次数 :0

Writer(等待执行) 已调度次数 :0

这通常是由于您填了无法从容器无法 ping 通的向量服务 URL。

The reason might be you specified an embedding service url which can you can not ping from the docker container.

xionghuaidong avatar May 20 '25 08:05 xionghuaidong

docker 模式下始终出错,在开发模式下,可以跑通; Vectorizer(执行异常) 已调度次数 :8耗时 :3min 32s 2025-05-02 16:11:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:11:23(172.28.0.5): Lock released successfully! 2025-05-02 16:11:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750) 2025-05-02 16:11:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:23(172.28.0.5): Lock preempted successfully! 2025-05-02 16:11:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:11:22(172.28.0.5): Lock released successfully! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:11:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:22(172.28.0.5): Lock preempted successfully! 2025-05-02 16:10:22(172.28.0.5): Task scheduling completed. cost:1 ms ! 2025-05-02 16:10:22(172.28.0.5): Lock released successfully! 2025-05-02 16:10:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:10:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:10:22(172.28.0.5): Lock preempted successfully! 2025-05-02 16:09:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:09:23(172.28.0.5): Lock released successfully! 2025-05-02 16:09:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750) 2025-05-02 16:09:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:23(172.28.0.5): Lock preempted successfully! 2025-05-02 16:09:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:09:22(172.28.0.5): Lock released successfully! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:09:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:22(172.28.0.5): Lock preempted successfully! 2025-05-02 16:08:22(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:22(172.28.0.5): Lock released successfully! 2025-05-02 16:08:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:22(172.28.0.5): Lock preempted successfully! 2025-05-02 16:08:02(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:02(172.28.0.5): Lock released successfully! 2025-05-02 16:08:02(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750) 2025-05-02 16:08:02(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:02(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:02(172.28.0.5): Lock preempted successfully! 2025-05-02 16:08:01(172.28.0.5): Task scheduling completed. cost:7 ms ! 2025-05-02 16:08:01(172.28.0.5): Lock released successfully! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task created successfully! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:01(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task has not been created yet! 2025-05-02 16:08:01(172.28.0.5): Lock preempted successfully! Alignment(等待执行) 已调度次数 :0 Writer(等待执行) 已调度次数 :0

这通常是由于您填了无法从容器无法 ping 通的向量服务 URL。

The reason might be you specified an embedding service url which can you can not ping from the docker container.

并不是这个原因,在docker里面embedding接口一样是有返回结果的

TmacChenQian avatar Jun 25 '25 07:06 TmacChenQian