[Bug] [Module Name] Vectorizer 向量化一直失败
Search before asking
- [x] I had searched in the issues and found no similar issues.
Operating system information
Windows
What happened
本地docker启动KAG,知识库切片后,做Vectorizer时一直失败,向量模型配置测试正常的
How to reproduce
1.创建知识库 2.上传文档 3.解析切片后向量化失败
Are you willing to submit PR?
- [ ] Yes I am willing to submit a PR!
Search before asking
- [x] I had searched in the issues and found no similar issues.
Operating system information
Windows
What happened
本地docker启动KAG,知识库切片后,做Vectorizer时一直失败,向量模型配置测试正常的
### How to reproduce 1.创建知识库 2.上传文档 3.解析切片后向量化失败
Are you willing to submit PR?
- [ ] Yes I am willing to submit a PR!
You can refer to KAG FAQ to reset your embedding model conf:
docker 模式下始终出错,在开发模式下,可以跑通;
Vectorizer(执行异常) 已调度次数 :8耗时 :3min 32s
2025-05-02 16:11:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:11:23(172.28.0.5): Lock released successfully! 2025-05-02 16:11:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)
2025-05-02 16:11:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:23(172.28.0.5): Lock preempted successfully!
2025-05-02 16:11:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:11:22(172.28.0.5): Lock released successfully! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:11:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:22(172.28.0.5): Lock preempted successfully!
2025-05-02 16:10:22(172.28.0.5): Task scheduling completed. cost:1 ms ! 2025-05-02 16:10:22(172.28.0.5): Lock released successfully! 2025-05-02 16:10:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:10:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:10:22(172.28.0.5): Lock preempted successfully!
2025-05-02 16:09:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:09:23(172.28.0.5): Lock released successfully! 2025-05-02 16:09:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)
2025-05-02 16:09:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:23(172.28.0.5): Lock preempted successfully!
2025-05-02 16:09:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:09:22(172.28.0.5): Lock released successfully! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:09:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:22(172.28.0.5): Lock preempted successfully!
2025-05-02 16:08:22(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:22(172.28.0.5): Lock released successfully! 2025-05-02 16:08:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:22(172.28.0.5): Lock preempted successfully!
2025-05-02 16:08:02(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:02(172.28.0.5): Lock released successfully! 2025-05-02 16:08:02(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)
2025-05-02 16:08:02(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:02(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:02(172.28.0.5): Lock preempted successfully!
2025-05-02 16:08:01(172.28.0.5): Task scheduling completed. cost:7 ms ! 2025-05-02 16:08:01(172.28.0.5): Lock released successfully! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task created successfully! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:01(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task has not been created yet! 2025-05-02 16:08:01(172.28.0.5): Lock preempted successfully!
Alignment(等待执行) 已调度次数 :0
Writer(等待执行) 已调度次数 :0
docker 模式下始终出错,在开发模式下,可以跑通;
Vectorizer(执行异常) 已调度次数 :8耗时 :3min 32s
2025-05-02 16:11:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:11:23(172.28.0.5): Lock released successfully! 2025-05-02 16:11:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)
2025-05-02 16:11:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:23(172.28.0.5): Lock preempted successfully!
2025-05-02 16:11:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:11:22(172.28.0.5): Lock released successfully! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:11:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:22(172.28.0.5): Lock preempted successfully!
2025-05-02 16:10:22(172.28.0.5): Task scheduling completed. cost:1 ms ! 2025-05-02 16:10:22(172.28.0.5): Lock released successfully! 2025-05-02 16:10:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:10:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:10:22(172.28.0.5): Lock preempted successfully!
2025-05-02 16:09:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:09:23(172.28.0.5): Lock released successfully! 2025-05-02 16:09:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)
2025-05-02 16:09:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:23(172.28.0.5): Lock preempted successfully!
2025-05-02 16:09:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:09:22(172.28.0.5): Lock released successfully! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:09:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:22(172.28.0.5): Lock preempted successfully!
2025-05-02 16:08:22(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:22(172.28.0.5): Lock released successfully! 2025-05-02 16:08:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:22(172.28.0.5): Lock preempted successfully!
2025-05-02 16:08:02(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:02(172.28.0.5): Lock released successfully! 2025-05-02 16:08:02(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750)
2025-05-02 16:08:02(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:02(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:02(172.28.0.5): Lock preempted successfully!
2025-05-02 16:08:01(172.28.0.5): Task scheduling completed. cost:7 ms ! 2025-05-02 16:08:01(172.28.0.5): Lock released successfully! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task created successfully! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:01(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task has not been created yet! 2025-05-02 16:08:01(172.28.0.5): Lock preempted successfully!
Alignment(等待执行) 已调度次数 :0
Writer(等待执行) 已调度次数 :0
这通常是由于您填了无法从容器无法 ping 通的向量服务 URL。
The reason might be you specified an embedding service url which can you can not ping from the docker container.
docker 模式下始终出错,在开发模式下,可以跑通; Vectorizer(执行异常) 已调度次数 :8耗时 :3min 32s 2025-05-02 16:11:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:11:23(172.28.0.5): Lock released successfully! 2025-05-02 16:11:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750) 2025-05-02 16:11:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:23(172.28.0.5): Lock preempted successfully! 2025-05-02 16:11:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:11:22(172.28.0.5): Lock released successfully! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:11:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:11:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:11:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:11:22(172.28.0.5): Lock preempted successfully! 2025-05-02 16:10:22(172.28.0.5): Task scheduling completed. cost:1 ms ! 2025-05-02 16:10:22(172.28.0.5): Lock released successfully! 2025-05-02 16:10:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:10:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:10:22(172.28.0.5): Lock preempted successfully! 2025-05-02 16:09:23(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:09:23(172.28.0.5): Lock released successfully! 2025-05-02 16:09:23(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750) 2025-05-02 16:09:23(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:23(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:23(172.28.0.5): Lock preempted successfully! 2025-05-02 16:09:22(172.28.0.5): Task scheduling completed. cost:4 ms ! 2025-05-02 16:09:22(172.28.0.5): Lock released successfully! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:09:22(172.28.0.5): Vectorizer task execute failed, recreating…… 2025-05-02 16:09:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:09:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:09:22(172.28.0.5): Lock preempted successfully! 2025-05-02 16:08:22(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:22(172.28.0.5): Lock released successfully! 2025-05-02 16:08:22(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:22(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:22(172.28.0.5): Lock preempted successfully! 2025-05-02 16:08:02(172.28.0.5): Task scheduling completed. cost:2 ms ! 2025-05-02 16:08:02(172.28.0.5): Lock released successfully! 2025-05-02 16:08:02(172.28.0.5): Vectorizer task trace log: pemja.core.PythonException: <class 'TypeError'>: 'NoneType' object is not iterable at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:111) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:103) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/interface/builder/base.invoke(base.py:153) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._invoke(batch_vectorizer.py:327) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.wrapped_f(init.py:338) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:477) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.iter(init.py:378) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.exc_check(init.py:420) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.reraise(init.py:187) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.result(_base.py:451) at /home/admin/miniconda3/lib/python3.10/concurrent/futures/_base.__get_result(_base.py:403) at /home/admin/miniconda3/lib/python3.10/site-packages/tenacity/init.call(init.py:480) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_embedding_vectors(batch_vectorizer.py:273) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:164) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer.batch_generate(batch_vectorizer.py:119) at /home/admin/miniconda3/lib/python3.10/site-packages/kag/builder/component/vectorizer/batch_vectorizer._generate_vectors(batch_vectorizer.py:92) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:41) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.vectorizer(KagVectorizerAsyncTask.java:209) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:161) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagVectorizerAsyncTask$VectorizerTaskCallable.call(KagVectorizerAsyncTask.java:128) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:74) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:62) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750) 2025-05-02 16:08:02(172.28.0.5): Vectorizer task status is ERROR 2025-05-02 16:08:02(172.28.0.5): The asynchronous task has been created! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:02(172.28.0.5): Lock preempted successfully! 2025-05-02 16:08:01(172.28.0.5): Task scheduling completed. cost:7 ms ! 2025-05-02 16:08:01(172.28.0.5): Lock released successfully! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task created successfully! resource:builder/project_4/instance_8/46_kagVectorizerAsyncTask.kag 2025-05-02 16:08:01(172.28.0.5): Vectorizer task has been successfully created! 2025-05-02 16:08:01(172.28.0.5): The asynchronous task has not been created yet! 2025-05-02 16:08:01(172.28.0.5): Lock preempted successfully! Alignment(等待执行) 已调度次数 :0 Writer(等待执行) 已调度次数 :0
这通常是由于您填了无法从容器无法 ping 通的向量服务 URL。
The reason might be you specified an embedding service url which can you can not ping from the docker container.
并不是这个原因,在docker里面embedding接口一样是有返回结果的
### How to reproduce
1.创建知识库 2.上传文档 3.解析切片后向量化失败