Unable to Extract Text
Search before asking
- [X] I had searched in the issues and found no similar issues.
Operating system information
Linux
What happened
Extractor task trace log: java.lang.RuntimeException: invoke extract Exception at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagExtractorAsyncTask$ExtractorTaskCallable.call(KagExtractorAsyncTask.java:224) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagExtractorAsyncTask$ExtractorTaskCallable.call(KagExtractorAsyncTask.java:178) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:67) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:56) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750) Caused by: java.util.concurrent.ExecutionException: pemja.core.PythonException: <class 'tenacity.RetryError'>: <Future at 0xffff100ac250 state=finished raised RuntimeError> at java.util.concurrent.FutureTask.report(FutureTask.java:122) at java.util.concurrent.FutureTask.get(FutureTask.java:192) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagExtractorAsyncTask$ExtractorTaskCallable.call(KagExtractorAsyncTask.java:221) ... 7 more Caused by: pemja.core.PythonException: <class 'tenacity.RetryError'>: <Future at 0xffff100ac250 state=finished raised RuntimeError> at /openspg_venv/lib/python3.8/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:49) at /openspg_venv/lib/python3.8/site-packages/kag/interface/builder/base.invoke(base.py:114) at /openspg_venv/lib/python3.8/site-packages/kag/builder/component/extractor/schema_free_extractor._invoke(schema_free_extractor.py:387) at /openspg_venv/lib/python3.8/site-packages/tenacity/init.wrapped_f(init.py:336) at /openspg_venv/lib/python3.8/site-packages/tenacity/init.call(init.py:475) at /openspg_venv/lib/python3.8/site-packages/tenacity/init.iter(init.py:376) at /openspg_venv/lib/python3.8/site-packages/tenacity/init.exc_check(init.py:419) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:33) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagExtractorAsyncTask$ExtractorTaskCallable$ExtractTaskCallable.call(KagExtractorAsyncTask.java:275) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagExtractorAsyncTask$ExtractorTaskCallable$ExtractTaskCallable.call(KagExtractorAsyncTask.java:244) at java.util.concurrent.FutureTask.run(FutureTask.java:266) ... 3 more
2025-01-15 00:19:14(172.18.0.5): Extractor task status is ERROR
How to reproduce
- Setup Global Configurations
- Upload a PDF
- Check Log
Are you willing to submit PR?
- [ ] Yes I am willing to submit a PR!
I‘ve also encountered the same problem. Have you solved it?
The log indicates that the LLM-based NER has failed. Could you check if the LLM service is available?
The log indicates that the LLM-based NER has failed. Could you check if the LLM service is available?
这是我的模型,我有相同的报错:
The log indicates that the LLM-based NER has failed. Could you check if the LLM service is available?
这是我的模型,我有相同的报错:
![]()
I encountered the same issue. Has anyone successfully resolved it?
I also tried with default OpenAI based models and encounter the same issue. Any help regarding the default OpenAI specific config would be warmly appreciated.
I also tried with default OpenAI based models and encounter the same issue. Any help regarding the default OpenAI specific config would be warmly appreciated.
You can refer to User Guide to test Generative Model Service Accessibility and re-settle your OpenAI model configuration.