KAG icon indicating copy to clipboard operation
KAG copied to clipboard

Unable to Extract Text

Open dipaksha1201 opened this issue 11 months ago • 3 comments

Search before asking

  • [X] I had searched in the issues and found no similar issues.

Operating system information

Linux

What happened

Extractor task trace log: java.lang.RuntimeException: invoke extract Exception at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagExtractorAsyncTask$ExtractorTaskCallable.call(KagExtractorAsyncTask.java:224) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagExtractorAsyncTask$ExtractorTaskCallable.call(KagExtractorAsyncTask.java:178) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.executeTask(MemoryTaskServer.java:67) at com.antgroup.openspg.server.core.scheduler.service.common.MemoryTaskServer.lambda$submit$0(MemoryTaskServer.java:56) at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:750) Caused by: java.util.concurrent.ExecutionException: pemja.core.PythonException: <class 'tenacity.RetryError'>: <Future at 0xffff100ac250 state=finished raised RuntimeError> at java.util.concurrent.FutureTask.report(FutureTask.java:122) at java.util.concurrent.FutureTask.get(FutureTask.java:192) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagExtractorAsyncTask$ExtractorTaskCallable.call(KagExtractorAsyncTask.java:221) ... 7 more Caused by: pemja.core.PythonException: <class 'tenacity.RetryError'>: <Future at 0xffff100ac250 state=finished raised RuntimeError> at /openspg_venv/lib/python3.8/site-packages/kag/bridge/spg_server_bridge.run_component(spg_server_bridge.py:49) at /openspg_venv/lib/python3.8/site-packages/kag/interface/builder/base.invoke(base.py:114) at /openspg_venv/lib/python3.8/site-packages/kag/builder/component/extractor/schema_free_extractor._invoke(schema_free_extractor.py:387) at /openspg_venv/lib/python3.8/site-packages/tenacity/init.wrapped_f(init.py:336) at /openspg_venv/lib/python3.8/site-packages/tenacity/init.call(init.py:475) at /openspg_venv/lib/python3.8/site-packages/tenacity/init.iter(init.py:376) at /openspg_venv/lib/python3.8/site-packages/tenacity/init.exc_check(init.py:419) at pemja.core.PythonInterpreter.invokeMethod(Native Method) at pemja.core.PythonInterpreter.invokeMethod(PythonInterpreter.java:118) at com.antgroup.openspg.common.util.pemja.PemjaUtils.invoke(PemjaUtils.java:33) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagExtractorAsyncTask$ExtractorTaskCallable$ExtractTaskCallable.call(KagExtractorAsyncTask.java:275) at com.antgroup.openspg.server.core.scheduler.service.task.async.builder.KagExtractorAsyncTask$ExtractorTaskCallable$ExtractTaskCallable.call(KagExtractorAsyncTask.java:244) at java.util.concurrent.FutureTask.run(FutureTask.java:266) ... 3 more

2025-01-15 00:19:14(172.18.0.5): Extractor task status is ERROR

How to reproduce

  1. Setup Global Configurations
  2. Upload a PDF
  3. Check Log

Are you willing to submit PR?

  • [ ] Yes I am willing to submit a PR!

dipaksha1201 avatar Jan 14 '25 16:01 dipaksha1201

I‘ve also encountered the same problem. Have you solved it?

Mogul5306 avatar Jan 15 '25 01:01 Mogul5306

The log indicates that the LLM-based NER has failed. Could you check if the LLM service is available?

zhuzhongshu123 avatar Jan 15 '25 02:01 zhuzhongshu123

The log indicates that the LLM-based NER has failed. Could you check if the LLM service is available?

这是我的模型,我有相同的报错: image image

Gotodie avatar Jan 15 '25 09:01 Gotodie

The log indicates that the LLM-based NER has failed. Could you check if the LLM service is available?

这是我的模型,我有相同的报错: image image

I encountered the same issue. Has anyone successfully resolved it?

nilin1998 avatar Jan 17 '25 08:01 nilin1998

I also tried with default OpenAI based models and encounter the same issue. Any help regarding the default OpenAI specific config would be warmly appreciated.

tomthebuzz avatar Feb 11 '25 14:02 tomthebuzz

I also tried with default OpenAI based models and encounter the same issue. Any help regarding the default OpenAI specific config would be warmly appreciated.

You can refer to User Guide to test Generative Model Service Accessibility and re-settle your OpenAI model configuration.

Image

caszkgui avatar Feb 12 '25 01:02 caszkgui