
[Bug]: PDF parsing gets stuck after uploading in a new deployment

Open shaoyie opened this issue 1 year ago • 8 comments

Is there an existing issue for the same bug?

  • [X] I have checked the existing issues.

Branch name

main

Commit ID

bef1bbdf3e16e5163bc563407bd7fd8f7da97d7a

Other environment information

No response

Actual behavior

On a newly deployed system, I created a knowledge base with the general parse method, uploaded a PDF file, and clicked start. Progress stays at around 0.03% with the status "Task is dispatched", and nothing changes even after a long wait. After cancelling the task and restarting it, parsing actually starts very quickly.

Expected behavior

Parsing should complete on the first attempt.

Steps to reproduce

1. Deploy a new system.
2. Create a knowledge base with the general parse method.
3. Upload a PDF file.
4. Click start parsing.
5. Observe that the status gets stuck at "Task is dispatched".
6. Cancel the task and restart it; the parsing task then completes quickly.
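
Until the root cause is found, the manual cancel-and-restart step can be scripted. The following is only a minimal sketch of a stall watchdog, not RAGFlow code: `start`, `cancel`, and `get_progress` are hypothetical callables that you would have to wire to your own deployment, and progress is assumed to be a fraction between 0 and 1. It works around the symptom and does not fix the underlying dispatch problem.

```python
import time
from typing import Callable


def run_with_restart(
    start: Callable[[], None],          # hypothetical: starts the parsing task
    cancel: Callable[[], None],         # hypothetical: cancels the parsing task
    get_progress: Callable[[], float],  # hypothetical: progress as a 0..1 fraction
    stall_timeout: float = 60.0,
    poll_interval: float = 2.0,
    max_restarts: int = 3,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Start a task, then cancel and restart it whenever progress stalls.

    Returns the number of restarts that were needed. Raises TimeoutError
    if the task is still stalled after max_restarts restarts.
    """
    restarts = 0
    while True:
        start()
        last_progress = get_progress()
        last_change = clock()
        while True:
            if last_progress >= 1.0:
                return restarts  # task finished
            sleep(poll_interval)
            progress = get_progress()
            now = clock()
            if progress != last_progress:
                # Progress moved, so reset the stall timer.
                last_progress, last_change = progress, now
            elif now - last_change >= stall_timeout:
                break  # no change for too long: treat as stuck
        cancel()
        if restarts >= max_restarts:
            raise TimeoutError("task still stalled after %d restarts" % restarts)
        restarts += 1
```

The clock and sleep functions are injectable so the loop can be exercised without real waiting. Choose `stall_timeout` generously, since a large PDF may legitimately sit at the same percentage for a while.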

Additional information

No response

shaoyie avatar May 11 '24 14:05 shaoyie

We have also seen this, but we are not sure of the reason yet.

KevinHuSh avatar May 13 '24 09:05 KevinHuSh

We have also seen this, but we are not sure of the reason yet.

What I do know is that the version from Apr 17 works fine. So maybe check the changes made to the task executor since then?

shaoyie avatar May 13 '24 12:05 shaoyie

Try upgrading to the dev version. We fixed this there.

KevinHuSh avatar May 16 '24 01:05 KevinHuSh

Try upgrading to the dev version. We fixed this there.

I tried the latest code. On the first attempt, it still blocked here: image

Subsequent requests work fine, even after recreating the container. It should be fine to go with this for now, if only the first run needs a manual ramp-up.

shaoyie avatar May 17 '24 09:05 shaoyie

This should remain open, as it's still an issue. Thanks.

timdonovanuk avatar May 21 '24 11:05 timdonovanuk

Yes, and it seems this issue still happens sometimes. Reopening it.

shaoyie avatar Jun 04 '24 06:06 shaoyie

Cancelling the task and restarting it doesn't work for me. It's terrible.

jiajunly avatar Aug 13 '24 09:08 jiajunly

Maybe related to #1383. Hope the cause can be found and fixed as soon as possible.

jiajunly avatar Aug 13 '24 09:08 jiajunly