ragflow [Bug]: deepdoc use GPU still appears this error

Is there an existing issue for the same bug?

[x] I have checked the existing issues.

RAGFlow workspace code commit ID

29fb6ca11b318bd437b82299a800aca91f679efa

RAGFlow image version

v0.16.0

Other environment information

- GPU: A800 80G
- CUDA Version: 12.4
- OS: Ubuntu 20.04.1
- Docker Version: 24.0.7

Actual behavior

knowledge >> pdf file >> recognizer error

Expected behavior

Traceback (most recent call last): File "/ragflow/rag/svr/task_executor.py", line 626, in handle_task do_handle_task(task) File "/ragflow/rag/svr/task_executor.py", line 559, in do_handle_task chunks = build_chunks(task, progress_callback) File "/ragflow/rag/svr/task_executor.py", line 218, in build_chunks cks = chunker.chunk(task["name"], binary=binary, from_page=task["from_page"], File "/ragflow/rag/app/manual.py", line 190, in chunk sections, tbls = pdf_parser(filename if not binary else binary, File "/ragflow/rag/app/manual.py", line 54, in call self._layouts_rec(zoomin) File "/ragflow/deepdoc/parser/pdf_parser.py", line 327, in _layouts_rec self.boxes, self.page_layout = self.layouter( File "/ragflow/deepdoc/vision/layout_recognizer.py", line 70, in call layouts = super().call(image_list, thr, batch_size) File "/ragflow/deepdoc/vision/recognizer.py", line 487, in call bb = self.postprocess(self.ort_sess.run(None, {k:v for k,v in ins.items() if k in self.input_names}, self.run_options)[0], ins, thr) File "/ragflow/.venv/lib/python3.10/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 220, in run return self._sess.run(output_names, input_feed, run_options) onnxruntime.capi.onnxruntime_pybind11_state.RuntimeException: [ONNXRuntimeError] : 6 : RUNTIME_EXCEPTION : Non-zero status code returned while running Softmax node. Name:'/model.23/dfl/Softmax' Status Message: /onnxruntime_src/onnxruntime/core/framework/bfc_arena.cc:376 void* onnxruntime::BFCArena::AllocateRawInternal(size_t, bool, onnxruntime::Stream*, bool, onnxruntime::WaitNotificationFn) Available memory of 0 is smaller than requested bytes of 5505024

Steps to reproduce

knowledge >> pdf  file >> recognizer error

Additional information

No response

Feb 10 '25 08:02 nhlzh8708

It seems short of memory. OOM.

Feb 11 '25 01:02 KevinHuSh

It seems short of memory. OOM.

docker-compose-gpu.yml Specify id:3 gpu

when upload .pdf to knowledge and start embedding, I see that the CPU still has enough memory.

I don't know why this step why use show much gpu memory? File "/ragflow/deepdoc/vision/recognizer.py", line 487, in call bb = self.postprocess(self.ort_sess.run(None, {k:v for k,v in ins.items() if k in self.input_names}, self.run_options)[0], ins, thr)

Feb 11 '25 06:02 nhlzh8708

As I know, onnxruntime did not support GPU well

Feb 12 '25 03:02 KevinHuSh

As I know, onnxruntime did not support GPU well

now, Is it recommended to use CPU mode?

Feb 12 '25 07:02 nhlzh8708

I also encountered this problem

rag flow version

v0.16.0-24

error message

onnxruntime::BFCArena::AllocateRawInternal(size_t, bool, onnxruntime::Stream*, bool, onnxruntime::WaitNotificationFn) Available memory of 0 is smaller than requested bytes of 5505024

environment

NVIDIA A100 80GB

Feb 13 '25 02:02 Deeperfinder

I also encountered this problem

rag flow version

v0.16.0-24

error message

onnxruntime::BFCArena::AllocateRawInternal(size_t, bool, onnxruntime::Stream*, bool, onnxruntime::WaitNotificationFn) Available memory of 0 is smaller than requested bytes of 5505024

environment

NVIDIA A100 80GB

I can successfuly use cpu to extract pdf or other types of file!

Feb 13 '25 02:02 Deeperfinder

I also encountered this problem. Uploading a PDF to start embedding. I observe that one GPU is being used. However, in “docker-compose-gpu.yml“, the count is set to all.

Feb 19 '25 01:02 brookejiang

So do I

Mar 01 '25 01:03 prettyprettyboy

It work well on 4090.But it doesn't work on A800 80G.

Mar 01 '25 01:03 prettyprettyboy

the same question

Mar 08 '25 08:03 donblack01

the same question

Apr 18 '25 14:04 Aebestach