Kevin Hu


Do not allocate GPUs to the RAGFlow server itself. You could deploy an embedding inference server on the GPUs instead, which accelerates the chunking procedure much more.
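For example, a GPU-backed embedding service can run separately and then be registered in RAGFlow as a model provider. A minimal sketch using Xinference (image tag and flags are assumptions; check the Xinference docs for your version):

```bash
# run an Xinference server with GPU access; RAGFlow then connects to it on port 9997
docker run -d --name xinference --gpus all -p 9997:9997 \
  xprobe/xinference:latest xinference-local -H 0.0.0.0
```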

I recommend using the `slim` version of the Docker image and not deploying RAGFlow itself with GPUs; the GPUs are better spent on embedding/LLM inference.
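The slim variant is selected through the same `RAGFLOW_IMAGE` variable in `docker/.env`; the exact tag below is an assumption and depends on the release you track:

```bash
# docker/.env — "-slim" images ship without the bundled embedding models (tag name is an assumption)
RAGFLOW_IMAGE=infiniflow/ragflow:dev-slim
```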

Set this in `docker/.env`: `RAGFLOW_IMAGE=registry.cn-hangzhou.aliyuncs.com/infiniflow/ragflow:dev`
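After editing `docker/.env`, the containers have to be recreated for the new image to take effect. A sketch, assuming the default compose file shipped under `docker/`:

```bash
cd ragflow/docker
docker compose down
docker compose -f docker-compose.yml up -d   # pulls the image set in .env and recreates the containers
```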

This is for the case where we can't find the max token length of the assigned LLM.

What kind of LLM did you use? Let me check whether there's a bug or something.

RAGFlow does not know the context length of models added through XInference, which needs to be improved.

It's definitely controlled by the context length of the LLM.

Developing with the Docker image/container is a more promising and faster way; see the sketch below.
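For instance, you can iterate directly inside the running container instead of setting up a from-source environment. A sketch, assuming the default container name from the bundled compose file:

```bash
# open a shell inside the running RAGFlow server container (default name is an assumption)
docker exec -it ragflow-server bash
# tail the server logs to verify your changes
docker logs -f ragflow-server
```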

Check out [this](https://github.com/infiniflow/ragflow/blob/main/sdk/python/test/t_document.py#L292)

The context length is exceeded. Adjust these 2 parameters, or cut down the chunk token number. ![image](https://github.com/user-attachments/assets/e5d6b51c-653e-40cb-82d5-5f4c8f9ee909)