Weishaoya
### Is there an existing issue for the same bug? - [X] I have checked the existing issues. ### Branch name main ### Commit ID the latest code ### Other...
System: Ubuntu 22.04 (no GPU). I am a Chinese user. When I run the command "docker run -p 8000:8000 -e HF_ENDPOINT=https://hf-mirror.com savatar101/omniparse:0.1", the following error occurs: ========== == CUDA...
### Describe your problem For the ragflow_streaming_output API, when I set the number of concurrent requests to 1, 10, and 100, the first-token latency was 0.6719s, 4.7593s,...
### What happened? No matter how I adjust the context window size parameter, the right side of the context window always shows 128k ### Steps to reproduce Since...