Chevolier

Results: 16 comments by Chevolier

I ran into the same issue: memory usage keeps growing to 256 GB during data loading until the process is killed by the system. Is there any way to solve this?

> I ran into the same issue: memory usage keeps growing to 256 GB during data loading until the process is killed by the system. Is there any way to solve this?

Updates:...

Any solutions? I encountered the same issue with the bloomz model. So far I have just removed --enable_hybrid_engine to bypass the issue, and the program runs. However, I guess efficiency will drop without it.

> @Chevolier, can you please clarify the program you are referring to? It would be helpful to share what you are running and the expected output. Thanks!

I mean the...

Same problem: dynamic batching does not work. Environment: Docker image nvcr.io/nvidia/tritonserver:24.01-trtllm-python-py3, tensorrt_llm==0.7.1, tensorrtllm_backend==0.7.1. Is there any way to solve this problem?

> Same problem: dynamic batching does not work. Environment: Docker image nvcr.io/nvidia/tritonserver:24.01-trtllm-python-py3, tensorrt_llm==0.7.1, tensorrtllm_backend==0.7.1. Is there any way to solve this problem?

The following steps helped solve the problem in the above...

> Might you take a try on latest main branch?

I tested with the latest main branch but encountered other issues. So far I have solved it using postprocessing. May give it...

I was wondering how the development of this feature is going. Is it available now?