
Results 5 comments of tlyhenry

I had this issue with my qwen2.5 14b model on 0.17. Does it mean I need to upgrade to a newer version?

Sure. We are building it by passing the parameters:

```python
subprocesses.run_command(
    __cuda_visible_devices(num_gpu),
    "trtllm-build",
    "--checkpoint_dir", to_local_dir(checkpoint_dir),
    "--output_dir", tmpdir,
    "--gemm_plugin", dtype,
    f"--max_seq_len {max_seq_len}",
    f"--max_input_len {max_input_len}",
    f"--max_batch_size {max_batch_size}",
    "--use_paged_context_fmha enable" if use_paged_context_fmha else "",
    ...
```

Another thing I noticed is that after each request, our GPU memory usage increases.

I think it never releases the memory. I tried making the call multiple times, and I do see it incrementally increase; it never goes down. So after a certain amount of...

@dominicshanshan I think when we enable the context logits, the memory usage keeps increasing. I tried turning off the context logits and using the generation logits only. I...
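For reference, a minimal sketch of what "generation logits only" could look like at build time, assuming the standard `trtllm-build` gather flags (flag names and defaults may differ between TensorRT-LLM versions, and the checkpoint/output paths below are placeholders):

```shell
# Hypothetical build invocation: request only generation logits and leave
# context-logits gathering disabled. Gathering context logits makes the
# runtime keep a per-request logits buffer over the whole input, which is
# one plausible source of the growing GPU memory described above.
trtllm-build \
    --checkpoint_dir ./qwen2.5-14b-ckpt \
    --output_dir ./engine_dir \
    --gemm_plugin bfloat16 \
    --gather_generation_logits
# Note: no --gather_context_logits here; add it back only if you actually
# consume the context logits in your serving code.
```

This is only an illustration of the workaround described in the comment, not a confirmed fix for the underlying leak.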