jaywongs comments

Results 21 comments of


                                            jaywongs

Error invalid configuration argument at line 119 in file /src/csrc/ops.cu

> @jaywongs , did upgrading deepspeed work for you? not work for me,i use the deepspeed 0.14.2

Error invalid configuration argument at line 119 in file /src/csrc/ops.cu

> > > @jaywongs , did upgrading deepspeed work for you? > > > > > > not work for me,i use the deepspeed 0.14.2 > > Hello, have you...

Is is possible to train 70b model on 8*A100 80G with full fine tunning?

> I recall that you may be able to with deepspeed 3 and cpu offload Apologies for the confusion. I attempted to use deepspeed 3 with CPU offload, but the...

Is is possible to train 70b model on 8*A100 80G with full fine tunning?

The batch size set to 1 is not working. I haven't tried the 8-bit optimization. Will using 8-bit affect the quality of the trained model?

Start Triton failed to load libtriton_tensorrtllm on aarch64.

any update on this problem?

Confusion about versions and NGC images

I'm also confused about this. The issues with the build process and version compatibility are driving me crazy.

Triton server is running, but no response returned.

@sleepwalker2017 Hi, did you solve this? I'm facing the same problem as you and have no idea what happened.

Triton server is running, but no response returned.

How to build the Mistral using BF16

> Hi @plt12138, it is a known bug in v0.8.0 release. It has been fixed in the recent main branch. Could you, please, try it? TensorRT-LLm :0.9.0.dev2024031900 Confirmed, it didn't...

How to build the Mistral using BF16

> Hello, would you mind spending some time testing the parameter length_penalty? In my case, the parameter length_penalty doesn't make sense in Mistral. I'm not sure if the bug is...