Pierre Janeke

Results 5 comments of Pierre Janeke

I see 2. was fixed with https://github.com/sgl-project/sglang/commit/b0890631a011be28d5ef5a0b4d5551fdeb94ab25

Does this mean the problem with 1. is fixed @merrymercy?

@rlouf did you manage to make much progress yet?

I had a similar problem running on an EC2 g5.2xlarge instance (1 x A10G) using openchat/openchat3.5-0106. I have long sequences (6-7k tokens). A batch size of 19 sequences is fine,...