Neo

Results 2 issues of Neo

When initializing the buffer, I need to pass in low_latency_mode. If it is true, low_latency mode is enabled. However, I found that in almost all cases, the performance of setting...

### Your current environment I used 4*5090 to test QwenImage by examples/offline_inference/qwen_image/text_to_image.py it will crash with EOFError, btw it occasionally exit with no error / warning log this is my...

bug
help wanted