vllm icon indicating copy to clipboard operation
vllm copied to clipboard

Use O3 optimization instead of O2 for CUDA compilation?

Open WoosukKwon opened this issue 2 years ago • 0 comments

We are currently using the -O2 flag in compiling our CUDA kernels. We need to investigate whether/how changing it to -O3 affects the system performance and compilation time.

WoosukKwon avatar May 04 '23 09:05 WoosukKwon