vllm
[Hardware][Intel-Gaudi] Regional compilation support
Add regional compilation support for HPU. It is enabled by default and can be disabled with the VLLM_REGIONAL_COMPILATION
environment variable. It applies only in torch.compile execution mode. It significantly reduces warmup time and slightly increases throughput.
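
For readers unfamiliar with the idea, the following is a minimal sketch of what regional compilation generally means: compiling each repeated submodule (for example, each decoder layer) on its own instead of compiling the whole model as one graph. It is not the code from this PR. The helper names (`regional_compilation_enabled`, `compile_model`), the `compile_fn` and `is_region` parameters, and the way the environment variable value is parsed are all assumptions made for illustration.

```python
import os
from typing import Any, Callable


def regional_compilation_enabled(default: bool = True) -> bool:
    """Read the toggle. The accepted values here are an assumption:
    unset means 'use the default'; '0' or 'false' disables."""
    raw = os.environ.get("VLLM_REGIONAL_COMPILATION")
    if raw is None:
        return default
    return raw.strip().lower() not in ("0", "false")


def _compile_regions(module: Any,
                     compile_fn: Callable[[Any], Any],
                     is_region: Callable[[Any], bool]) -> None:
    """Walk the module tree and replace each matching child with its
    compiled version; recurse into children that do not match."""
    # Snapshot the children so that replacing them is safe while looping.
    for name, child in list(module.named_children()):
        if is_region(child):
            setattr(module, name, compile_fn(child))
        else:
            _compile_regions(child, compile_fn, is_region)


def compile_model(model: Any,
                  compile_fn: Callable[[Any], Any],
                  is_region: Callable[[Any], bool]) -> Any:
    """Compile region by region when the toggle is on, otherwise
    compile the whole model in one piece."""
    if not regional_compilation_enabled():
        return compile_fn(model)
    _compile_regions(model, compile_fn, is_region)
    return model
```

With PyTorch, `compile_fn` could be `torch.compile` and `is_region` a predicate such as `lambda m: isinstance(m, MyDecoderLayer)`, where `MyDecoderLayer` is a hypothetical layer class. Because identical layers are compiled as small separate regions, the compiler has less work to repeat, which is the usual explanation for shorter warmup.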
Signed-off-by: Kacper Pietkun [email protected]