vllm icon indicating copy to clipboard operation
vllm copied to clipboard

Add support for BLOOM

Open WoosukKwon opened this issue 2 years ago • 0 comments

Closes #61

This PR adds the BLOOM model and modifies the paged attention kernel to support ALiBi bias.

WoosukKwon avatar Jul 02 '23 07:07 WoosukKwon