LongChat
LongChat copied to clipboard
Add support for flash attention with use_cache
thanks for your guys' amazing work ! i want know is this issue associate with vLLM ?