rjmehta1993

Results 7 comments of rjmehta1993

Adding exllamav2 support would be the biggest release for vLLM. +1

+1. EXL quants are unbeatable.

Changing the key from type to rope_type matches what transformers expects and removes the warning. But does this confirm that YaRN is implemented and working?

+1. Changing the key from type to rope_type matches what transformers expects and removes the warning. But does this confirm that YaRN is implemented and working?

Is this supported yet? @zhyncs