rjmehta1993
rjmehta1993
This will be the biggest release for vllm to support exllamav2. +1
+1. EXl quants is unbeatable
Changing key from type to rope_type matches to what transformers expect and it removes warning. But does this confirms that the YaRN is implemented and working?
Any updates?
+1. Changing key from type to rope_type matches to what transformers expect and it removes warning. But does this confirms that the YaRN is implemented and working?
is this supported yet? @zhyncs