Casper comments

Results: 295 comments of Casper

torch 2.3.x support

This is a PyTorch-related issue. You are installing wheels that were built with torch 2.2.0 and trying to use them with torch 2.3.0. To fix this, you need to build from source or wait...
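
A minimal sketch of how such a mismatch could be detected before loading prebuilt kernels. It assumes that compatibility follows the major.minor release of torch, which is an assumption made for illustration and not something the comment states. The helper name `same_minor_release` is hypothetical. In practice the second argument could come from `torch.__version__`.

```python
def same_minor_release(built_with: str, installed: str) -> bool:
    """Return True when both version strings share the same major.minor release.

    Assumption: prebuilt wheels are only usable with the torch minor
    release they were compiled against.
    """
    def major_minor(version: str) -> tuple[int, int]:
        # Drop any local build suffix such as "+cu121" before splitting.
        core = version.split("+", 1)[0]
        major, minor = core.split(".")[:2]
        return int(major), int(minor)

    return major_minor(built_with) == major_minor(installed)


# Wheels built with torch 2.2.0, runtime on torch 2.3.0: the releases differ.
print(same_minor_release("2.2.0", "2.3.0"))
```

If the check fails, the options are the ones named in the comment: build from source or wait for updated wheels.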

torch 2.3.x support

@BenjaminBossan yes. I am building new kernels now and pushing an updated version to PyPI. Sorry for the delay - I plan to be faster at updating when new torch...

Issues with quantizing Cohere model

Hi @kwonjihun-theori, thanks for your very detailed post. I ran into the same challenges as you when trying to apply activation-aware quantization to this model, as it has a non-standard model definition....

Hi @Zephyr69, I am happy to investigate this issue. Do you have an example of how to trigger the gibberish? Otherwise, I would advise you to use [vLLM](https://github.com/vllm-project/vllm) to serve...

After quantization, the ppl is ok but humaneval score drops sharply

You will probably have to quantize your model using a custom dataset for coding.

Cohere Support

I see this is a draft PR. @TechxGenus have you done any further testing?

Bug - mixtral qlora(after b&b peft train) quantization broadcast problem

We don't support LoRA modules. You would have to convert your model to standard weights.

Cohere Support

Sorry for taking so long. I took a break from open-source work over the past month.

Have any plan to support chatglm serial models?

I plan to support new models as they come out. I focus most of my efforts on models that are better than previous models and mostly let others add older...

Have any plan to support chatglm serial models?

Hi @user-ZJ, I would love to support more models. There is no documentation yet, but that is subject to change as AutoAWQ now has a dedicated documentation page. Do you...