LLaMA-Factory icon indicating copy to clipboard operation
LLaMA-Factory copied to clipboard

[Question]support for quantization algorithms that are not performed on-the-fly

Open wenhuach21 opened this issue 5 months ago • 3 comments

Reminder

  • [X] I have read the README and searched the existing issues.

System Info

None

Reproduction

None

Expected behavior

None

Others

Hi, Thank you for the fantastic work on LLaMA Factory! I’ve noticed that the repository supports both quantized models generated by various algorithms and on-the-fly quantization.

I am curious if LLaMA Factory is open to contributions of quantization algorithms that are not performed on-the-fly. We have open-source AutoRound that serves as a strong alternative to existing methods. We could contribute if it's ok to you.

github

User experience on Finetuning

wenhuach21 avatar Sep 12 '24 06:09 wenhuach21