TensorRT-LLM
enable medusa int8 weight only quantization
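As a rough illustration of what int8 weight-only quantization means in TensorRT-LLM (not the PR's actual code), the public QuantConfig/QuantAlgo API can request int8 weights with fp16 activations; the Medusa-specific wiring added by this PR lives in the checkpoint-conversion path. Import paths below assume a recent TensorRT-LLM release.

```python
# Minimal sketch: requesting int8 weight-only quantization (W8A16)
# via TensorRT-LLM's quantization config. Illustrative only; the PR
# hooks this up for Medusa checkpoints during conversion/build.
from tensorrt_llm.quantization import QuantAlgo
from tensorrt_llm.models.modeling_utils import QuantConfig

# W8A16 = int8 weights, fp16 activations (weight-only quantization).
quant_config = QuantConfig(quant_algo=QuantAlgo.W8A16)
print(quant_config.quant_algo)
```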
@kaiyux, could you help review it?
@XiaobingSuper Thanks for the support! We will review the changes in the internal codebase and get back to you.
Hi @XiaobingSuper, we've merged your changes into the main branch. Thanks a lot for your contribution.