Zhewei Yao

13 comments by Zhewei Yao

Hi, we recently refactored the MoQ part (DeepSpeed version >= 0.7.0). Please try the newest version and let us know if it works. Here is the new tutorial link: https://www.deepspeed.ai/tutorials/model-compression/

Hi Xuezhe, Please let me know if the version in our branch can solve your problem.

Hi, this will be released later as part of MII-Azure: https://github.com/microsoft/DeepSpeed-MII

Hi, the ZeroQuant inference engine is not released yet. The code example in DeepSpeedExamples is only meant to help verify ZeroQuant's accuracy. The kernel/engine release is on our...

@david-macleod The LKD example was just released (not merged yet): https://github.com/microsoft/DeepSpeedExamples/pull/214. For the kernel, please stay tuned.

Reza wrapped up https://github.com/microsoft/DeepSpeed/pull/2217, which answers some of your questions, such as the model size reduction. Regarding the kernels, we are working on a plan to release them...

Have there been any updates on this feature?

Thanks for the great proposal; we appreciate your contribution here :). We will discuss it internally and get back to you soon. Best,

Hi there, the proposal looks great to us. For the pruning/sparsification proposal, we wonder whether the callback's return value is needed, or whether we could just use something like `deepspeed.sparse_callback_step()`...
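Either API shape, a user-supplied callback or a single explicit step call, would ultimately run a pruning step like the one below. This is a minimal, dependency-free sketch of magnitude pruning; `magnitude_prune` and the plain-list weights are illustrative, not part of any DeepSpeed API.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the smallest-magnitude fraction of weights.

    weights:  flat list of floats (a real implementation would
              operate on parameter tensors in place)
    sparsity: fraction in [0, 1] of entries to set to zero
    """
    flat = sorted(abs(w) for w in weights)
    k = int(len(flat) * sparsity)          # number of entries to drop
    if k == 0:
        return list(weights)               # nothing to prune
    threshold = flat[k - 1]                # k-th smallest magnitude
    return [0.0 if abs(w) <= threshold else w for w in weights]
```

With either API, the framework (or the callback) would invoke this after each optimizer step; the open question above is only where that invocation lives.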

@ftian1 We did not provide a calibration-based PTQ, but we provided a ZeroQuant (PTQ without calibration) example here: https://github.com/microsoft/DeepSpeedExamples/tree/master/model_compression/bert/bash_script/ZeroQuant. It is always good to have more examples, and we would appreciate it if...
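To illustrate the "PTQ without calibration" distinction: the quantization scale is derived from the weights themselves, so no calibration dataset is needed. The sketch below shows per-row symmetric quantization in this spirit; the function name and list-based rows are illustrative, not the released ZeroQuant implementation.

```python
def symmetric_quantize(row, num_bits=8):
    """Calibration-free symmetric quantization of one weight row.

    The scale comes from the row's own max magnitude, so no
    calibration data is required. Returns the integer codes and
    the dequantized (reconstructed) values.
    """
    qmax = 2 ** (num_bits - 1) - 1         # 127 for int8
    scale = max(abs(w) for w in row) / qmax
    if scale == 0.0:
        scale = 1.0                        # all-zero row: any scale works
    q = [max(-qmax - 1, min(qmax, round(w / scale))) for w in row]
    deq = [v * scale for v in q]
    return q, deq
```

A calibration-based PTQ would instead run sample inputs through the model to pick scales (e.g. from activation ranges), which is exactly the extra machinery the calibration-free approach avoids.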