Ma Xinyin
Hi. We have updated the code in https://github.com/horseee/LLM-Pruner. Please refer to the new repo.
Our code currently does not support `iterative_steps > 1` for Baichuan. Please try `iterative_steps = 1`.
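A hypothetical invocation of the single-step setting, for illustration only — the script name and flag names here are assumed from the repo's examples and may differ in your checkout; the one point being made is `--iterative_steps 1`:

```shell
# Illustrative only: entry point and flags assumed from the repo's examples.
# The essential setting for Baichuan is iterative_steps = 1 (prune in one shot).
python hf_prune.py \
    --base_model baichuan-inc/Baichuan-7B \
    --pruning_ratio 0.25 \
    --iterative_steps 1 \
    --save_model
```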
Hi. LLM-Pruner is a general structural pruning method for LLMs, and it can also be used to prune BLOOM. However, due to the increasing number of LLMs in...
Hi. I uploaded the code for pruning BLOOM. You can find the instructions for pruning BLOOM [here](https://github.com/horseee/LLM-Pruner/tree/main/examples#cherry_blossom-bloom). I only conducted a quick test on BLOOM-3B to make sure it...
Hi. From the perspective of the algorithm, it is entirely feasible to use this on Bloom 176B. However, the current algorithm requires gradient computation, and recording these gradients for a...
Hi. Please refer to this error:
```
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
```
Please check whether...
Hi. You need to configure wandb first. You can follow the wandb setup instructions 😄
Hi. Could you please check whether you deleted the gradients used for calculating the Taylor importance before saving the model?
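A minimal pure-Python sketch of what "deleting the gradients before saving" means here — in PyTorch you would set `p.grad = None` on each parameter before calling `torch.save`; the class and function names below are illustrative, not the repo's API:

```python
# Illustrative stand-in for a framework parameter: holds weights plus an
# optional gradient buffer (filled in by a backward pass / Taylor importance).
class Param:
    def __init__(self, data):
        self.data = data
        self.grad = None

def strip_gradients(params):
    """Clear gradient buffers so a saved checkpoint holds only the weights
    (PyTorch analogue: `for p in model.parameters(): p.grad = None`)."""
    for p in params:
        p.grad = None
    return params

params = [Param([1.0, 2.0]), Param([3.0])]
params[0].grad = [0.1, 0.2]      # pretend a backward pass populated this
strip_gradients(params)
print(all(p.grad is None for p in params))  # True: weights kept, gradients gone
```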
Hi. The pruning needs around 80G of memory if you use the Taylor pruner, since it needs to compute the gradients of the model. If you use other pruners, like L2...
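As a rough back-of-the-envelope check (my own illustrative arithmetic, not the repo's exact accounting): a 7B-parameter model in fp32 takes 4 bytes per weight plus another 4 bytes per gradient, already about 52GB before activations and any other overhead, which is why the requirement lands near 80G:

```python
# Rough memory estimate for gradient-based (Taylor) pruning.
# Illustrative arithmetic only -- activations and overhead are not counted.
def taylor_pruning_memory_gb(n_params, bytes_per_param=4):
    """Weights + gradients, both stored in fp32 (4 bytes each)."""
    weights = n_params * bytes_per_param
    grads = n_params * bytes_per_param  # Taylor importance needs gradients
    return (weights + grads) / 1024**3

print(round(taylor_pruning_memory_gb(7e9)))  # ~52 GB before activations/overhead
```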
Hi. Could you tell me which LLaMA-7B checkpoint you used? `decapoda-research/llama-7b-hf` in my code is not available currently, and I'm not sure whether that is the cause of this difference.