LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
Thank you very much for this great open-source work! I ran: CUDA_VISIBLE_DEVICES=X bash scripts/evaluate.sh PATH_OR_NAME_TO_BASE_MODEL PATH_TO_SAVE_TUNE_MODEL PATH_TO_PRUNE_MODEL EPOCHS_YOU_WANT_TO_EVALUATE but only got the output: Selected Tasks: ['piqa', 'boolq', 'arc_challenge', 'hellaswag', 'openbookqa',...
Will this library natively support pruning of the recently released Llama-3?
LOGS: You are using a model of type mistral to instantiate a model of type llama. This is not supported for all configurations of models and can yield errors. Loading...
Hi, Thanks a lot for this awesome work! I am wondering whether there is a way to check the pruned but uncompressed model. Now when I save the model, they...
I’ve tried to prune Llama2-7B on a MacBook Pro M1, but the system killed the process due to OOM (I have 32 GB). Is there something I can do?...
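For context on the OOM question above, here is a back-of-the-envelope sketch of the weight memory alone for a Llama-2-7B-sized model at different precisions (the parameter count is approximate, and this ignores the gradients and activations that gradient-based importance estimation also needs, so peak usage during pruning is considerably higher):

```python
def model_memory_gib(n_params: float, bytes_per_param: int) -> float:
    """Approximate memory needed to hold the model weights, in GiB."""
    return n_params * bytes_per_param / (1024 ** 3)

# Llama-2-7B has roughly 6.74 billion parameters (approximate figure).
N_PARAMS = 6.74e9

for dtype, nbytes in [("float32", 4), ("float16", 2), ("int8", 1)]:
    print(f"{dtype}: ~{model_memory_gib(N_PARAMS, nbytes):.1f} GiB")
```

In float32 the weights alone are roughly 25 GiB, so loading in fp32 plus the extra buffers required for pruning easily exceeds 32 GB of unified memory; loading in half precision roughly halves the weight footprint.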
When I'm running `python generate.py --model_type pretrain` The error occurs, I can't understand the reason...
Hi, by any chance, have you actually run this on the Llama-2 model? I tried the default LLaMA parameters for pruning and post-training, which gave a similar WikiText-2 score (~19) but much...
Could you please tell me which command you used for post-training the model to obtain the results for the element 1 and element 2 methods?