LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
I am trying to evaluate the perplexity of Llama-2-13B on WikiText-2. When using the script from [GitHub - yxli2123/LoftQ](https://github.com/yxli2123/LoftQ), I get a perplexity of 12.02. However, when using the...
The command I run:

```
python llama3.py --pruning_ratio 0.25 \
    --device cuda --eval_device cuda \
    --base_model home/Meta-Llama-3-8B \
    --block_wise --block_mlp_layer_start 4 --block_mlp_layer_end 30 \
    --block_attention_layer_start 4 --block_attention_layer_end 30 \
    --save_ckpt_log_name...
```
Hi there! How can I evaluate the PPL on "wikitext2,ptb" with the post-trained model?
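A minimal sketch of one way to measure WikiText-2 perplexity with plain `transformers`/`datasets` (this is not the repo's own evaluator; the window size and dataset config are assumptions, and PTB would follow the same pattern with its own dataset):

```python
import math
import torch
from datasets import load_dataset

def eval_wikitext2_ppl(model, tokenizer, seq_len=2048, device="cuda"):
    """Rough perplexity estimate: mean NLL over non-overlapping windows."""
    text = "\n\n".join(load_dataset("wikitext", "wikitext-2-raw-v1", split="test")["text"])
    ids = tokenizer(text, return_tensors="pt").input_ids
    losses = []
    model.eval()
    for i in range(0, ids.size(1) - seq_len, seq_len):
        chunk = ids[:, i : i + seq_len].to(device)
        with torch.no_grad():
            losses.append(model(chunk, labels=chunk).loss.item())
    return math.exp(sum(losses) / len(losses))

# Usage: load the post-trained model however it was saved (e.g. the pruned
# checkpoint with the tuned LoRA weights merged back in), then:
# print("wikitext2 PPL:", eval_wikitext2_ppl(model, tokenizer))
```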
I'm running post-training on a pruned model. After post-training, I get degraded performance (e.g., MMLU drops to 24%). Is this expected?

```
MODEL=meta-llama/Llama-2-7b-hf
prune_ckpt_path='llama_prune'
tune_ckpt_path='model'
RATIO=0.10
# Pruning...
```
In the README, `--pruning_ratio 0.25` is used and it's mentioned that it prunes 20% of the parameters. Why is this? If I want to prune 10%, should I use `--pruning_ratio 0.15`?
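One plausible explanation (the back-of-the-envelope numbers below are assumptions, not measurements): `--pruning_ratio` applies only to the transformer blocks selected by `--block_*_layer_start/end`, while the embeddings, `lm_head`, and the excluded first/last layers keep all their parameters, so the end-to-end reduction comes out lower than the per-block ratio.

```python
# Back-of-the-envelope arithmetic with assumed parameter counts (LLaMA-7B-ish).
total_params    = 6.7e9   # assumed total parameter count
prunable_params = 5.4e9   # assumed params inside the selected layers (attn + MLP)
ratio           = 0.25    # --pruning_ratio

overall_reduction = prunable_params * ratio / total_params
print(f"overall reduction ≈ {overall_reduction:.0%}")   # ≈ 20%, not 25%

# Inverting the same relation to target roughly 10% of all parameters:
target = 0.10
print(f"per-block ratio needed ≈ {target * total_params / prunable_params:.3f}")
```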
Issue resolved. The problem is that when constructing the trainer, `save_safetensors=False` should be set; otherwise, the `safe_serialization=False` mentioned above will not work. https://huggingface.co/docs/transformers/v4.36.1/en/main_classes/trainer#transformers.TrainingArguments.save_safetensors _Originally posted by @WilliamYi96 in https://github.com/horseee/LLM-Pruner/issues/45#issuecomment-1867980732_ I use...
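For reference, a minimal sketch of the fix described above (the output directory and the omitted hyperparameters are placeholders): `save_safetensors` is a `TrainingArguments` field, so disabling safetensors has to happen where the Trainer is constructed, not only in the later `save_pretrained(safe_serialization=False)` call.

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="tune_log/model",   # placeholder path
    save_safetensors=False,        # keep .bin checkpoints; without this,
                                   # safe_serialization=False elsewhere is ignored
    # ... remaining hyperparameters as in the post-training script ...
)
# Pass `training_args` when constructing the Trainer in the post-training script.
```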
Thanks for sharing. How should the `consecutive_groups` parameter be understood? MetaPruner in Torch-Pruning does not have this parameter.
Hi there, nice work! I've been tinkering with the repo, and came across some issues when trying to fully utilize the available resources. For example, I've learned that there is...
Hi! Is it possible to save the model and create custom configuration files so that we can push it to Hugging Face and load it? Also, can PEFT be used directly from...
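For context, a sketch of how a structurally pruned checkpoint is typically reloaded and wrapped with PEFT (the checkpoint path, the dictionary key, and the LoRA hyperparameters below are assumptions): since pruning changes per-layer shapes, the original `config.json` no longer describes the model, so the whole module object is usually pickled and reloaded rather than rebuilt via `from_pretrained`.

```python
import torch
from peft import LoraConfig, get_peft_model

# Assumed layout: the pruning step pickled the full model object to this path.
ckpt = torch.load(
    "prune_log/llama_prune/pytorch_model.bin",
    map_location="cpu",
    weights_only=False,      # needed on newer torch to unpickle a full nn.Module
)
model = ckpt["model"]        # key assumed; adjust to how the checkpoint was saved

# LoRA can then be attached to the loaded object for fine-tuning.
lora_cfg = LoraConfig(
    r=8, lora_alpha=16, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # LLaMA attention projections (assumed)
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()
```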