davidray222
Same error here. Have you solved it?
@VainF Do you have any suggestions? Thank you!! I think the model has already been severely damaged after pruning, so fine-tuning may not be very effective.
@Cyber-Vadok I think there will be a size mismatch problem when loading the model after pruning, but we can try!
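In case it is useful, here is a minimal sketch of the kind of save/load I have in mind (assuming the model was pruned in place, e.g. with Torch-Pruning; the paths are placeholders). Saving the whole model object instead of only the state_dict should avoid the size mismatch on reload, since the pruned shapes no longer match the original config:

```python
import torch

# Assumption: `model` has already been pruned in place (e.g. with Torch-Pruning),
# so its layer shapes no longer match the original HF config. Reloading a plain
# state_dict into a freshly built model would raise size-mismatch errors.

# Save the full model object so the pruned architecture is preserved.
torch.save(model, "llama7b_pruned.pt")  # placeholder path

# Later: reload the pruned model directly, without from_pretrained / load_state_dict.
pruned_model = torch.load("llama7b_pruned.pt", map_location="cpu")
pruned_model.eval()
```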
I wonder whether I put the wrong lora_path. Thank you!
@yuhuixu1993 Thanks!
@yuhuixu1993 I performed quantization using the AutoGPTQ method with the script AutoGPTQ/examples/quantization/quant_with_alpaca.py, using the command: python quant_with_alpaca.py --pretrained_model_dir huggyllama/llama-7b --quantized_model_dir llama7b-quant4bit-g32 --bits 4 --group_size 32 --save_and_reload  I would like to...
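For reference, this is roughly what the same 4-bit / group-size-32 setup looks like through the AutoGPTQ Python API; a minimal sketch only, with a single placeholder calibration sample instead of the full alpaca set:

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

pretrained = "huggyllama/llama-7b"
quantized_dir = "llama7b-quant4bit-g32"

tokenizer = AutoTokenizer.from_pretrained(pretrained, use_fast=True)

# Same settings as the CLI call above: 4-bit weights, group size 32.
quantize_config = BaseQuantizeConfig(bits=4, group_size=32, desc_act=False)

model = AutoGPTQForCausalLM.from_pretrained(pretrained, quantize_config)

# In practice the examples should be a few hundred tokenized alpaca prompts;
# a single placeholder sample is shown here.
examples = [tokenizer("Below is an instruction that describes a task.", return_tensors="pt")]

model.quantize(examples)
model.save_quantized(quantized_dir)
tokenizer.save_pretrained(quantized_dir)
```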
@yuhuixu1993 Could you please briefly explain how to decode the zeros to FP16? Thank you~~
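I am not the author, but as far as I understand the qzeros in a GPTQ checkpoint are just low-bit integers packed into int32 tensors, so "decoding to FP16" means unpacking them and casting. A rough sketch, assuming 4-bit packing (8 values per int32) and ignoring the +1 offset that some GPTQ versions apply to the stored zeros:

```python
import torch

def unpack_qzeros_to_fp16(qzeros: torch.Tensor, bits: int = 4) -> torch.Tensor:
    """Unpack GPTQ-style packed zero points (int32) into an fp16 tensor.

    Assumption: `bits`-wide values are packed little-endian into each int32,
    i.e. 32 // bits values per int32 (8 values for 4-bit).
    """
    per_int = 32 // bits
    mask = (1 << bits) - 1
    shifts = torch.arange(0, 32, bits, device=qzeros.device, dtype=torch.int32)

    # (rows, cols) -> (rows, cols, per_int): shift each int32 and mask out `bits` bits.
    unpacked = (qzeros.unsqueeze(-1) >> shifts) & mask

    # Flatten the packed dimension back into columns and cast to fp16.
    zeros = unpacked.reshape(qzeros.shape[0], qzeros.shape[1] * per_int)
    # Note: older GPTQ-for-LLaMa checkpoints store zeros - 1, so you may need `zeros + 1`.
    return zeros.to(torch.float16)
```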
@yuhuixu1993 I am still unable to merge my adapter with the quantized model, even after converting qzero to fp16. I have also tried using xxw11/AutoGPTQ_QALoRA, but it still didn’t work....
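One sanity check that might help isolate the problem: load the 4-bit checkpoint on its own with AutoGPTQ and print the packed shapes of one projection layer, so it is clear what the adapter has to match. A rough sketch (the path is a placeholder, and the attribute path assumes a LLaMA model):

```python
from auto_gptq import AutoGPTQForCausalLM

quantized_dir = "llama7b-quant4bit-g32"  # placeholder path

# Load the 4-bit checkpoint on its own, independent of any adapter.
# Depending on how it was saved, you may need use_safetensors=True here.
model = AutoGPTQForCausalLM.from_quantized(quantized_dir, device="cpu")

# Inspect one quantized projection to see the packed shapes the adapter must match.
layer = model.model.model.layers[0].self_attn.q_proj
print("qweight:", tuple(layer.qweight.shape))  # packed int32 weights
print("qzeros: ", tuple(layer.qzeros.shape))   # packed zero points, one row per group
print("scales: ", tuple(layer.scales.shape))   # fp16 scales, one row per group
```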
@yuhuixu1993 I have successfully quantized llama7b using your method from https://github.com/yuhuixu1993/GPTQ-for-LLaMa and obtained adapter_model.bin using qalora.py. However, I still encounter the same mismatch issue. I would like to ask if...
@yuhuixu1993 Yes, I used the peft_utils.py you provided to replace /qalora_test/lib/python3.8/site-packages/auto_gptq/utils/peft_utils.py and used the corresponding group size. Could it be that I'm using the wrong file for merge.py? Because when...
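Not sure about merge.py either, but one way to rule out a group-size mismatch is to compare the group size implied by the quantized checkpoint against the one implied by the adapter shapes. A rough sketch, assuming 4-bit packing, a flat state_dict, and my understanding that QA-LoRA's lora_A takes in_features // group_size inputs; paths and key names are placeholders and may differ in your checkpoints:

```python
import torch

# Placeholder paths: point these at your own quantized checkpoint and QA-LoRA adapter.
quant_ckpt = torch.load("path/to/llama7b-4bit-32g.pt", map_location="cpu")
adapter = torch.load("path/to/adapter_model.bin", map_location="cpu")

# Pick one quantized projection (key names can differ between GPTQ variants).
qweight_key = next(k for k in quant_ckpt if k.endswith("q_proj.qweight"))
scales_key = qweight_key.replace("qweight", "scales")

in_features = quant_ckpt[qweight_key].shape[0] * 8   # 4-bit: 8 values per packed int32
num_groups = quant_ckpt[scales_key].shape[0]
print("checkpoint group size:", in_features // num_groups)

# QA-LoRA's lora_A operates on in_features // group_size inputs, so its second
# dimension reveals the group size the adapter was trained with.
lora_key = next(k for k in adapter if "q_proj" in k and "lora_A" in k)
lora_A = adapter[lora_key]
print("adapter lora_A:", tuple(lora_A.shape),
      "-> adapter group size:", in_features // lora_A.shape[1])
```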