Roman Makarov
Well, changing the version of transformers does not help. What do you mean by GPTQModel? Should I use their generation benchmark, or quantize the model with their method and then try it with AutoGPTQ generation...
I have the same issue with LLaMA and Phi models, even though I follow their instructions from [here](https://github.com/VainF/Torch-Pruning/tree/master/examples/LLMs). Am I the only one encountering this?