orellavie1212
Any updates on that bug? I can't run it even with a batch_size of 2 or 8 (tried on SageMaker with ml.g5.12xlarge and ml.g4dn.12xlarge). I am out of ideas, even tried...
Yes, that is the problem! I found out the dict is empty when I check the adapter_weights. How did you fix it specifically? I'll mock it. At the checkpoints there...
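For anyone else hitting this, a minimal sketch of the emptiness check (the helper name is mine, just for illustration; `torch` is only needed for the actual load):

```python
def adapter_is_empty(state_dict):
    """True if the loaded checkpoint contains no LoRA tensors at all."""
    lora_keys = [k for k in state_dict if "lora" in k.lower()]
    return len(lora_keys) == 0

# Actual usage (needs torch and the adapter file on disk):
#   import torch
#   sd = torch.load("adapter_model.bin", map_location="cpu")
#   print("empty adapter!" if adapter_is_empty(sd) else f"{len(sd)} tensors")
```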
In the main directory I have:

```
adapter_config.json
adapter_model.bin
checkpoint-69
checkpoint-72
checkpoint-75
runs
```

In the specific checkpoint directory (checkpoint-75):

```
optimizer.pt
pytorch_model.bin
rng_state.pth
scaler.pt
scheduler.pt
special_tokens_map.json
tokenizer_config.json
tokenizer.json
trainer_state.json
training_args.bin
```

you only need...
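A small stdlib-only sketch of the sanity check implied here: PEFT-style adapters load from a directory that contains both `adapter_config.json` and `adapter_model.bin` (the function name is my own, for illustration):

```python
import os

REQUIRED = ("adapter_config.json", "adapter_model.bin")

def missing_adapter_files(adapter_dir):
    """Return the required adapter files absent from adapter_dir."""
    return [f for f in REQUIRED
            if not os.path.isfile(os.path.join(adapter_dir, f))]
```

An empty return means the directory at least has the right files; it says nothing about whether `adapter_model.bin` actually holds weights.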
I have adapter_model.bin but it is empty, as I said. I tried to understand what you suggested, but the only possibility I see is to take pytorch_model.bin from the checkpoint...
Yes, the weights actually loaded successfully now. Any idea if the last checkpoint is exactly the end of training? Or did I miss part of an epoch, so that adapter_model.bin is more advanced in...
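One way to answer this without guessing is to read `trainer_state.json` inside the checkpoint directory and compare its recorded step against the full run. A minimal stdlib sketch, assuming the file has the usual `global_step` and `epoch` fields that the HF Trainer writes:

```python
import json

def checkpoint_progress(trainer_state_path):
    """Return (global_step, epoch) recorded in a HF Trainer checkpoint."""
    with open(trainer_state_path) as f:
        state = json.load(f)
    return state["global_step"], state["epoch"]

# If global_step here is lower than the total steps of the run,
# the root-level adapter_model.bin is ahead of this checkpoint.
```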
> I have an example script that works with Mixtral:
>
> https://github.com/casper-hansen/AutoAWQ/blob/main/examples/basic_vllm.py

Checking it right now. https://github.com/casper-hansen/AutoAWQ/blob/main/examples/mixtral_quant.py, I hope this is the configuration you added to your model at...
I thought the solution for general Mixtral (not GPTQ- or AWQ-quantized, just the regular one) was via .pt: https://huggingface.co/IbuNai/Mixtral-8x7B-v0.1-gptq-4bit-pth/tree/main, though even that one is .bin (no .pt found on HF)...
> I just used the following Docker image and ran `pip install vllm`
>
> `runpod/pytorch:2.1.1-py3.10-cuda12.1.1-devel-ubuntu22.04`

I am using the DJL container v25 with the same setup (py3.10, torch 2.1.1, CUDA 12.1).
> Could you try the Docker image I referenced to see if it's an environment issue?

tp=1 works, but tp=2 errors; I found different named_parameters...
> Not sure if this relates to #2203. Does it work in FP16 with TP > 1?

I also tried fp16 besides auto.
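For reference, forcing fp16 instead of `auto` at tp=2 would look roughly like this (a config sketch only, not run here; the model id is a placeholder for whatever you are loading):

```python
from vllm import LLM

# Assumes vLLM is installed and two GPUs are visible.
llm = LLM(
    model="mistralai/Mixtral-8x7B-v0.1",  # placeholder: use your model path
    dtype="float16",                      # explicit fp16 instead of "auto"
    tensor_parallel_size=2,               # the tp=2 case that errored above
)
```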