Maxwell comments

Results 9 comments of


                                            Maxwell

Hydra error in fairseq-generate cli (task: translation_multi_simple_epoch)

> > > trying to generate with 4 rtx 3090: > > > ``` > > > fairseq-generate \ > > > bin \ > > > --batch-size 1 \...

Polacode not using my set font

> Nope, I tried setting `fontLigatures` to `true` but it didn't help WARNING: This is not a solution, it's a hack, however works If you REALLY want to use your...

ValueError:paged_adamw_32bit is not a valid OptimizerNames

This is definitly caused by the version of `transformers`. Here are my versions, which is clearly higher: ``` accelerate 0.20.0.dev0 bitsandbytes 0.39.0 transformers 4.30.0.dev0 peft 0.4.0.dev0 ``` Please try upgrading...

[bug] Completed model does not load from checkpoint / generate produces same as base model

🥰 Your "workaround" is a very good fix, it is clearly working and should be merged ASAP. I can comfirm its working. I was trying to do a predict using...

fix issues to be compatible with latest peft

I dont know if this is the RIGHT way, but this simple modification at [L275](https://github.com/tloen/alpaca-lora/blob/8bb8579e403dc78e37fe81ffbb253c413007323f/finetune.py#L275) produces a `adapter_model.bin` with the right size: ``` diff - model.save_pretrained(output_dir) + model.save_pretrained(output_dir, state_dict=old_state_dict()) ```

Maxwell

Hydra error in fairseq-generate cli (task: translation_multi_simple_epoch)

Polacode not using my set font

ValueError:paged_adamw_32bit is not a valid OptimizerNames

[bug] Completed model does not load from checkpoint / generate produces same as base model

fix issues to be compatible with latest peft

V100 can not supprt load_in_4bit and fp16?

Only cpu ram getting used...

[Feature request] Add custom dataset compatibility

Loss spike during training phase