Wing Lian
> I get similar error with
>
> `accelerate launch -m axolotl.cli.train llama_lora.yml --deepspeed deepspeed_configs/zero1.json`
>
> With config same in [examples](https://github.com/OpenAccess-AI-Collective/axolotl/blob/main/examples/llama-2/lora.yml).
>
> Just added additionally ...
May want to keep track of https://github.com/huggingface/peft/issues/958 in case it is supported there.
Looking at the shift/unshift code, it seems it's not packed-sequence-length aware, so that would need some modification (or we simply don't allow packed sequences to work with this feature).
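To make that concrete, here's a minimal sketch (illustrative only, assuming a LongLoRA-style `torch.roll` shift; the function name and shapes are my own, not Axolotl's actual implementation) of why a naive shift ignores packed-sequence boundaries:

```python
import torch

def shift_heads(qkv: torch.Tensor, group_size: int) -> torch.Tensor:
    # qkv: (batch, seq_len, num_heads, head_dim). Shift half the heads by
    # half a group along the sequence dimension, LongLoRA-style.
    half = qkv.shape[2] // 2
    shifted = qkv[:, :, half:].roll(-group_size // 2, dims=1)
    return torch.cat([qkv[:, :, :half], shifted], dim=2)

# With sample packing, seq_len covers several concatenated examples whose
# boundaries live in cu_seqlens; the roll above ignores them, so tokens from
# one packed example get shifted into positions belonging to its neighbor.
qkv = torch.randn(1, 8, 4, 2)   # e.g. two packed examples of length 4 each
print(shift_heads(qkv, group_size=4).shape)
```

A packing-aware version would have to roll each segment between consecutive `cu_seqlens` offsets separately; otherwise the safer option is to disable sample packing when this feature is enabled.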
These will be helpful: https://github.com/philipturner/metal-flash-attention and https://github.com/ml-explore/mlx/issues/129
It's hard to say: with 300 rows and 10% held out for the eval split, it could be randomness in a dataset that small that could lead to train loss...
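Back-of-the-envelope on the split size (just the numbers quoted above, nothing from the actual run):

```python
# With 300 rows and a 10% eval split, the eval set is only ~30 examples, so a
# single unusually easy or hard example moves the mean eval loss by ~1/30 of
# its own loss value; plenty of room for split randomness to dominate.
n_rows = 300
eval_frac = 0.10
n_eval = round(n_rows * eval_frac)   # 30 held-out examples
per_example_share = 1 / n_eval       # each example is ~3.3% of the eval loss
print(n_eval, f"{per_example_share:.1%}")
```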
@hengjiUSTC are you able to compare with the SFT trainer with proper label masking for instruct tuning?
You have completion-only set to false with TRL. You should start there; that should probably be true for that trainer to set the labels properly.
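For reference, this is roughly what completion-only masking looks like on the TRL side; a hedged sketch where the model name, dataset file, text field, and response template are placeholders, not taken from your setup:

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DataCollatorForCompletionOnlyLM, SFTTrainer

model_name = "meta-llama/Llama-2-7b-hf"   # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

dataset = load_dataset("json", data_files="train.jsonl", split="train")  # placeholder data

# Masks everything before the response template with -100 so loss is only
# computed on the completion, i.e. proper label masking for instruct tuning.
collator = DataCollatorForCompletionOnlyLM("### Response:", tokenizer=tokenizer)

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",   # assumes each row has a "text" field
    data_collator=collator,
    packing=False,               # completion-only masking is used without packing
)
trainer.train()
```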
It looks like you have a really small dataset. You should consider disabling sample packing.
Axolotl currently needs to be installed from source by cloning the GitHub repository. We have dependencies that aren't published as packages, so we can't push axolotl to PyPI yet.
Is there another part that goes with this to optionally have the tokenization step be a bit more sparse for this feature?