Carlos Mocholí
@gkroiz Why does `block_size` run into recompilations but `256` doesn't? `256` would use less memory, but it could limit learning depending on your data's length.
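For illustration only (not lit-parrot code): fixing the length at 256 means every batch has the same static shape, while anything longer than 256 tokens gets cut off, which is where the learning limitation would come from. A hypothetical sketch of that padding/truncation:

```python
import torch

FIXED_LEN = 256  # illustrative choice from the comment above


def to_fixed_length(ids: torch.Tensor, pad_id: int = 0) -> torch.Tensor:
    """Truncate or right-pad a 1D token tensor to FIXED_LEN so every batch
    has the same shape, at the cost of dropping tokens beyond 256."""
    ids = ids[:FIXED_LEN]
    if ids.numel() < FIXED_LEN:
        padding = torch.full((FIXED_LEN - ids.numel(),), pad_id, dtype=ids.dtype)
        ids = torch.cat([ids, padding])
    return ids
```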
Thanks for the explanation. For pretraining, one can decrease the `micro_batch_size`. The data is packed together in a sample, so 4 batches of 10 should be approximately equal to 1...
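For context, here's a minimal sketch of what decreasing `micro_batch_size` with gradient accumulation looks like in plain PyTorch; the function and argument names are illustrative, not taken from the pretraining script:

```python
import torch
from torch import nn
from torch.utils.data import DataLoader


def train_with_accumulation(model: nn.Module, loader: DataLoader,
                            optimizer: torch.optim.Optimizer,
                            accumulation_steps: int = 4) -> None:
    """Accumulate gradients over several micro-batches before stepping,
    so e.g. 4 micro-batches of 10 behave roughly like one batch of 40."""
    optimizer.zero_grad()
    for step, (inputs, targets) in enumerate(loader):
        logits = model(inputs)
        loss = nn.functional.cross_entropy(logits, targets)
        # Scale so the accumulated gradients match a single large batch
        (loss / accumulation_steps).backward()
        if (step + 1) % accumulation_steps == 0:
            optimizer.step()
            optimizer.zero_grad()
```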
I opened https://github.com/Lightning-AI/lit-parrot/pull/143, which does the above automatically by saving a `config.json` file with the optimal `max_seq_length` in the data directory.
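As a rough sketch of that idea (the helper names below are hypothetical, not the PR's actual code): the data-preparation step records the longest tokenized sample, and the training script reads it back instead of hard-coding a sequence length.

```python
import json
from pathlib import Path


def save_data_config(data_dir: Path, max_seq_length: int) -> None:
    """Write the optimal sequence length next to the prepared data."""
    with open(data_dir / "config.json", "w") as f:
        json.dump({"max_seq_length": max_seq_length}, f)


def load_max_seq_length(data_dir: Path, fallback: int) -> int:
    """Read it back at training time, falling back if the file is missing."""
    path = data_dir / "config.json"
    if path.is_file():
        with open(path) as f:
            return json.load(f)["max_seq_length"]
    return fallback
```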
Uh that's strange. Try pushing new commits and I'll debug it if it keeps happening
This sounds good to me too. Sorry for the confusion! @AngainorDev Would you like to update all occurrences together here?
All the finetune/ and pretrain/ scripts should be updated too
Hi! Here's the memory usage using current master (commit b29ca09) with falcon-7b, always passing `--precision 16-true`:

- **finetune/adapter.py**: 32.69 GB (`micro_batch_size=4`), 17.37 GB (`micro_batch_size=1`)
- **finetune/adapter_v2.py**: 41.75 GB (`micro_batch_size=4`),...
I just merged some improvements to reduce the peak memory usage. Please pull the latest changes. I'll also be adding a guide for dealing with OOMs with #182. Hope this...
Thanks for reporting this! I can repro and see that `devices=2` requires 39.80 GB. I'll investigate :microscope:
Oh, I just noticed what the issue is. If you don't pass a `--strategy`, it'll choose DDP. You should add `--strategy fsdp` when using more than 1 device. This is explained in...
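As an illustration of the difference, here's a minimal sketch assuming the Lightning Fabric API the scripts build on (not the actual script code): DDP replicates the full model on every GPU, while FSDP shards parameters across devices, which is what brings the per-device memory down.

```python
from lightning.fabric import Fabric

fabric = Fabric(
    devices=2,            # more than 1 device: pick a sharding strategy explicitly
    strategy="fsdp",      # equivalent to passing `--strategy fsdp` on the CLI
    precision="16-true",  # same precision setting used in the measurements above
)
fabric.launch()
```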