Sebastian Raschka
Hi there! Image/video is not supported, but I can certainly add a Trainer recipe at some point. Thanks for suggesting!
Thanks for the feedback. It does work on 4 x L4s, which have 24 GB each. I can see that the usage is around 22-24 GB. Other than trying a...
It was on each GPU. I think it uses substantially less memory than 22 GB x 4 in total, though; it might be that it works just fine on a...
Ah yes, `litgpt finetune ...` uses LoRA by default. For full finetuning, it's `litgpt finetune_full ...`
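E.g., roughly (assuming the checkpoint from above has already been downloaded; the exact arguments can vary a bit between litgpt versions):

```
# parameter-efficient finetuning with LoRA (the default behind `litgpt finetune`)
litgpt finetune checkpoints/meta-llama/Llama-2-7b-hf

# full finetuning, i.e., updating all model weights
litgpt finetune_full checkpoints/meta-llama/Llama-2-7b-hf
```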
Hm, I haven't had any issues with that recently, but there have been a couple of changes in the last few days. I assume it's the same issue with...
@KOVVURISATYANARAYANAREDDY I just tried it and it works fine for me with Llama 2 7B:

```
(qlora) sebastian@hyperplane1:~/Developer/prs/debug/lit-gpt$ python finetune/lora.py --precision "bf16-true" --quantize "bnb.nf4" --checkpoint_dir checkpoints/meta-llama/Llama-2-7b-hf/
{'eval_interval': 100, 'save_interval': 100,...
```
Which dataset are you using? I was using Alpaca, which has relatively short contexts.
Just for debugging purposes, what is your memory usage if you use the default lora.py script with a micro-batch size of 1 on Alpaca, so that we can compare to my results...
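For comparing numbers, something like this is what I have in mind; it's just a generic `nvidia-smi` polling command to watch per-GPU memory while the script runs, nothing lit-gpt-specific:

```
# refresh per-GPU memory usage every second while the finetuning script runs in another shell
nvidia-smi --query-gpu=index,memory.used,memory.total --format=csv -l 1
```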
Interesting, so the problem is the longer contexts then?
I see. Hm, that's interesting. So on shorter contexts, bnb.nf4 performs better, but on longer contexts it performs worse?