Sebastian Raschka
Yes, I think so. But one last question: after the update, have you double-checked / tested it on custom paths that don't start with `"checkpoints"`, like

```bash
litgpt chat my_custom_dir/google/gemma-2-9b-it...
```
Nice, thanks for checking! Looks all good to me now :)
Good call. I can take care of these next week
Good point, I agree
Good point. Thanks again for the contribution!
Good point. I think the main thing here is that if you have large amounts of text, you would store it in a compressed or pretokenized format, and perhaps also...
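To make the pretokenization idea concrete, here's a minimal sketch (assuming a Hugging Face `transformers` tokenizer; the file names `corpus.txt` and `corpus_tokens.npy` are just placeholders, not litgpt conventions):

```python
# Minimal sketch: pretokenize a large text file once and store the token
# IDs in a compact binary format, instead of re-tokenizing raw text on
# every run. Assumes the Hugging Face `transformers` API.
import numpy as np
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pythia-14m")

token_ids = []
with open("corpus.txt") as f:  # placeholder path
    for line in f:
        token_ids.extend(tokenizer.encode(line))

# uint16 covers vocab sizes up to 65,535, which halves the storage
# compared to int32; use a wider dtype for larger vocabularies.
np.save("corpus_tokens.npy", np.asarray(token_ids, dtype=np.uint16))

# Later, the tokens can be memory-mapped without loading everything:
# tokens = np.load("corpus_tokens.npy", mmap_mode="r")
```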
Thanks for reporting, and hm, yes, this is weird. I can reproduce it:

### Pretraining

```bash
litgpt pretrain \
  --model_name pythia-14m \
  --tokenizer_dir checkpoints/EleutherAI/pythia-14m \
  --out_dir my_test_dir \
  --data TextFiles...
```
Hi there, do you remember what the output was before the conversion? It would be useful to know so we can make sure the model was trained well.
I would be open to adding these models. If it helps, I've recently written a how-to guide here: https://github.com/Lightning-AI/litgpt/blob/main/tutorials/developer-docs/adding-models.md
Hi there, could you try this with a very small text example that only consists of a few entries, e.g., repeated versions of the entry you showed:

```json
[
    {...
```
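In case a concrete starting point helps, a tiny test file might look roughly like this (a sketch assuming litgpt's Alpaca-style JSON schema with `instruction`/`input`/`output` keys; substitute the fields of your actual entry):

```json
[
    {"instruction": "Translate to French.", "input": "Hello", "output": "Bonjour"},
    {"instruction": "Translate to French.", "input": "Hello", "output": "Bonjour"}
]
```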