Sebastian Raschka comments

Results 821 comments of


                                            Sebastian Raschka

QLoRA seems to be broken

Sounds great, thanks. I will make a reminder to test this on Sunday/Monday!

Add slow interconnect warning

How would you check it? Via the [nccl-tests](https://github.com/NVIDIA/nccl-tests) tool?

Add slow interconnect warning

> No, nccl-tests are to measure performance. We don't need it. I agree. That would be overkill, which is why I implemented the current approach. > Anyway, I think your...

Error instaling the plugin

Hi there. It could be that you just need to add a blank `__init__.py` file. Please note that I am no longer developing or maintaining the code in this repo,...

"RuntimeError: All the chunks should have been deleted." on non-Studio machine

Might be a LitData bug. Reported it here with a smaller self-contained example that doesn't use LitGPT: https://github.com/Lightning-AI/litdata/issues/367

Tensor parallelism generates non-sensical outputs

It seems to be related to the MLP class: ## Has problem: - microsoft/phi-2 - GptNeoxMLP - EleutherAI/pythia-2.8b - GptNeoxMLP - stabilityai/stablelm-base-alpha-7b - GptNeoxMLP - google/gemma-2-2b - GemmaMLP ## Is...

`pretrain` vs `finetune_full`

Hi there, these are good questions. Off the top of my head, the major usage difference is the dataset. The `finetune_*` scripts are mainly designed for instruction-finetuning. (I wanted to...

`pretrain` vs `finetune_full`

That's a fair point, but there is this philosophy in this repo that some code duplication isn't bad if it helps with readability. Because too much refactoring and code sharing...

`pretrain` vs `finetune_full`

Closing to clean up the issues a bit. But please feel free to respond or reopen in case you have additional questions.

litgpt download ..model_name... --convert_checkpoint false is performing conversion

Thanks for reporting! The `--convert_checkpoint false` flag is not for the `.safetensor` file conversion but for the `HF -> LitGPT` conversion. Some models on the HF Hub have `.bin` files...