Sebastian Raschka
Sebastian Raschka
Hm, that's weird. Not sure why this is happening. Did you install LitGPT with the `pip` development mode (`-e`) so updates are reflected in general? ``` pip install -e ".[all]"...
Arg, sorry ... it's Friday afternoon and my brain is probably already in weekend mode. Actually, the train.save_interval is not based on max tokens but on steps. So it's probably...
If the microbatch size is equal to the global batch size, I think it should be the following relationship: max tokens = max_steps * batch_size * max_seq_length (I think that's...
Thanks for opening the discussion. Regarding issue 1, I think you may have an older version of the book as this issue has been fixed in a reprint back in...
Great point. I think it's best to switch to .iloc here
Unfortunately, multi-GPU inference is not supported yet, but that's something on the roadmap.
Hi there, just wanted to say thanks for taking on this PR (I know this is a lot of work)! The OLMo models are awesome, and I'd be great to...
This is nice, thanks! The report is ``` Files already downloaded and verified Found 18 distinct operations, of which 15 (83.3%) are supported Please file an issue requesting the following...
This sounds totally reasonable, please feel free to break it up into these three. Re first issue: Not sure if that's feasible, but perhaps even automatically calling `examine` upon failure...