OLMo
OLMo copied to clipboard
We can no longer run on a single CPU
We can't run in a debugger anymore.
What were you trying to debug? Sure, we could make the training run on a single CPU (or GPU) but that adds complexity and new code paths. E.g. we can't use FSDP for non-distributed training so we'd have to have a separate checkpointing mechanism.
Marking the items prior to Feb 29th as "closed".