OLMo
Adding support for activation logging
🚀 The feature, motivation and pitch
No response
Alternatives
No response
Additional context
No response
Draft PR for this is here: https://github.com/allenai/LLM/pull/330
Right now it's about 40% slower with activation logging turned on (it is controlled by a config flag). I hope this can be optimized, e.g. by removing some `.item()` calls, each of which copies a GPU tensor to the CPU and forces a synchronization, but this is where it stands right now.
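To illustrate the kind of optimization meant here, below is a minimal sketch, not the code from the PR. It assumes PyTorch; `ActivationLogger`, its method names, the metric names, and the choice of statistic (mean absolute value) are all hypothetical. Instead of calling `.item()` once per logged value, it keeps the scalar statistics on the device and moves them to the CPU in a single transfer when flushed.

```python
import torch


class ActivationLogger:
    """Hypothetical helper: buffers activation statistics on-device.

    The slow pattern would be `activation.abs().mean().item()` on every call,
    which blocks on the GPU each time. Here the 0-dim result tensors are kept
    on the device and copied to the CPU together in `flush()`.
    """

    def __init__(self):
        self._names = []
        self._values = []

    def log(self, name, activation):
        # Assumed statistic: mean absolute activation, kept as a 0-dim tensor.
        stat = activation.detach().float().abs().mean()
        self._names.append(name)
        self._values.append(stat)

    def flush(self):
        # One device-to-host copy for all buffered statistics.
        if not self._values:
            return {}
        stacked = torch.stack(self._values).cpu()
        metrics = dict(zip(self._names, stacked.tolist()))
        self._names.clear()
        self._values.clear()
        return metrics


# Usage: runs on the GPU when available, otherwise on the CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"
logger = ActivationLogger()
x = torch.ones(2, 3, device=device)
logger.log("layer0/abs_mean", x)
logger.log("layer1/abs_mean", 2 * x)
metrics = logger.flush()
print(metrics)
```

Calling `flush()` once per logging interval rather than once per layer is what cuts down the number of synchronization points; whether that recovers much of the 40% would need to be measured.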
Marking the items prior to Feb 29th as "closed".