OLMo
Llama configuration that uses a default layer norm instead of RMSNorm, for performance.
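
To make the distinction concrete, here is a minimal sketch of the two normalizations in plain Python. The function names `layer_norm` and `rms_norm` are illustrative and not part of this library, and the sketch omits any learned scale or bias; whether this configuration's layer norm carries such parameters is not stated here.

```python
import math


def layer_norm(x, eps=1e-5):
    # Layer norm: subtract the mean, then divide by the standard deviation.
    # Hypothetical helper; learned scale/bias are left out (an assumption).
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def rms_norm(x, eps=1e-5):
    # RMSNorm: divide by the root mean square, with no mean subtraction.
    # Hypothetical helper; the learned scale is left out (an assumption).
    mean_sq = sum(v * v for v in x) / len(x)
    return [v / math.sqrt(mean_sq + eps) for v in x]


if __name__ == "__main__":
    hidden = [1.0, 2.0, 3.0, 4.0]
    print("layer norm:", layer_norm(hidden))
    print("rms norm:  ", rms_norm(hidden))
```

The only structural difference is the centering step: layer norm produces outputs with zero mean, while RMSNorm only rescales the input.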