WenJett
Results
2
issues of
WenJett
Hi, Appreciate your work done so far. With the new release of OLMo 2, the tokenizer used seems to be **allenai_domla2.json** but in **prepare_memmap_dataset.py**, the tokenizer is **allenai/eleuther-ai-gpt-neox-20b-pii-special**. Understand that...
### ❓ The question Hi, I was unable to reopen the previous issue: https://github.com/allenai/OLMo/issues/790. Hence, creating another open issue and copying my response below. Hi Aman, Thanks for the guidance,...
type/question