James Chen
Results: 2 issues of James Chen
Right now, data loading and loss computation assume LM pretraining only, but it'd be useful to support packed SFT-style datasets (i.e. datasets with cleanly delineated prompt/completion...
feature request
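For illustration only, here is a minimal sketch of what "packed SFT-style" data could look like, assuming each example is a (prompt, completion) pair of token ID lists. The function name `pack_sft_examples`, the pad ID of 0, and the mask and segment conventions are all hypothetical and are not taken from the repository the issue refers to. The idea is that several examples are concatenated into one fixed-length sequence, a loss mask marks only completion tokens as contributing to the loss, and segment IDs record which example each token came from.

```python
from typing import List, Sequence, Tuple

PAD_ID = 0  # assumed padding token ID (hypothetical)


def pack_sft_examples(
    examples: Sequence[Tuple[Sequence[int], Sequence[int]]],
    max_len: int,
) -> Tuple[List[int], List[int], List[int]]:
    """Pack (prompt, completion) pairs into one sequence of length max_len.

    Returns (tokens, loss_mask, segment_ids):
      - loss_mask is 1 on completion tokens, 0 on prompt and padding tokens
      - segment_ids number the packed examples from 1; padding gets 0
    """
    tokens: List[int] = []
    loss_mask: List[int] = []
    segment_ids: List[int] = []

    for seg, (prompt, completion) in enumerate(examples, start=1):
        n = len(prompt) + len(completion)
        # Stop at the first example that no longer fits in the sequence.
        if len(tokens) + n > max_len:
            break
        tokens += list(prompt) + list(completion)
        loss_mask += [0] * len(prompt) + [1] * len(completion)
        segment_ids += [seg] * n

    # Pad the remainder; padding never contributes to the loss.
    pad = max_len - len(tokens)
    tokens += [PAD_ID] * pad
    loss_mask += [0] * pad
    segment_ids += [0] * pad
    return tokens, loss_mask, segment_ids


tokens, loss_mask, segment_ids = pack_sft_examples(
    [([1, 2], [3]), ([4], [5, 6])], max_len=8
)
```

In a training loop, the per-token loss would then be multiplied by `loss_mask` before averaging, and `segment_ids` could be used to keep attention from crossing example boundaries. Both are common conventions, not something the issue text confirms.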
Running autoformatting with `pyink` via the `code_style.sh` script reformats a lot of files and introduces noise in the commits. Is it possible for `code_style.sh` to be run...