James Chen

Results 2 issues of James Chen

Right now, data loading and loss computation assume one is only doing LM pretraining, but it'd be useful to support packed SFT style datasets (i.e. datasets with cleanly delineated prompt/completion...

feature request

Running autoformatting via `pyink` via the `code_style.sh` script causes a lot of files to be reformatted and introduces noise in the commits. Is it possible for `code_style.sh` to be run...