RedPajama-Data
Fix some errors in the Makefile for LM preparation
- Install sentencepiece from the GitHub repo. I cannot run the .zip version on macOS.
- Create the necessary directories during make.
- Skip downloading the wiki json.gz if it has already been downloaded.
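
For reviewers, the directory-creation and caching behavior in the last two items can be sketched roughly as follows. This is only an illustration in Python, not the Makefile code from this PR; `ensure_cached` and the `fetch` callback are hypothetical names, and the real download command is left out.

```python
from pathlib import Path
from typing import Callable


def ensure_cached(target: Path, fetch: Callable[[Path], None]) -> bool:
    """Create the parent directory and fetch `target` only if it is missing.

    Returns True if `fetch` was called, False if the existing file was reused.
    `fetch` is a hypothetical stand-in for whatever downloads the file.
    """
    # Same effect as `mkdir -p`: no error if the directory already exists.
    target.parent.mkdir(parents=True, exist_ok=True)
    if target.exists():
        return False  # cached copy found, skip the download
    fetch(target)
    return True
```

Calling `ensure_cached` twice with the same path should invoke `fetch` only the first time, which is the behavior the caching item aims for.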
@mauriceweber could you review this PR?
Hi @feifeibear, thanks a lot for your PR! I will look into it :)