training_results_v0.6 icon indicating copy to clipboard operation
training_results_v0.6 copied to clipboard

Update issues found in NVIDIA Transformer data processing scripts and make better instructions in README for data processing.

Open kevinstephano opened this issue 6 years ago • 3 comments

  1. The Transformer data processing scripts had logging imports that did not really exist and caused errors for users attempting to use the scripts.
  2. Updated the download path for the test dataset as the original path disappeared
  3. Made explicit instructions in the README.md on how to execute dataset downloading, tokenization, and conversion to the file format needed by pytorch.

kevinstephano avatar Nov 14 '19 00:11 kevinstephano

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

:memo: Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.


What to do if you already signed the CLA

Individual signers
Corporate signers

ℹ️ Googlers: Go here for more info.

googlebot avatar Nov 14 '19 00:11 googlebot

@googlebot I signed it!

kevinstephano avatar Nov 14 '19 00:11 kevinstephano

CLAs look good, thanks!

ℹ️ Googlers: Go here for more info.

googlebot avatar Nov 14 '19 00:11 googlebot