DialoGPT
No file named train.tsv.gz
Hi, Thank you for the implementation. While running the demo.py file, I encountered an error saying "No such file or directory". Can you help with the same?
TIA.
+1
Same here
Can you post the command you use to run demo.py, and the result of "ls" command under data folder?
Thanks!
python demo.py --data small
dummy_data.tsv prepare4db.sh train_raw.tsv train.tsv
I ran python demo.py --data full
and received the following error:
`PROJECT_FOLDER = /root/Documents/gpt_test/DialoGPT
Found existing models folder at /root/Documents/gpt_test/DialoGPT/models, skip creating a new one!
02/11/2020 23:25:22 - INFO - main - Downloading models...
02/11/2020 23:25:22 - INFO - demo_utils - /root/Documents/gpt_test/DialoGPT/models/small/config.json exists, return!
02/11/2020 23:25:22 - INFO - demo_utils - /root/Documents/gpt_test/DialoGPT/models/small/vocab.json exists, return!
02/11/2020 23:25:22 - INFO - demo_utils - /root/Documents/gpt_test/DialoGPT/models/small/merges.txt exists, return!
02/11/2020 23:25:22 - INFO - demo_utils - /root/Documents/gpt_test/DialoGPT/models/small/pytorch_model.bin exists, return!
02/11/2020 23:25:22 - INFO - demo_utils - /root/Documents/gpt_test/DialoGPT/models/small/small_ft.pkl exists, return!
02/11/2020 23:25:22 - INFO - main - Done!
02/11/2020 23:25:22 - INFO - main - Downloading and Extracting Data... error occurred, b'gzip: ./train.tsv.gz: No such file or directory\n' `
Same here. Is there any solution? Thanks.
Before you run 'python demo.py --data small' or 'full', you could try to remove the files in data folder and just leave the original files (dummy_data.tsv prepare4db.sh train_raw.tsv) in data folder. This solves my problem.
I have the same problem. However, I just checked that location and I'm not seeing anything other than those 3 files. Any other ideas?
Same problem here. Could you please help?
Hi, make sure you set up the conda env according to the README:
git clone https://github.com/microsoft/DialoGPT.git
cd DialoGPT
conda env create -f LSP-linux.yml -n LSP
conda activate LSP
This command will install some necessary packages and tools (e.g. nltk) for preparing training data from demo.py.
If the environment is incomplete, the step cd reddit_extractor; make -j 8; cd .. will fail without showing any error message in the terminal, and the "No such file or directory" error then occurs later at cmd = 'gzip -d ./train.tsv.gz'
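To see why the build step fails instead of only hitting the later gzip error, you can run it yourself with the exit code checked. This is only a minimal sketch: run_step is a hypothetical helper, not something in demo.py, and the reddit_extractor path and make -j 8 command are taken from the line above.

```python
import subprocess
import sys


def run_step(cmd, cwd=None):
    """Run a command and raise with its stderr if it exits non-zero.

    Hypothetical helper for debugging; not part of the DialoGPT repo.
    """
    result = subprocess.run(cmd, cwd=cwd, capture_output=True, text=True)
    if result.returncode != 0:
        raise RuntimeError(
            f"{' '.join(cmd)} failed with exit code {result.returncode}:\n"
            f"{result.stderr}"
        )
    return result.stdout


# Example usage from the DialoGPT folder (assumes make is installed):
# print(run_step(["make", "-j", "8"], cwd="reddit_extractor"))
```

If the conda env is missing something, the RuntimeError message should show the actual make error.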
Or you can make your own train.tsv.gz file. Since demo.py unpacks it with gzip -d, compress with gzip rather than tar: running gzip -c train.tsv > train.tsv.gz in the data folder converts train.tsv -> train.tsv.gz.
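If you prefer to do this from Python, here is a rough sketch using the standard gzip module. Note that a .gz file is a plain gzip stream, which is what gzip -d expects. gzip_file is a hypothetical helper name, and I have not checked whether demo.py picks up the file without further changes.

```python
import gzip
import shutil
from pathlib import Path


def gzip_file(src, dst=None):
    """Compress src into a gzip file and return the output path.

    By default the output is src with ".gz" appended,
    e.g. train.tsv -> train.tsv.gz. The source file is left in place.
    """
    src = Path(src)
    dst = Path(dst) if dst else src.with_name(src.name + ".gz")
    with open(src, "rb") as f_in, gzip.open(dst, "wb") as f_out:
        shutil.copyfileobj(f_in, f_out)
    return dst


# Example usage from the DialoGPT folder:
# gzip_file("data/train.tsv")
```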