DialoGPT icon indicating copy to clipboard operation
DialoGPT copied to clipboard

No file named train.tsv.gz

Open Megh-Thakkar opened this issue 5 years ago • 11 comments

Hi, Thank you for the implementation. While running the demo.py file, I encountered an error saying "No such file or directory". Can you help with the same?

TIA.

Megh-Thakkar avatar Jan 27 '20 06:01 Megh-Thakkar

+1

WolfNiu avatar Jan 29 '20 22:01 WolfNiu

Same here

wise-east avatar Jan 31 '20 02:01 wise-east

Can you post the command you use to run demo.py, and the result of "ls" command under data folder?

Thanks!

intersun avatar Feb 01 '20 01:02 intersun

python demo.py --data small

dummy_data.tsv prepare4db.sh train_raw.tsv train.tsv

WolfNiu avatar Feb 04 '20 23:02 WolfNiu

I ran python demo.py --data full and received the following error: `PROJECT_FOLDER = /root/Documents/gpt_test/DialoGPT Found existing models folder at /root/Documents/gpt_test/DialoGPT/models, skip creating a new one! 02/11/2020 23:25:22 - INFO - main - Downloading models... 02/11/2020 23:25:22 - INFO - demo_utils - /root/Documents/gpt_test/DialoGPT/models/small/config.json exists, return! 02/11/2020 23:25:22 - INFO - demo_utils - /root/Documents/gpt_test/DialoGPT/models/small/vocab.json exists, return! 02/11/2020 23:25:22 - INFO - demo_utils - /root/Documents/gpt_test/DialoGPT/models/small/merges.txt exists, return! 02/11/2020 23:25:22 - INFO - demo_utils - /root/Documents/gpt_test/DialoGPT/models/small/pytorch_model.bin exists, return! 02/11/2020 23:25:22 - INFO - demo_utils - /root/Documents/gpt_test/DialoGPT/models/small/small_ft.pkl exists, return! 02/11/2020 23:25:22 - INFO - main - Done!

02/11/2020 23:25:22 - INFO - main - Downloading and Extracting Data... error occurred, b'gzip: ./train.tsv.gz: No such file or directory\n' `

JingyaXun avatar Feb 11 '20 23:02 JingyaXun

Same here. Is there any solution? Thanks.

HouyuZhang1007 avatar Feb 29 '20 19:02 HouyuZhang1007

Before you run 'python demo.py --data small' or 'full', you could try to remove the files in data folder and just leave the original files (dummy_data.tsv prepare4db.sh train_raw.tsv) in data folder. This solves my problem.

guangyliu avatar Mar 11 '20 02:03 guangyliu

I have the same problem, however I just checked in that location and I'm noting seeing anything other than those 3 files. Any other ideas?

jpcookie avatar May 01 '20 23:05 jpcookie

Same problem here. Could you please help?

karthikgrama avatar Jul 27 '20 13:07 karthikgrama

Hi, make sure you setup the conda env according to README

git clone https://github.com/microsoft/DialoGPT.git
cd DialoGPT
conda env create -f LSP-linux.yml -n LSP
conda activate LSP

This command will install some necessary packages and tools (e.g. nltk) for preparing training data from demo.py. If the environment is incomplete, cd reddit_extractor; make -j 8; cd .. will fail without showing error messages in CMD, then occurs the error "No such file or directory" at cmd = 'gzip -d ./train.tsv.gz'

SeanCherngTW avatar Dec 14 '20 07:12 SeanCherngTW

Or you can make your own train.tsv.gz tar file. Use the tar command with the arguments to convert train.tsv -> train.tsv.gz in the data folder.

ZetiMente avatar Jan 27 '21 01:01 ZetiMente