SimCSE icon indicating copy to clipboard operation
SimCSE copied to clipboard

wget: Command not found

Open sundavid2002 opened this issue 3 years ago • 3 comments

I'm trying to train SimCSE in a virtual environment (w/ python 3.7.0) I tried to run the download_wiki.sh file, but I got an error message saying "/Users/davidsun/Downloads/SimCSE/data/download_wiki.sh: line 1: wget: command not found" This error persisted even after I installed wget using pip. What should I do?

sundavid2002 avatar Aug 04 '22 12:08 sundavid2002

wget is a Linux command, you may use “sudo apt-get install wget” (for ubuntu and debian)or "sudo yum -y install wget" (for centos) to install wget for your operation system

dchenhe avatar Aug 05 '22 17:08 dchenhe

I'm sorry, but I don't have access to a Linux computer on which I have sudo permissions. Can this repo be run on a non-Linux OS?

sundavid2002 avatar Aug 07 '22 23:08 sundavid2002

Actually, you can just download the wiki dataset by yourself, instead of using wget command. if you don't have linux os, i recommend you to use colab to run the repo, google offers you an simple Linux-os platform and some free gpu computational resources. Hope that will work :)

dchenhe avatar Aug 08 '22 04:08 dchenhe

Hi,

This repo is only tested in a linux environment and it seems that you are using a Mac environment. The downloading command can be executed manually (see download_wiki.sh for the dataset link).

gaotianyu1350 avatar Aug 29 '22 13:08 gaotianyu1350