hello-ltr
hello-ltr copied to clipboard
msmarco downloads could get duplicated between solr and elastic
Right now the downloads go to a data/
folder that is a child of the respective search engine. This could cause two copies of the same large data files to be downloaded if users are switching between engines.
Should we move the msmarco data storage location up higher so it could be easily shared? Or check multiple locations for existance before downloading?