Added Glove.6B files to the dataset resources
I tried to create the necessary YAML file for downloading the Glove.6B files. Please let me know if anything is missing.
Before adding this to DKPro Core, a few additional points probably need to be considered:
- Should it really be version 1.0.0 or rather a timestamp?
- We are not using the original format but a custom binary format, so maybe the ID should be "glove-dl4jw2v"?
- Would one dataset descriptor per file be better, to allow a more fine-grained download?
- Should we also add a timestamp/version number to the filenames on UKP public?
- Is UKP public the place where this data should stay?
Regarding the question of where to upload the data: Figshare (https://figshare.com/) can be considered as one option for sharing the dataset. They provide a DOI for the landing page of the dataset, but I'm not sure whether they provide a permanent link to the dataset file itself.
IMHO, CLARIN LINDAT or Zenodo would be more suitable options. Another option is to just download the original files from Stanford and do the conversion locally afterwards.
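To illustrate the local-conversion route: the original Stanford files are plain text, one token per line followed by its vector components separated by single spaces. Below is a minimal Python sketch of reading that format. The function name `read_glove_text` is hypothetical, it assumes tokens contain no spaces, and it does not cover writing the custom binary format used in this PR.

```python
from typing import Dict, Iterable, List


def read_glove_text(lines: Iterable[str]) -> Dict[str, List[float]]:
    """Parse GloVe text lines of the form "token v1 v2 ... vn".

    Hypothetical helper for illustration only; assumes tokens contain
    no spaces and that all vectors have the same dimensionality.
    """
    vectors: Dict[str, List[float]] = {}
    dim = None
    for line_no, line in enumerate(lines, start=1):
        line = line.rstrip("\r\n")
        if not line:
            continue  # skip empty lines
        # First field is the token, the rest are the vector components.
        token, *values = line.split(" ")
        vector = [float(v) for v in values]
        if dim is None:
            dim = len(vector)  # the first vector fixes the dimension
        elif len(vector) != dim:
            raise ValueError(
                f"line {line_no}: expected {dim} values, got {len(vector)}"
            )
        vectors[token] = vector
    return vectors


if __name__ == "__main__":
    # Tiny in-memory sample instead of a real Glove.6B file.
    sample = ["the 0.5 -0.25\n", "cat 1.0 2.0\n"]
    print(read_glove_text(sample))
```

For a real file, the function could be called with an open file handle (for example `open(path, encoding="utf-8")`), since it only iterates over lines.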
Can one of the admins verify this patch?
Not relevant anymore.