ScandEval
ScandEval copied to clipboard
[BENCHMARK DATASET REQUEST] NorGLM
Dataset name
NorGLM
Dataset link
https://github.com/Smartmedia-AI/NorGLM
Dataset languages
- [ ] Danish
- [ ] Swedish
- [X] Norwegian (Bokmål or Nynorsk)
- [ ] Icelandic
- [ ] Faroese
- [ ] German
- [ ] Dutch
- [X] English
Describe the dataset
This s a dataset on summarization under a CC BY-NC-SA 4.0 DEED published by Peng and Lemei at NorwAI in connection with their recent EMNLP publication.
Don't know much about this (and the overlap with other datasets, but it is large and it's good just to alert you guys to it.