ScandEval

[BENCHMARK DATASET REQUEST] NorGLM

Open larsbun opened this issue 1 year ago • 0 comments

Dataset name

NorGLM

Dataset link

https://github.com/Smartmedia-AI/NorGLM

Dataset languages

  • [ ] Danish
  • [ ] Swedish
  • [X] Norwegian (Bokmål or Nynorsk)
  • [ ] Icelandic
  • [ ] Faroese
  • [ ] German
  • [ ] Dutch
  • [X] English

Describe the dataset

This is a summarization dataset released under a CC BY-NC-SA 4.0 license, published by Peng and Lemei at NorwAI in connection with their recent EMNLP publication.

I don't know much about it (or about its overlap with other datasets), but it is large, and it seemed worth alerting you guys to it.

larsbun, Sep 29 '24 09:09