Open-Assistant icon indicating copy to clipboard operation
Open-Assistant copied to clipboard

Add multilangual subtitles datasets

Open sedthh opened this issue 1 year ago • 0 comments

  1. copy OpenSubtitles dataset to HF https://opus.nlpl.eu/OpenSubtitles-v2018.php

  2. optionall scrape more subtitles from different places as long as they are multilangual and their timestamps can be matched with other languages

sedthh avatar Feb 25 '23 16:02 sedthh