NPTEL2020-Indian-English-Speech-Dataset

Need clarification on licensing.

Open: jeb-orcl opened this issue 3 years ago • 1 comment

It appears that this corpus is compiled from the "nptelhrd" playlist at https://www.youtube.com/playlist?list=UU640y4UvDAlya_WOj5U4pfA.

When you downloaded the videos, did you download all videos on the playlist or only the ones whose video description included the "Creative Commons Attribution license (reuse allowed)" link to the YouTube Creative Commons license page?

I am interested in this corpus, but the organization I work for will require that everything in the corpus allows reuse.

jeb-orcl, Jan 13 '22 18:01

Yes, all the videos are under a CC license, which allows reuse (with attribution). We sampled a good number of lecture videos (with subtitles) from the playlist to verify whether all of them are CC-licensed.

If you want to be doubly safe, we have also released metadata for all the audio, which includes a field called "license" that you can use to make sure you are only using CC videos.

GokulNC, Jan 14 '22 06:01
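
The license check suggested in the reply above could be sketched roughly as follows. This is only an illustration, not the dataset authors' tooling: the thread confirms a metadata field named "license", but the JSON layout, the `id` key, the sample license strings, and the helper names `is_cc_licensed` and `split_by_license` are all assumptions. Inspect the released metadata to see the actual format and values before relying on anything like this.

```python
import json

# Hypothetical sample records. Only the "license" field name comes from the
# thread; the "id" key and every value here are made up for illustration.
SAMPLE_METADATA = """
[
  {"id": "clip_0001", "license": "Creative Commons Attribution license (reuse allowed)"},
  {"id": "clip_0002", "license": "Standard YouTube License"},
  {"id": "clip_0003"}
]
"""


def is_cc_licensed(record, license_key="license"):
    """Return True only if the license field is a string mentioning Creative Commons."""
    value = record.get(license_key)
    return isinstance(value, str) and "creative commons" in value.lower()


def split_by_license(records):
    """Partition records into (allowed, rejected).

    Records with a missing or non-string license are rejected, so an
    incomplete metadata entry never slips through as reusable.
    """
    allowed = [r for r in records if is_cc_licensed(r)]
    rejected = [r for r in records if not is_cc_licensed(r)]
    return allowed, rejected


records = json.loads(SAMPLE_METADATA)
allowed, rejected = split_by_license(records)
print("allowed:", [r["id"] for r in allowed])
print("rejected:", [r["id"] for r in rejected])
```

Rejecting records with a missing license by default is a deliberately conservative choice for an organization that requires every item to permit reuse. The rejected list can then be reviewed by hand rather than silently dropped.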