Letterbox Data source

Open VictorLG98 opened this issue 2 years ago • 2 comments

I was wondering whether Letterboxd could be a good source for creating a Q/A dataset. My idea is to scrape https://letterboxd.com/films/popular/ and build pairs like this: Q: Can you make a brief summary about the film {film}? A: In his second year of fighting crime, Batman uncovers corruption in Gotham City...

Let me know if this is a good idea and what steps I should follow.
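
For illustration, the Q/A pair format proposed above could be sketched as follows. The question template is the one from the proposal; the field names `question` and `answer`, the helper names, and the JSON Lines output are assumptions made for this sketch, not a format the project prescribes.

```python
import json

# Question template taken from the proposal; {film} is filled in per title.
QUESTION_TEMPLATE = "Can you make a brief summary about the film {film}?"


def make_qa_pair(film: str, summary: str) -> dict:
    """Build one question/answer record (hypothetical field names)."""
    return {
        "question": QUESTION_TEMPLATE.format(film=film),
        "answer": summary.strip(),
    }


def to_jsonl(pairs) -> str:
    """Serialize records as JSON Lines, one record per line."""
    return "\n".join(json.dumps(p, ensure_ascii=False) for p in pairs)


if __name__ == "__main__":
    pair = make_qa_pair(
        "The Batman",
        "In his second year of fighting crime, Batman uncovers corruption in Gotham City...",
    )
    print(to_jsonl([pair]))
```

Keeping the template in one constant makes it easy to add further question phrasings later without touching the record-building code.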

VictorLG98, Feb 18 '23 12:02

It looks like they use themoviedb for their data, and its terms explicitly forbid derived works:

https://www.themoviedb.org/terms-of-use

But you could get the popular film names, look them up on Wikipedia, and use Wikipedia's data, which is shared under an open license.
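
A minimal sketch of the Wikipedia lookup step, assuming the English Wikipedia REST summary endpoint (`/api/rest_v1/page/summary/{title}`) and its `extract` and `type` response fields. The helper names and the User-Agent string are hypothetical, and the sketch assumes you already have the exact article title, which for films often carries a suffix such as "(film)".

```python
import json
import urllib.parse
import urllib.request
from typing import Optional

# Assumed endpoint: English Wikipedia REST API page summary.
API = "https://en.wikipedia.org/api/rest_v1/page/summary/"


def summary_url(title: str) -> str:
    """Build the summary URL for an article title (spaces become underscores)."""
    return API + urllib.parse.quote(title.replace(" ", "_"), safe="")


def extract_summary(payload: dict) -> Optional[str]:
    """Return the plain-text extract, or None for disambiguation or empty pages."""
    if payload.get("type") == "disambiguation":
        return None
    text = payload.get("extract", "").strip()
    return text or None


def fetch_summary(title: str,
                  user_agent: str = "film-qa-sketch/0.1 (example)") -> Optional[str]:
    """Fetch one summary over the network; raises urllib.error.HTTPError on 404."""
    req = urllib.request.Request(summary_url(title),
                                 headers={"User-Agent": user_agent})
    with urllib.request.urlopen(req, timeout=10) as resp:
        return extract_summary(json.load(resp))


if __name__ == "__main__":
    print(fetch_summary("The Batman (film)"))
```

Wikipedia text is licensed under CC BY-SA, so a dataset built this way would need to carry attribution and the same license terms.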

bitplane, Feb 19 '23 00:02

Ok nice, you can assign it to me!

VictorLG98, Feb 23 '23 10:02