clickbait-detector icon indicating copy to clipboard operation
clickbait-detector copied to clipboard

Use youtube data

Open olypros opened this issue 8 years ago • 3 comments

Can you scrap the youtube channels all video titles known for clickbaits example( leafy,reactions,progamer etc ) and non clickbait chanels (idk ask reddit) and finetune the model ?

olypros avatar Feb 07 '17 11:02 olypros

The model can definitely be fine tuned. You can use the Youtube data API to fetch titles of videos by particular channels (https://developers.google.com/youtube/v3/docs/videos/list). Similarly, Reddit has an API too (https://www.reddit.com/dev/api/) from which you can get post titles

saurabhmathur96 avatar Feb 10 '17 07:02 saurabhmathur96

Yes I know that it can be fine tuned. But I'm don't know any ML. So it would be great if you create an example script to fine tune model. You don't have to use api just a small example associative array (or dict) containing title and its respective label.

Thanks

On Feb 10, 2017 12:39 PM, "Saurabh Mathur" [email protected] wrote:

The model can definitely be fine tuned. You can use the Youtube data API to fetch titles of videos by particular channels ( https://developers.google.com/youtube/v3/docs/videos/list). Similarly, Reddit has an API too (https://www.reddit.com/dev/api/) from which you can get post titles

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/saurabhmathur96/clickbait-detector/issues/7#issuecomment-278873608, or mute the thread https://github.com/notifications/unsubscribe-auth/AIGa2PmjxDmEavX04ugmlfiG4dadmn5qks5rbA0ngaJpZM4L5ZiX .

olypros avatar Feb 10 '17 08:02 olypros

I had some hand-labelled data from reddit (/r/savedyouaclick and /r/news). So I made a notebook showing how to fine tune the model on that data. Find the notebook here. Hope this helps.

saurabhmathur96 avatar Feb 10 '17 16:02 saurabhmathur96