project-ideas
project-ideas copied to clipboard
Data-requester (tbc?)
Data-requester (tbc?)
A suite to collect user data from social networks with their consent
Introduction
Data-requester is a tool/self-hosted platform to request social networks' consent to gather their data into datasets.
Why ?
Prior to the GDPR and similar regulations (and even afterwards), datasets have been collected without user consent on online platforms. The advent of Deep Learning and its needs for ever bigger datasets exacerbated the issue. Social networks are an ideal place to collect data, but this has to be done the right way, with user consent and understanding on how the data is expected to be used.
Description
The goal of the project is to build a series of tool including an easily self-hostable web platform that makes it possible to collect user data with their consent.
Use case
For the generation of a dataset of people with 2-4 faces per person. A particular hashtag on Twitter contains tweets with relevant images that should be collected. Data-requester should provide:
- A way to download possibly relevant tweets (based on the hashtag here) and their media (images here)
- A way to filter the tweets and images
- A way to deploy a webpage on which the twitter user would give their consent for their data to be made available in the dataset. The webpage should make it possible to collect custom informations (e.g. adding a form).
- A way to contact twitter users to request their consent (e.g. direct message)
- A way to generate the dataset, while taking into consideration people that gave their consent or took it back later
License
This project is ongoing here : https://github.com/osscameroon/data-requester
Hey, the link to the repo print the 404 error page