grano
grano copied to clipboard
Data quality ranks
Wikidata seems to have three ranks of data. This seems useful to rate user-provided property values over scraper-generated input. In effect, the following data quality ranks should be considered:
- Known to be false / discredited (score 10)
- Information submitted by uncredited user (score 20)
- Information acquired by an automatic process (score 30)
- Information validated by an trusted user / editor (score 40)
- Information with court-level evidence (score 50)
The question now is: does this apply to properties only, or also to entities and relations directly? If it does apply to the main objects, would that mean a new column on the relevant tables?
Keen for feedback, @MiguelPaz, @jazzido.
I am a huge fan of Karma or Reputation points and I like that idea. My favorite model is Stackoverflow. They do 3 things really well: 1. Tell you in a very visible way their game mechanics model. 2. Lead you to a page where it's explained in a very step by step attractive way http://stackoverflow.com/about 3. Layout the points details and what do they mean and split them into categories http://stackoverflow.com/help/privileges. From VozData by @jazzido and Free The Files: the leaderboard of contributors, the call to action, the detail about ammount of documents released, etc. From Buzzfeed`s USER post and how they show you how "this post was created by a user and has not been vetted or endorsed by BuzzFeed's editorial staff..." http://www.buzzfeed.com/slibby/10-things-you-might-not-know-about-press-freedom-w-pa8k