grano icon indicating copy to clipboard operation
grano copied to clipboard

Data quality ranks

Open pudo opened this issue 11 years ago • 2 comments

Wikidata seems to have three ranks of data. This seems useful to rate user-provided property values over scraper-generated input. In effect, the following data quality ranks should be considered:

  • Known to be false / discredited (score 10)
  • Information submitted by uncredited user (score 20)
  • Information acquired by an automatic process (score 30)
  • Information validated by an trusted user / editor (score 40)
  • Information with court-level evidence (score 50)

The question now is: does this apply to properties only, or also to entities and relations directly? If it does apply to the main objects, would that mean a new column on the relevant tables?

pudo avatar Mar 25 '14 10:03 pudo

Keen for feedback, @MiguelPaz, @jazzido.

pudo avatar Apr 14 '14 14:04 pudo

I am a huge fan of Karma or Reputation points and I like that idea. My favorite model is Stackoverflow. They do 3 things really well: 1. Tell you in a very visible way their game mechanics model. 2. Lead you to a page where it's explained in a very step by step attractive way http://stackoverflow.com/about 3. Layout the points details and what do they mean and split them into categories http://stackoverflow.com/help/privileges. From VozData by @jazzido and Free The Files: the leaderboard of contributors, the call to action, the detail about ammount of documents released, etc. From Buzzfeed`s USER post and how they show you how "this post was created by a user and has not been vetted or endorsed by BuzzFeed's editorial staff..." http://www.buzzfeed.com/slibby/10-things-you-might-not-know-about-press-freedom-w-pa8k

miguelpaz avatar May 08 '14 00:05 miguelpaz