Dataproofer icon indicating copy to clipboard operation
Dataproofer copied to clipboard

Test: Name consistency

Open newsroomdev opened this issue 9 years ago • 2 comments

Please read how to create a new test if you're interested in writing this test.

Does your data have Middle Eastern or East Asian names in it? Are you sure the surnames are always in the same place? Is it possible anyone in your dataset uses a mononym? These are the sorts of things that data creators habitually get wrong. If you're working with a list of ethnically diverse names—which is any list of names—then you should do at least a cursory review before assuming that joining the first_name and last_name columns will give you something that is appropriate to publish. -Quartz Bad Data Guide

If a column is designated automatically or by the user as a name column, provide a brief description of why missing cells in a name column is potentially bad.

newsroomdev avatar Jan 13 '16 18:01 newsroomdev

I think that this one might be hard to automate- @geraldarthur you okay to kill for now?

ejfox avatar Mar 17 '16 17:03 ejfox

Let's save this one for a rainy day or maybe a hackathon. It's an interesting data smell that's definitely testable, but not one we have the time to really work on. Happy to take pull requests from anyone on this.

newsroomdev avatar Mar 25 '16 18:03 newsroomdev