2017-new-coder-survey icon indicating copy to clipboard operation
2017-new-coder-survey copied to clipboard

GenderOther Variable Compromised

Open lama-ahmad opened this issue 6 years ago • 1 comments

There seems to be a problem with the GenderOther variable. When I loaded the dataset in R, I wanted to convert the GenderOther variable to a factor and got this:

51 Levels: 54 a abcdefghijklmnopqrstuvwxyz apache attack helicopter Apache attack helicopter ... why the hell are here more than two genders?!

lama-ahmad avatar Dec 02 '18 12:12 lama-ahmad

Yes, unfortunately, this has some irrelevant values. This was not ideal, but I didn't have time to flush out these values. I'm currently working on the 2018 dataset and there appears to be the same issue.

If you happen to write some code to help clean this part up, feel free to share it here for others to use as well until there is fix on this data. Thanks!

erictleung avatar Dec 02 '18 20:12 erictleung