ncaahoopR_data
ncaahoopR_data copied to clipboard
positions seem to have errors
I loaded these data into pandas and when i ran a unique over position
i got some positions that were not "F", "C", and "G"
Here's each of the labels in position
and the value counts (I'm only pulling in data from the 2013-14 season to present)
position | value count |
---|---|
G | 145540 |
F | 139096 |
C | 44783 |
e | 117 |
y | 44 |
d | 36 |
r | 33 |
s | 30 |
n | 16 |
l | 8 |
h | 1 |
k | 1 |
a | 1 |
g | 1 |
i | 1 |
. | 1 |
t | 1 |
Thanks for this. I pull rosters from before this season from http://www.barttorvik.com and I had seen a few strange positions. I think going forward I will assign anything that isn't G, F, C or some combination of those (F/C, G/F) to NA
given the relative infrequency of them.
I don't think this data will be be re-scraped before the end of the season so I'd suggest doing the same if positions are important to you in the mean time.
okay, it's not a big deal for me; if you'd like to close this issue, feel free but if it'll remind you to rerun that stuff later then feel free to keep it open. thanks!