ncaahoopR_data icon indicating copy to clipboard operation
ncaahoopR_data copied to clipboard

positions seem to have errors

Open nickwan opened this issue 4 years ago • 3 comments

I loaded these data into pandas and when i ran a unique over position i got some positions that were not "F", "C", and "G"

Here's each of the labels in position and the value counts (I'm only pulling in data from the 2013-14 season to present)

position value count
G 145540
F 139096
C 44783
e 117
y 44
d 36
r 33
s 30
n 16
l 8
h 1
k 1
a 1
g 1
i 1
. 1
t 1

nickwan avatar Mar 02 '20 01:03 nickwan

Thanks for this. I pull rosters from before this season from http://www.barttorvik.com and I had seen a few strange positions. I think going forward I will assign anything that isn't G, F, C or some combination of those (F/C, G/F) to NA given the relative infrequency of them.

lbenz730 avatar Mar 02 '20 01:03 lbenz730

I don't think this data will be be re-scraped before the end of the season so I'd suggest doing the same if positions are important to you in the mean time.

lbenz730 avatar Mar 02 '20 01:03 lbenz730

okay, it's not a big deal for me; if you'd like to close this issue, feel free but if it'll remind you to rerun that stuff later then feel free to keep it open. thanks!

nickwan avatar Mar 02 '20 01:03 nickwan