OSCI icon indicating copy to clipboard operation
OSCI copied to clipboard

Report issues - data does not add up

Open jerpelea opened this issue 2 years ago • 4 comments

I found 2 issues on the local generated reports:

people appear in the contributor ranking report but it are missing from the repository commits EX: cat OSCI_Contributors_ranking_YTD_2022-01-31.csv | grep enderborg Sony,peter enderborg,xxxxxxxxxxxxx@xxxxxxxxxx,57

cat Company-contributors-repository-commits_YTD_2022-01-31.csv | grep enderborg returns nothing

Do you have any idea why some persons are missing ?

In the same report my contributions are counted separate for the same email address cat OSCI_Contributors_ranking_YTD_2022-01-31.csv | grep Alin Sony,Alin Jerpelea,xxxxxxxxxxxxx@xxxxxxxxxx,90 Sony,Alin,xxxxxxxxxxxxx@xxxxxxxxxx,56 (the email address is the same)

Thanks

jerpelea avatar Aug 22 '22 07:08 jerpelea

@vlad-isayko can you please help

jerpelea avatar Aug 22 '22 07:08 jerpelea

@vlad-isayko bump

jerpelea avatar Oct 04 '22 08:10 jerpelea

@jerpelea Hello, sorry for the long reply. As for the different data in the reports, then the incorrect name Company-contributors-repository-commits_YTD_2022-01-31.csv has misled you. This report collects data for a specific day, not since the beginning of the year https://github.com/epam/OSCI/blob/979b6cbfd00af8ee3e16d63ec3beaa27e40ddc9a/osci/transformers/company_contributors_repository_commits.py#L57

And in the case of duplicates, the problem is that in our business logic both the name and the e-mail are the aggregation keys. I think for a discussion on this topic, we'd better call @cm-howard

vlad-isayko avatar Nov 25 '22 09:11 vlad-isayko

@vlad-isayko thanks for sharing the info

I would like to know more about the email bug

jerpelea avatar Nov 25 '22 11:11 jerpelea