RepoSense
Author profile not found in deployed dashboard
The deployed dashboard doesn't show the profile for the author Eugene Peh. It works for all other authors.
The profile should open at this link, not at the wrong link that the dashboard currently uses.
Notice the addition of %20 in the latter URL.
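To illustrate where the %20 likely comes from: if a commit author name containing a space is used as the id segment of a profile URL, percent-encoding turns the space into %20. This is a minimal sketch, not RepoSense's actual code; the `profile_url` helper is hypothetical.

```python
from urllib.parse import quote


def profile_url(git_id: str) -> str:
    """Build a GitHub profile URL from an id (hypothetical helper)."""
    # quote() percent-encodes characters that are not URL-safe,
    # so a space becomes %20.
    return "https://github.com/" + quote(git_id)


print(profile_url("eugenepeh"))   # https://github.com/eugenepeh
print(profile_url("Eugene Peh"))  # https://github.com/Eugene%20Peh
```

The second URL does not point to a real profile, which matches the symptom described above.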
Maybe this particular author can be ignored using the repo-config.json in this case? I'm not sure about the exact cause, but it seems that the GitHub id generated for this particular author in the JSON file is incorrect.
Referring to: https://api.github.com/repos/reposense/testrepo-Alpha/commits
It seems like the backend uses commit.author.name to determine the gitId.
This is consistent with the fakeauthor's username which leads to a person who never contributed to reposense.
However, it should be using author.login instead.
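The difference between the two fields can be sketched as follows. The record below is made-up data shaped like one element of the commits API response linked above; the helper names are hypothetical, and no network call is made.

```python
from typing import Optional

# Illustrative record only; the email value is invented for this example.
sample_commit = {
    "commit": {"author": {"name": "Eugene Peh", "email": "eugene@example.com"}},
    "author": {"login": "eugenepeh"},
}


def git_author_name(item: dict) -> str:
    """Name recorded in the git commit itself (commit.author.name)."""
    return item["commit"]["author"]["name"]


def github_login(item: dict) -> Optional[str]:
    """GitHub account login (author.login), if GitHub linked one."""
    # The top-level "author" can be null when the commit email is not
    # associated with any GitHub account.
    author = item.get("author")
    return author["login"] if author else None


print(git_author_name(sample_commit))  # Eugene Peh
print(github_login(sample_commit))     # eugenepeh
```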
After some investigation, I think that with the way RepoSense currently works, it is impossible to extract the GitHub id if it differs from the author name.
Currently the gitId/author name is sourced from the git log command, which provides the author name used when committing. In the case that the GitHub id is different from the author name ('eugenepeh' and 'Eugene Peh', for example), git simply provides no information that allows us to get the correct GitHub id.
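As a rough illustration of what git log exposes, the sketch below parses output in the shape produced by `git log --pretty=format:"%an|%ae"` (author name and author email). The sample output and the `parse_authors` helper are assumptions for this example, not RepoSense's actual parsing code.

```python
from typing import List, Tuple

# Made-up output in the shape produced by: git log --pretty=format:"%an|%ae"
SAMPLE_LOG = "Eugene Peh|eugene@example.com\nfakeAuthor|fake@example.com"


def parse_authors(log_output: str) -> List[Tuple[str, str]]:
    """Return (author name, author email) pairs from the log output."""
    authors = []
    for line in log_output.splitlines():
        if not line.strip():
            continue
        # Split on the last "|" so a name containing "|" stays intact.
        name, _, email = line.rpartition("|")
        authors.append((name, email))
    return authors


for name, email in parse_authors(SAMPLE_LOG):
    print(name, email)
```

Only a name and an email come back; nothing in this output identifies the account on the hosting service.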
A possible solution is to request more information about the commit via the Github API as seen in gerhean's comment:
Referring to: https://api.github.com/repos/reposense/testrepo-Alpha/commits
This would require working with not just GitHub's API, but potentially GitLab's and Bitbucket's as well. (Another minor caveat is the need to differentiate between committer and author, but this shouldn't be too important.)
Another possibility is to add some sort of configuration file that maps author names to the correct GitHub id. This configuration file would then need to be set up for each repository to be analyzed whenever the author name differs from the GitHub id. While this is certainly cumbersome, I can see some use cases for this.
My understanding is that author-config.csv already provides a way to specify the mapping between a GitHub username and author names in the commits. This specific problem might be due to an error/omission in the author-config.csv.
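The mapping idea can be sketched as below. The CSV headers here (`github_id`, `git_author_names`) are simplified, hypothetical names chosen for illustration and are not the real author-config.csv format; the fallback to the raw author name is an assumption that mirrors the behaviour described in this thread.

```python
import csv
import io
from typing import Dict

# Hypothetical, simplified config; not the real author-config.csv layout.
SAMPLE_CONFIG = """github_id,git_author_names
eugenepeh,Eugene Peh;eugenepeh
"""


def load_mapping(text: str) -> Dict[str, str]:
    """Map every listed git author name to its GitHub id."""
    mapping = {}
    for row in csv.DictReader(io.StringIO(text)):
        for name in row["git_author_names"].split(";"):
            mapping[name.strip()] = row["github_id"]
    return mapping


def resolve_id(author_name: str, mapping: Dict[str, str]) -> str:
    """Use the configured id, falling back to the raw author name."""
    return mapping.get(author_name, author_name)


mapping = load_mapping(SAMPLE_CONFIG)
print(resolve_id("Eugene Peh", mapping))  # eugenepeh
print(resolve_id("Someone Else", {}))     # Someone Else (no entry, so the name is used as-is)
```

An author missing from the config falls through to the raw name, which would produce exactly the kind of broken profile link reported here.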
It seems that Eugene Peh is missing from the author-config.csv, so that's probably the issue.
What about using github api? Is there any form of consideration to move towards that solution?
What about using github api?
@clyveycui we prefer not to rely on the Github API, as it is vendor-specific, requires authentication, has rate limits, etc.