devml
Product of Pragmatic AI Labs: Machine Learning, Statistics and Utilities around Developer Productivity, Company Productivity and Project Productivity
Installing devml locally or on a standalone Jupyter notebook works fine, but installation on a Spark executor fails. ```Waiting for a Spark session to start... Spark Initialization Done! ApplicationId =...
Love the idea of looking at the distribution of "abnormal commits": http://onlinelibrary.wiley.com/doi/10.1002/smr.1893/full
https://github.com/src-d/enry
A few interesting clustering ideas:
* Behavioral clustering: Active Ratio, Commits/Day
* Defect clustering: Files, Churn Ratio
* Identify similar developers
One pattern I see frequently is developers who check in thousands of lines of code from other projects or 3rd party libraries. Would be interesting to: A. Detect this B....
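One possible way to approach the detection step is to flag commits whose added-line count is far above what is typical for the repository. The sketch below is only an illustration: the function name `flag_bulk_commits`, the `factor` and `floor` thresholds, and the sample data are all hypothetical and are not part of devml.

```python
from statistics import median


def flag_bulk_commits(lines_added_by_commit, factor=20, floor=1000):
    """Return the commit ids whose added-line count looks like a bulk import.

    `factor` and `floor` are hypothetical thresholds chosen for illustration:
    a commit is flagged when it adds at least `floor` lines and at least
    `factor` times the median number of added lines.
    """
    if not lines_added_by_commit:
        return []
    typical = median(lines_added_by_commit.values())
    cutoff = max(floor, factor * typical)
    return sorted(
        sha for sha, added in lines_added_by_commit.items() if added >= cutoff
    )


# Made-up data: commit id -> lines added in that commit.
sample = {"a1": 12, "b2": 40, "c3": 7, "d4": 25, "e5": 18000}
flagged = flag_bulk_commits(sample)
```

A median is used instead of a mean so that the bulk commits themselves do not drag the baseline upward.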
Use JSON validation: https://pypi.python.org/pypi/jsonschema
Best to be safe with shell escaping; see, for example: https://docs.python.org/2/library/subprocess.html#frequently-used-arguments
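A minimal sketch of the safer pattern, assuming Python 3.7 or later (the linked page is for Python 2, where `subprocess.run` is not available). The helper `run_safely` and the hostile input string are hypothetical and only serve the illustration. Passing the command as a list with the default `shell=False` means no shell parses the arguments; `shlex.quote` is the fallback when a shell command string cannot be avoided.

```python
import shlex
import subprocess
import sys


def run_safely(args):
    """Run a command given as a list of arguments, without invoking a shell."""
    # With a list and shell=False (the default), each element reaches the
    # child process as one literal argument; ';' or '$' are not interpreted.
    completed = subprocess.run(args, capture_output=True, text=True, check=True)
    return completed.stdout


# Hypothetical untrusted input, e.g. a branch name taken from user data.
untrusted = "main; echo INJECTED"

# The Python interpreter stands in for an external tool so the example runs
# on any platform; the child simply prints back its first argument.
echoed = run_safely(
    [sys.executable, "-c", "import sys; print(sys.argv[1])", untrusted]
)

# If a shell string is unavoidable, quote every untrusted piece first.
quoted = shlex.quote(untrusted)
```

Here the child process receives the whole string as a single argument, so the `echo INJECTED` part is never executed as a command.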
Would be useful to have the ability to quantify lint information: e.g. Python code that has lint warnings and errors from Pylint, then put that as a column in a Pandas...
Git can map the different names and emails a user commits under to a single identity via a file; using .mailmap is one way to solve this.
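To illustrate what such a mapping does, here is a simplified sketch that resolves commit identities from mailmap-style text. It handles only the `Proper Name <proper@email> <commit@email>` line form and is not a full implementation of Git's format; the function names and sample addresses are hypothetical. In practice Git applies the file itself, for example through the `%aN` and `%aE` placeholders of `git log --format`.

```python
import re

# Matches only the "Proper Name <proper@email> <commit@email>" form.
_MAILMAP_LINE = re.compile(
    r"^(?P<name>[^<]+?)\s*<(?P<proper>[^>]+)>\s*<(?P<commit>[^>]+)>\s*$"
)


def parse_mailmap(text):
    """Build {commit email (lowercased): (canonical name, canonical email)}."""
    mapping = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        match = _MAILMAP_LINE.match(line)
        if match:
            key = match.group("commit").lower()
            mapping[key] = (match.group("name"), match.group("proper"))
    return mapping


def canonical_author(name, email, mapping):
    """Return the canonical (name, email), or the input pair if unmapped."""
    return mapping.get(email.lower(), (name, email))


# Made-up sample content for illustration.
SAMPLE = """
# canonical identity, then the identity seen in commits
Jane Doe <jane@example.com> <jdoe@laptop.local>
Jane Doe <jane@example.com> <JANE.DOE@old-company.example>
"""

mapping = parse_mailmap(SAMPLE)
resolved = canonical_author("jdoe", "jdoe@laptop.local", mapping)
```

Emails are compared in lowercase here so that differently cased spellings of the same address resolve to one developer.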
Looking for something that works in all languages.