Improve feature selection process

Open zurk opened this issue 6 years ago • 1 comments

Context: https://github.com/src-d/style-analyzer/issues/595#issuecomment-466353578

Things we want to try:

  • [ ] Preselect features by hand
  • [ ] Add feature selection to GridSearch
  • [ ] Try feature agglomeration
  • [ ] Give Featuretools a try: https://www.featuretools.com/
  • [ ] Run feature selection only once per repo (pin the selection for subsequent runs)
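
To make the GridSearch and pinning items concrete, here is a minimal sketch, assuming a scikit-learn setup. The synthetic data, the step names `select` and `model`, the random forest, and the candidate values of `k` are all illustrative assumptions, not the project's actual configuration.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline

# Synthetic stand-in for a per-repo feature matrix (assumption).
X, y = make_classification(
    n_samples=200, n_features=40, n_informative=8, random_state=0
)

# Putting the selector inside the pipeline lets the grid search tune
# the number of kept features together with the model parameters.
pipeline = Pipeline([
    ("select", SelectKBest(score_func=f_classif)),
    ("model", RandomForestClassifier(n_estimators=20, random_state=0)),
])
param_grid = {
    "select__k": [5, 10, 20],   # hypothetical candidate sizes
    "model__max_depth": [3, 5],
}
search = GridSearchCV(pipeline, param_grid, cv=3)
search.fit(X, y)

best_k = search.best_params_["select__k"]
# Boolean mask over the original columns; storing it is one way to
# "pin" the selection so later runs can skip the selection step.
pinned_mask = search.best_estimator_.named_steps["select"].get_support()
```

For the agglomeration item, `sklearn.cluster.FeatureAgglomeration` could presumably take the place of the `select` step, with `n_clusters` tuned in the same grid.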

zurk • Feb 22 '19 11:02

For the last point, I was thinking more of selecting over a set of training repos (for example, taking the 500 features selected most often across all the per-repo selections), but it might indeed also be good to try the per-repo version.
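
A minimal sketch of that aggregation, assuming each repo's selection is available as a list of feature names. The helper `top_selected_features`, the repo names, and the feature names are hypothetical.

```python
from collections import Counter
from typing import Iterable, List


def top_selected_features(
    selections: Iterable[Iterable[str]], top_n: int = 500
) -> List[str]:
    """Return the features selected by the most repos (hypothetical helper)."""
    counts: Counter = Counter()
    for selected in selections:
        # A feature counts at most once per repo.
        counts.update(set(selected))
    # Sort by descending count, then by name so ties resolve deterministically.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [name for name, _ in ranked[:top_n]]


# Toy per-repo selections (made-up names, for illustration only).
per_repo = {
    "repo_a": ["f1", "f2", "f3"],
    "repo_b": ["f2", "f3", "f4"],
    "repo_c": ["f3", "f5"],
}
pinned = top_selected_features(per_repo.values(), top_n=2)
print(pinned)  # ['f3', 'f2']
```

The resulting list can then be reused as a fixed feature set for new repos, instead of rerunning selection on each one.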

m09 • Feb 22 '19 11:02