yellowbrick icon indicating copy to clipboard operation
yellowbrick copied to clipboard

MostInformativeFeatures visualizer

Open majidaldo opened this issue 7 years ago • 4 comments

Is your feature request related to a problem? Please describe. This feature is motivated by the discussion in #510. The problem is how to visualize feature importance for multiple classes and/or instances. This requires solving two problems: 1. selecting the "most informative features" and 2. producing an appropriate visualization.

Describe the solution you'd like MIFV = MostInformativeFeatures MIFV().predict(X) would give a viz for the features most responsible for the prediction MIFV().poof()/fit(X) would show a viz for the features

For both cases, the proposed visual is a heat map where one axis is class labels or data and the other axis represents features.

Questions/Issues:

  • The estimators would have to (mathematically) support such a notion of feature importance.
  • Top 10 features doesn't make sense if all the features are similar in strength.
  • Does it make sense to numerically compare feature strengths over multiple classes?

majidaldo avatar Nov 28 '18 03:11 majidaldo

@majidaldo thanks for creating this issue and following / linking with other discussions.

I look forward to the community thinking through the questions and issues on this. =)

ndanielsen avatar Nov 29 '18 18:11 ndanielsen

Going to go ahead and close this one out since it's gone a bit stale; to anyone with interest/bandwidth — feel free to reopen!

rebeccabilbro avatar Jun 15 '20 02:06 rebeccabilbro

@rebeccabilbro this is one that is very interesting to me, perhaps I will take a crack at it later in the semester.

bbengfort avatar Jun 20 '20 18:06 bbengfort

That would be great @bbengfort, reopening!

rebeccabilbro avatar Jun 20 '20 19:06 rebeccabilbro