facets icon indicating copy to clipboard operation
facets copied to clipboard

Comparing features between multiple dataset points

Open kmkolasinski opened this issue 8 years ago • 2 comments
trafficstars

Hi, I've found Facets to be a very interesting visualization tool, however I'm wondering if it is possible to compare features between two selected examples (or even more)? I know that it's not an issue, but rather a question about capabilities of this software.

For example: consider a csv file with power consumption generated by different houses, and there is a pair of two houses which have similar properties, but completely different levels of power consumption. It would be nice to visually compare features of both, and try to understand why they differ so much. Having tabular view of differences between those samples would be the easiest way (in my opinion). Are you going to extend this feature somehow? Thank you in advance.

kmkolasinski avatar Jul 19 '17 14:07 kmkolasinski

Assuming you are working in a jupyter notebook, currently the best way to do that might be to take the two points you want to compare, put them each in their own pandas dataframe (each containing only a single point) and show them both in a single facets-overview visualization, like the overview demo jupyter notebook does for the UCI test and train datasets.

This isn't ideal, but might allow you to compare them feature by feature in a reasonable way.

In the future, I could imagine us making this easier to do in the dive visualization by selecting two points and directly comparing them.

jameswex avatar Jul 19 '17 15:07 jameswex

Thank you for answer. I was thinking about it, but it would be quite inconvenient, assuming that I would like to take a look into many pairs. Having this features integrated with mouse button would be much easier and I believe that would be also useful for other users.

kmkolasinski avatar Jul 19 '17 15:07 kmkolasinski