
Add support for the creation of histograms

Open andres-moreno opened this issue 3 years ago • 0 comments

Creating histograms is a very common activity. Geni offers cut, which supports creating histograms from bins supplied as an array of values, but the user has to compute these bins manually.

Geni provides qcut to help users determine how wide each bin should be.

It would be helpful to provide a function, (g/histogram :column {:n-bins :bins-vector}), that would either compute the bins automatically when given an :n-bins parameter or compute the histogram from a supplied :bins-vector.
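To make the two modes concrete, here is a minimal sketch in plain Clojure. It works on an in-memory collection of numbers rather than a Geni dataset column, and every name in it (equal-width-edges, bin-index, histogram, the :lower/:upper/:count keys) is hypothetical. It does not use or describe any existing Geni function, and the choice of equal-width, half-open bins is an assumption rather than something the proposal specifies.

```clojure
;; Hypothetical sketch only: none of these functions exist in Geni.

(defn equal-width-edges
  "Returns n-bins + 1 evenly spaced bin edges spanning [lo, hi].
  The last edge is set to hi directly to avoid floating-point drift."
  [lo hi n-bins]
  (let [width (/ (- hi lo) (double n-bins))]
    (conj (mapv #(+ lo (* % width)) (range n-bins))
          (double hi))))

(defn bin-index
  "Index of the half-open bin [edge_i, edge_i+1) that contains x.
  The last bin also includes its upper edge. Returns nil when x lies
  outside the edges."
  [edges x]
  (let [n-bins (dec (count edges))]
    (cond
      (or (< x (first edges)) (> x (last edges))) nil
      (== x (last edges)) (dec n-bins)
      :else (count (take-while #(<= % x) (rest edges))))))

(defn histogram
  "Counts values per bin. With :bins-vector the supplied edges are used
  as-is; otherwise :n-bins equal-width bins are computed from the data."
  [values {:keys [n-bins bins-vector]}]
  (let [edges  (if bins-vector
                 (vec bins-vector)
                 (equal-width-edges (apply min values)
                                    (apply max values)
                                    n-bins))
        counts (frequencies (keep #(bin-index edges %) values))]
    (mapv (fn [i]
            {:lower (edges i)
             :upper (edges (inc i))
             :count (get counts i 0)})
          (range (dec (count edges))))))

;; Automatic bins for exploration:
(histogram [1 2 2 3 9 10] {:n-bins 3})

;; Domain-driven bins, e.g. age brackets:
(histogram [5 17 18 30 64 65 90] {:bins-vector [0 18 35 65 120]})
```

In this sketch, values that fall outside a supplied :bins-vector are silently dropped; a real implementation would need to decide whether to drop them, report them, or add overflow bins.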

Using the form with just the :n-bins argument is very useful for data analysis and review, while being able to provide a :bins-vector addresses the use case where histogram use is informed by business domain needs (e.g., bin populations into age brackets that align with survey methodology).

andres-moreno • Jan 16 '21 22:01