Hanging Rootogram

Open kMutagene opened this issue 1 year ago • 0 comments

Description

Comparing the distribution of data with a theoretical distribution using an ordinary histogram can be difficult: small frequencies are dominated by larger ones, and the pattern of differences between the histogram bars and the curve is hard to perceive. To address this, John Tukey introduced the hanging rootogram (also called Tukey’s Hanging Rootogram) in 1971. The visualization ‘hangs’ the observed bars from the theoretical curve, so that discrepancies are read against the horizontal axis rather than against a sloping curve. As in the rootogram, the vertical axis is scaled to the square root of the frequencies, which draws attention to discrepancies in the tails of the distribution. [1]

Example
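
The geometry described above can be sketched in a few lines. This is a minimal, library-neutral illustration in Python, not the FSharp.Stats or Plotly.NET API: the function name `hanging_rootogram_bars` and the sample counts are hypothetical, and it assumes the observed and expected counts have already been binned identically. Each bar hangs from the square root of the expected count and extends downward by the square root of the observed count.

```python
import math


def hanging_rootogram_bars(observed, expected):
    """Return one (top, bottom) pair per bin for a hanging rootogram.

    top    = sqrt(expected count): the point on the theoretical curve
             the bar hangs from.
    bottom = top - sqrt(observed count): where the bar ends; a perfect
             fit puts it exactly on the horizontal axis (0).
    """
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same number of bins")
    bars = []
    for obs, exp in zip(observed, expected):
        if obs < 0 or exp < 0:
            raise ValueError("counts must be non-negative")
        top = math.sqrt(exp)
        bottom = top - math.sqrt(obs)
        bars.append((top, bottom))
    return bars


# Hypothetical counts for three bins (illustration only, not real data).
observed_counts = [4, 9, 16]
expected_counts = [4, 16, 9]

for top, bottom in hanging_rootogram_bars(observed_counts, expected_counts):
    print(f"bar from {top:.2f} down to {bottom:.2f}")
```

A bar that stops above zero marks a bin with fewer observations than expected, and a bar that crosses below zero marks a bin with more. In a plotting library, each pair would typically be drawn as a bar with base `bottom` and height `top - bottom`, with the `top` values joined by the reference curve.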

References

  • [1] https://datavizproject.com/data-type/rootogram/
  • [2] https://www.jstor.org/stable/2683341?origin=crossref
  • [3] https://www.infinityinsight.com/blog/?p=368

Pointers

  • You will need to combine functions from FSharp.Stats and Plotly.NET to achieve this.
  • This plays well with #13, because this plot needs a reference visualization of a distribution.
  • A variation of this plot, which can be dubbed a 'chandelier plot' [3], replaces the bars with a lollipop plot, which is the focus of https://github.com/plotly/Plotly.NET/issues/417

kMutagene, Sep 22 '23 15:09