haddock3
Presenting cluster information in tables and plots
The dataframes used for creating tables, scatter plots, and box plots have three columns: `cluster-id`, `cluster-ranking`, and `capri_rank`. Here are two examples where the dataframes contain `Unclustered` and `Other` groups:
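| | Cluster-id | capri_rank | cluster-ranking |
|---|---|---|---|
| 0 | - | 1 | - |
| 1 | - | 1 | - |
| 2 | - | 1 | - |
| 3 | - | 1 | - |
| 4 | - | 1 | - |

| | Cluster-id | capri_rank | cluster-ranking |
|---|---|---|---|
| 125 | Other | 11 | 11 |
| 129 | Other | 11 | 11 |
| 131 | Other | 11 | 11 |
| 92 | Other | 11 | 13 |
| 108 | Other | 11 | 13 |
| 109 | Other | 11 | 13 |
| 119 | Other | 11 | 13 |
| 111 | Other | 11 | 14 |
| 130 | Other | 11 | 14 |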
These groups are not represented consistently across plots and tables. For example, a cluster with `Cluster-id = "-"` is labelled "Unclustered" in tables and scatter plots, whereas in box plots it is labelled "-" and appears at `capri_rank=1` on the x-axis.

Similarly, clusters with `Cluster-id="Other"` are labelled "Other" in the legends of scatter plots and box plots, whereas tables show them with `cluster-ranking=11, 13, 14` and the box plot x-axis shows them at `capri_rank=11`.
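One possible way to remove the inconsistency is to derive a single display label from `Cluster-id` and reuse it in every table and plot. The sketch below is only an illustration of that idea: `display_label`, `UNCLUSTERED_ID`, and `OTHER_ID` are hypothetical names, not existing haddock3 functions, and the "Cluster N" format for numbered clusters is an assumption.

```python
# Hypothetical helper, not part of haddock3: derive one display label per row
# so that tables, scatter plots, and box plots can all use the same text.

UNCLUSTERED_ID = "-"   # Cluster-id value of unclustered models (from the example)
OTHER_ID = "Other"     # Cluster-id value of the pooled "Other" group


def display_label(cluster_id):
    """Return the label to show for a Cluster-id in every table and plot."""
    if cluster_id == UNCLUSTERED_ID:
        return "Unclustered"
    if cluster_id == OTHER_ID:
        return "Other"
    # Assumed format for regular, numbered clusters.
    return f"Cluster {cluster_id}"


# A few rows copied from the two example dataframes in this issue.
rows = [
    {"Cluster-id": "-", "capri_rank": 1, "cluster-ranking": "-"},
    {"Cluster-id": "Other", "capri_rank": 11, "cluster-ranking": 11},
    {"Cluster-id": "Other", "capri_rank": 11, "cluster-ranking": 13},
]

# The same label list could feed the table column, the legend, and the x-axis.
labels = [display_label(row["Cluster-id"]) for row in rows]
print(labels)
```

With a shared label like this, the box plot x-axis would no longer need to fall back to `capri_rank`, and tables would no longer need to fall back to `cluster-ranking`, for the `Unclustered` and `Other` groups.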
See more: